Natural products in drug discovery: advances and opportunities
Atanas G. Atanasov, Sergey B. Zotchev, Verena M. Dirsch et al.
Research Article — Peer-Reviewed Source
Original research published by Atanasov et al. in Nature Reviews Drug Discovery. Redistributed under Open Access — see publisher for license terms. MedTech Research Group provides these references for informational purposes. We do not conduct original research. All studies are the work of their respective authors and institutions.
Natural products and their structural analogues have historically made a major contribution to pharmacotherapy, especially for cancer and infectious diseases. Nevertheless, natural products also present challenges for drug discovery, such as technical barriers to screening, isolation, characterization and optimization, which contributed to a decline in their pursuit by the pharmaceutical industry from the 1990s onwards. In recent years, several technological and scientific developments - including improved analytical tools, genome mining and engineering strategies, and microbial culturing advances - are addressing such challenges and opening up new opportunities. Consequently, interest in natural products as drug leads is being revitalized, particularly for tackling antimicrobial resistance. Here, we summarize recent technological developments that are enabling natural product-based drug discovery, highlight selected applications and discuss key opportunities.
Abstract
Natural products and their structural analogues have historically made a major contribution to pharmacotherapy, especially for cancer and infectious diseases. Nevertheless, natural products also present challenges for drug discovery, such as technical barriers to screening, isolation, characterization and optimization, which contributed to a decline in their pursuit by the pharmaceutical industry from the 1990s onwards. In recent years, several technological and scientific developments — including improved analytical tools, genome mining and engineering strategies, and microbial culturing advances — are addressing such challenges and opening up new opportunities. Consequently, interest in natural products as drug leads is being revitalized, particularly for tackling antimicrobial resistance. Here, we summarize recent technological developments that are enabling natural product-based drug discovery, highlight selected applications and discuss key opportunities.
Introduction
Historically, natural products (NPs) have played a key role in drug discovery, especially for cancer and infectious diseases 1 , 2 , but also in other therapeutic areas, including cardiovascular diseases (for example, statins) and multiple sclerosis (for example, fingolimod) 3 – 5 . NPs offer special features in comparison with conventional synthetic molecules, which confer both advantages and challenges for the drug discovery process. NPs are characterized by enormous scaffold diversity and structural complexity. They typically have a higher molecular mass, a larger number of sp 3 carbon atoms and oxygen atoms but fewer nitrogen and halogen atoms, higher numbers of H-bond acceptors and donors, lower calculated octanol–water partition coefficients (cLogP values, indicating higher hydrophilicity) and greater molecular rigidity compared with synthetic compound libraries 1 , 6 – 9 . These differences can be advantageous; for example, the higher rigidity of NPs can be valuable in drug discovery tackling protein–protein interactions 10 . Indeed, NPs are a major source of oral drugs ‘beyond Lipinski’s rule of five ’ 11 . The increasing significance of drugs not conforming to this rule is illustrated by the increase in molecular mass of approved oral drugs over the past 20 years 12 . NPs are structurally ‘optimized’ by evolution to serve particular biological functions 1 , including the regulation of endogenous defence mechanisms and the interaction (often competition) with other organisms, which explains their high relevance for infectious diseases and cancer. Furthermore, their use in traditional medicine may provide insights regarding efficacy and safety. Overall, the NP pool is enriched with ‘bioactive’ compounds covering a wider area of chemical space compared with typical synthetic small-molecule libraries 13 . Despite these advantages and multiple successful drug discovery examples, several drawbacks of NPs have led pharmaceutical companies to reduce NP-based drug discovery programmes. NP screens typically involve a library of extracts from natural sources (Fig. 1 ), which may not be compatible with traditional target-based assays 14 . Identifying the bioactive compounds of interest can be challenging, and dereplication tools have to be applied to avoid rediscovery of known compounds. Accessing sufficient biological material to isolate and characterize a bioactive NP may also be challenging 15 . Furthermore, gaining intellectual property (IP) rights for (unmodified) NPs exhibiting relevant bioactivities can be a hurdle, since naturally occurring compounds in their original form may not always be patented (legal frameworks vary between countries and are evolving) 16 , although simple derivatives can be patent-protected (Box 1 ). An additional layer of complexity relates to the regulations defining the need for benefit sharing with countries of origin of the biological material, framed in the United Nations 1992 Convention on Biological Diversity and the Nagoya Protocol, which entered into force in 2014 (ref. 17 ), as well as recent developments concerning benefit sharing linked to use of marine genetic resources 18 . Fig. 1 Outline of traditional bioactivity-guided isolation steps in natural product drug discovery. Steps in the process are shown in purple boxes, with associated key limitations shown in red boxes and advances that are helping to address these limitations in modern natural product (NP)-based drug discovery shown in green boxes. The process begins with extraction of NPs from organisms such as bacteria. The choice of extraction method determines which compound classes will be present in the extract (for example, the use of more polar solvents will result in a higher abundance of polar compounds in the crude extract). To maximize the diversity of the extracted NPs, the biological material can be subjected to extraction with several solvents of different polarity. Following the identification of a crude extract with promising pharmacological activity, the next step is its (often multiple) consecutive bioactivity-guided fractionation until the pure bioactive compounds are isolated. A key limitation for the potential of this approach to identify novel NPs is that many potential source organisms cannot be cultured or stop producing relevant NPs when taken out of their natural habitat. These limitations are being addressed through development of new methods for culturing, for in situ analysis, for NP synthesis induction and for heterologous expression of biosynthetic genes. At the crude extract step, challenges include the presence in the extracts of NPs that are already known, NPs that do not have drug-like properties or insufficient amounts of NPs for characterization. These challenges can be addressed through the development of methods for dereplication, extraction and pre-fractionation of extracts. Finally, at the last stage, when bioactive compounds are identified by phenotypic assays, significan
Application of analytical techniques
Classical NP-based drug research starts with biological screening of ‘crude’ extracts to identify a bioactive ‘hit’ extract, which is further fractionated to isolate the active NPs. Bioactivity-guided isolation is a laborious process with a number of limitations, but various strategies and technologies can be used to address some of them (Fig. 2 ). For example, to create libraries that are compatible with high-throughput screening, crude extracts can be pre-fractionated into sub-fractions that are more suitable for automated liquid handling systems. In addition, fractionation methods can be adjusted so that sub-fractions preferentially contain compounds with drug-like properties (typically moderate hydrophilicity). Such approaches can increase the number of hits compared with using crude extracts, as well as enabling more efficient follow-up of promising hits 24 . Fig. 2 Applications of advanced analytical technologies empowering modern natural product-based drug discovery. a | An illustrative example of the application of liquid chromatography–high-resolution mass spectrometry (LC–HRMS) metabolomics in the screening of natural product (NP) extracts is the work of Kurita et al. 58 , in which 234 bacterial extracts were subjected to image-based phenotypic bioactivity screening and LC–HRMS metabolomics. Clustering of the resulting data allowed prioritization of promising extracts for further analysis, resulting in the discovery of the new NPs, quinocinnolinomycins A–D. b | Another illustrative example of LC–HRMS screening of NP extracts is the work of Clevenger et al. 85 , who obtained novel NP extracts through heterologous expression of fungal artificial chromosomes (FACs) containing uncharacterized biosynthetic gene clusters (BGCs) from diverse fungal species in Aspergillus nidulans . Analysis of the LC–HRMS metabolomics data with a FAC-Score algorithm directed the simultaneous discovery of 15 new NPs and the characterization of their BGCs. Metabolomics was developed as an approach to simultaneously analyse multiple metabolites in biological samples. Enabled by technological developments in chromatography and spectrometry, metabolomics was historically applied first in other research fields, such as biomedical and agricultural sciences 2 . Advances in the analytical instrumentation used in NP research 25 , 26 , coupled with computational approaches that can generate plausible NP analogue structures and their respective simulated spectra 27 , have also enabled application of ‘omics’ approaches such as metabolomics in NP-based drug discovery. Metabolomics can provide accurate information on the metabolite composition in NP extracts, thus helping to prioritize NPs for isolation, to accelerate dereplication 28 , 29 and to annotate unknown analogues and new NP scaffolds. Moreover, metabolomics can detect differences between metabolite compositions in various physiological states of producing organisms and enable the generation of hypotheses to explain them, and can also provide extensive metabolite profiles to underpin phenotypic characterization at the molecular level 30 . Both options are very useful in understanding the molecular mechanisms of action of NPs. For metabolite profiling, NP extracts are analysed by NMR spectroscopy or high-resolution mass spectrometry (HRMS), or respective combined methods involving upstream liquid chromatography (LC) 31 , 32 , such as LC–HRMS, which can separate numerous isomers present in NP extracts 33 . Moreover, such combined methods might integrate HRMS and NMR, allowing the simultaneous use of the advantages of both techiques 34 , 35 . NMR analysis of NP extracts is simple and reproducible, and provides direct quantitative information and detailed structural information, although it has relatively low sensitivity, meaning that it generally enables profiling only of major constituents 33 . The applications of NMR in NP research are versatile 36 and the technique is used both directly for metabolomics of unfractionated NP extracts and for structural characterization of compounds and fractions obtained with appropriate separation methods, most often LC. HRMS is the gold standard for qualitative and quantitative metabolite profiling 33 and is most commonly applied in combination with LC. HRMS can also be used in the direct infusion mode (called DIMS) 37 , whereby samples are directly profiled by MS without a chromatography step, or in MS imaging (MSI) 38 , which enables determination of the spatial distribution of NPs within living organisms. HRMS enables routine acquisition of accurate molecular mass information, which together with appropriate heuristic filtering can provide unambiguous assignment of molecular formulae for hundreds to thousands of metabolites within a single extract over a dynamic range that may exceed five orders of magnitude 31 , 39 . However, challenges remain in data mining and in the unambiguous identification of the metabolites using various workflows
Genome mining and engineering
Advances in knowledge on biosynthetic pathways for NPs and in developing tools for analysing and manipulating genomes are further key drivers for modern NP-based drug discovery. Two key characteristics enable the identification of biosynthetic genes in the genomes of the producing organisms. First, these genes are clustered in the genomes of bacteria and filamentous fungi. Second, many NPs are based on polyketide or peptide cores, and their biosynthetic pathways involve enzymes — polyketide synthases (PKSs) and nonribosomal peptide synthetases (NRPSs), respectively — that are encoded by large genes with highly conserved modules 68 . ‘Genome mining’ is based on searches for genes that are likely to govern biosynthesis of scaffold structures, and can be used to identify NP biosynthetic gene clusters 69 – 71 . Prioritization of gene clusters for further work is facilitated by advances in biosynthetic knowledge and predictive bioinformatics tools, which can provide hints about whether the metabolic products of the clusters have chemical scaffolds that are new or known, thereby supporting dereplication 72 , 73 . Such predictive tools for gene cluster analysis can be applied in combination with spectroscopic techniques to accelerate the identification of NPs 65 and determine the stereochemistry of metabolic products 66 . Furthermore, to extend genome mining from a single genome to entire genera, microbiomes or strain collections, computational tools have been developed, such as BiG-SCAPE, which enables sequence similarity analysis of biosynthetic gene clusters, and CORASON, which uses a phylogenomic approach to elucidate evolutionary relationships between gene clusters 74 . Phylogenetic studies of known groups of talented secondary metabolite producers can also empower discovery of novel NPs. Recently, a study comparing secondary metabolite profiles and phylogenetic data in myxobacteria demonstrated a correlation between the taxonomic distance and the production of distinct secondary metabolite families 75 . In filamentous fungi, it was likewise shown that secondary metabolite profiles are closely correlated with their phylogeny 76 . These organisms are rich in secondary metabolites, as demonstrated by LC–MS studies of their extracts under laboratory conditions 77 . Concurrent genomic and phylogenomic analyses implied that even the genomes of well-studied organism groups harbour many gene clusters for secondary metabolite biosynthesis with as yet unknown functions 78 . The phylogeny of biosynthetic gene clusters, together with analysis of the absence of known resistance determinants, was recently used to prioritize members of the glycopeptide antibiotic family that could have novel activities. This led to the identification of the known antibiotic complestatin and the newly discovered corbomycin as compounds that act through a previously uncharacterized mechanism involving inhibition of peptidoglycan remodelling 79 . Many microorganisms cannot be cultured, or tools for their genetic manipulation are not sufficiently developed, which makes it more challenging to access their NP-producing potential. However, biosynthetic gene clusters for NPs can be cloned and heterologously expressed in organisms that are well-characterized and easier to culture and to genetically manipulate (such as Streptomyces coelicolor , Escherichia coli and Saccharomyces cerevisiae ) 80 . The aim is to achieve higher production titres in the heterologous hosts than in wild-type strains, improving the availability of lead compounds 80 – 82 . Vectors that can carry large DNA inserts are needed for the cloning of complete NP biosynthetic gene clusters. Cosmids (which can have inserts of 30–40 kb), fosmids (which can harbour 40–50 kb) and bacterial artificial chromosomes (BACs; which can have inserts of 100 kb to >300 kb) have been developed 83 . For fungal gene clusters, self-replicating fungal artificial chromosomes (FACs) have been developed, which can have inserts of >100 kb (ref. 84 ). FACs in combination with metabolomic scoring were used to develop a scalable platform, FAC-MS, allowing the characterization of fungal biosynthetic gene clusters and their respective NPs at unprecedented scale 85 . The application of FAC-MS for the screening of 56 biosynthetic gene clusters from different fungal species yielded the discovery of 15 new metabolites, including a new macrolactone, valactamide A 85 (Fig. 2b ). Even in culturable microorganisms, many biosynthetic gene clusters may not be expressed under conventional culture conditions, and these silent clusters could represent a large untapped source of NPs with drug-like properties 86 . Several approaches can be pursued to identify such NPs. One approach is sequencing, bioinformatic analysis and heterologous expression of silent biosynthetic gene clusters, which has already led to the discovery of several new NP scaffolds from cultivable strains 87 . Direct cloning and heterologous expressio
Advances in microbial culturing systems
The complex regulation of NP biosynthesis in response to the environment means that the conditions under which producing organisms are cultivated can have a major impact on the chance of identifying novel NPs 87 . Several strategies have been developed to improve the likelihood of identifying novel NPs compared with monoculture under standard laboratory conditions and to make ‘uncultured’ microorganisms grow in a simulated natural environment 117 (Fig. 4 ). Fig. 4 Application of advanced microbial culturing approaches to identify new natural products. New strategies for isolating previously uncultured microorganisms can enable access to new natural products (NPs) produced by them. a | To recapitulate the effect of complex signals coming from the native environment, microorganisms can be cultivated directly in the environment from which they were isolated. This concept is used with the iChip platform, in which diluted environmental samples are seeded in multiple small chambers separated from the native environment with a semipermeable membrane. The potential of this approach is illustrated by the recent discovery of teixobactin, a new antibiotic with activity against Gram-positive bacteria 134 , 135 . b | Another important recent development involves obtaining information from environmental samples using omics techniques such as metagenomics to identify and partially characterize microorganisms present in a specific environment before culturing. An approach relying on such preliminary information was recently used to engineer the capture of antibodies based on genetic information, which resulted in the successful cultivation of previously uncultured bacteria from the human mouth 145 . This reverse genomics workflow was validated by the isolation and cultivation of three species of Saccharibacteria (TM7) along with their interacting Actinobacteria hosts, as well as SR1 bacteria that are members of a candidate phylum with no previously cultured representatives. One well-established approach to promote the identification of novel NPs is the modulation of culture conditions such as temperature, pH and nutrient sources. This strategy may lead to activation of silent gene clusters, thereby promoting production of different NPs. The term ‘One Strain Many Compounds’ (OSMAC) was coined for this approach about 20 years ago 118 , but the concept has a longer history 119 , with its use being routine in industrial microbiology since the 1960s 120 . While OSMAC is still widely used for the identification of new bioactive compounds 121 , 122 , this approach has limited capacity to mimic the complexities of natural habitats. It is difficult to predict the combination of cues (which might also involve metabolites secreted by other members of the microbial community) to which the microorganism has evolved to respond by switching metabolic programmes. To account for such kinds of interactions, co-culturing using ‘helper’ strains can be applied 123 . This can enable the production and identification of new NPs, as illustrated by recent studies in which particular fungi were co-cultured with Streptomcyes species 124 , 125 . Study of the molecular mechanisms underlying the ability of helper strains to increase the cultivability of previously uncultured microbes can lead to the identification of specific growth factors, allowing expansion of the number of species that can be successfully cultured. This strategy was used by D’Onofrio et al. for the identification of new acyl-desferrioxamine siderophores (iron-chelating compounds) as growth factors produced by helper strains promoting the growth of previously uncultured isolates from marine sediment biofilm 117 , 126 . The siderophore-assisted growth is based on the property of these compounds to provide iron for microbes unable to autonomously produce siderophores themselves, and the application of this approach led to the isolation of previously uncultivated microorganisms 126 . The development of strategies to cultivate microbial symbionts that produce NPs only upon interaction with their hosts can promote access to new NPs. Microbial symbionts interacting with insects or other organisms are a highly promising reservoir for the discovery of novel bioactive NPs produced in a unique ecological context 127 – 130 . To stimulate NP production, culturing strategies can be developed that better mimic the native environment of microbial symbionts of insects, including the use of media containing either lyophilized dead insects 131 or l -proline, a major constituent of insect haemolymph 132 . Strategies to mimic the natural environment even more closely by harnessing in situ incubation in the environment from which the microorganism is sampled have been developed, dating back to more than 20 years ago with the biotech companies OneCell and Diversa. They developed platforms that allowed the growth of some previously uncultivated microbes from various environments based on diluting out and
Outlook for NPs in drug discovery
The technological advances discussed above have the potential to reinvigorate NP-based drug discovery in both established and emerging areas. NPs have long been the key source of new drugs against infectious diseases, especially antibiotics (reviewed elsewhere 151 , 152 ). Selected NPs with antimicrobial properties discovered by leveraging advances discussed in the sections above, including strategies to exploit the human microbiome for novel NPs 94 , 153 are highlighted in Figs 3 , 4 . Along with the search for new NPs with antimicrobial activities, researchers are continuing to develop and optimize already known NP classes, making use of advances in biosynthetic engineering 154 , total synthesis 155 or semi-synthetic strategies 156 , 157 . In addition, antivirulence strategies could represent an alternative approach to fighting infections 158 , for which NPs targeting bacterial quorum sensing could be of interest 159 . NPs also have a successful history as cancer therapeutics, which has been well covered in other reviews 160 – 163 . An important new opportunity in this field is the capacity of some NPs to trigger a selective yet potent host immune reaction against cancer cells, particularly given the intense interest at present in strategies that could improve response rates to immune checkpoint inhibitors by turning ‘cold’ tumours ‘hot’ 164 . For example, NPs such as cardiac glycosides 165 can increase the immunogenicity of stressed and dying cancer cells by triggering immunogenic cell death, characterized by the release of damage-associated molecular patterns (DAMPs), which could open new avenues for drug discovery or repurposing 166 – 168 . Botanical therapies containing complex mixtures of NPs have long attracted interest owing to the potential for synergistic therapeutic effects of components within the mixture 169 , 170 . However, the variability of the NP composition in the starting plant material owing to factors such as environmental variations in the location at which the plants were collected is a major challenge for the development of botanical drugs 1 . With the advances in technology for their characterization, such as metabolomics discussed above, as well as development of regulatory guidance for complex mixtures of NPs ( see Related links ), it is becoming more feasible to develop such mixtures as therapeutics, rather than to identify and purify a single active ingredient 171 . Since gut microbiota are considered to play a major role in health and disease 172 – 174 , and NPs are known to affect the gut microbiome composition 175 – 178 , this area is an emerging opportunity for NP-based drug discovery. However, drug discovery efforts in this area are still in their infancy, with many open questions remaining 179 . A future direction may be the characterization of single microbiota-derived species for particular therapeutic applications, and the advances in culturing strategies, genome mining and analytics discussed above will be of great importance in this respect. Many advances discussed above are supported by computational tools including databases (such as genomic, chemical or spectral analysis data; see ref. 180 for a recent review on NP databases) and tools that enable the analysis of genetic information, the prediction of chemical structures and pharmacological activities 181 , the integration of data sets with diverse information (such as tools for multi-omics analysis) 182 and machine learning applications 183 . Although this Review focuses on technologies that enable the discovery of novel NPs, it is important to acknowledge that unmodified NPs may possess suboptimal efficacy or absorption, distribution, metabolism, excretion and toxicity (ADMET) properties. So, for development of NP hits into leads and ultimately into successful drugs, chemical modification may be required. In addition, bringing a compound into clinical development requires a sustainable and economically viable supply of sufficient quantities of the compound. Total chemical synthesis, semi-synthesis using a NP as a starting point for analogue generation and biosynthetic engineering modifying biosynthetic pathways of the producing organism will be of great importance in this context (Fig. 5 ). Recent advances in chemical synthesis and biosynthetic engineering technologies are strongly empowering NP-based drug discovery and development by enabling property optimization of complex NP scaffolds that were previously regarded as inaccessible. This allows the enrichment of screening libraries with NPs, NP hybrids, NP analogues and NP-inspired molecules, as well as superior structure functionalization approaches (including late-stage functionalization) for optimization of NP leads 94 , 105 – 108 , 184 – 188 . Fig. 5 Strategies to obtain natural product analogues with superior properties. Unmodified natural products (NPs) often possess suboptimal properties, and superior analogues need to be obtained in order to yield valu
| DOI | 10.1038/s41573-020-00114-z |
| PubMed ID | 33510482 |
| PMC ID | PMC7841765 |
| Journal | Nature Reviews Drug Discovery |
| Year | 2021 |
| Authors | Atanas G. Atanasov, Sergey B. Zotchev, Verena M. Dirsch, the International Natural Product Sciences Taskforce, İlkay Erdoğan Orhan, Maciej Banach, Judith M. Rollinger, Davide Barreca, Wolfram Weckwerth, Rudolf Bauer, Edward A. Bayer, Shaheen Majeed, Anupam Bishayee, Valery N. Bochkov, Günther K. Bonn, Nady Braidy, Franz Bučar, Alejandro Cifuentes, Grazia D’Onofrio, Michael J. Bodkin, Marc Diederich, Albena T. Dinkova‐Kostova, Thomas Efferth, Khalid El Bairi, Nicolas Arkells, Tai-Ping Fan, Bernd L. Fiebich, Michael Freissmuth, Milen I. Georgiev, Simon Gibbons, Keith M. Godfrey, Christian W. Gruber, Jag Heer, Lukas A. Huber, Elena Ibáñez, Anake Kijjoa, Anna K. Kiss, Aiping Lü, Francisco A. Macı́as, Mark J.S. Miller, Andrei Mocan, Rolf Müller, Ferdinando Nicoletti, George Perry, Valeria Pittalà, Luca Rastrelli, Michael Ristow, Gian Luigi Russo, A. Sanches‐Silva, Daniela Schuster, Helen Sheridan, Krystyna Skalicka‐Woźniak, Léandros Skaltsounis, Eduardo Sobarzo‐Sánchez, David S. Bredt, Hermann Stuppner, Antoni Sureda, Nikolay Tzvetkov, Rosa Anna Vacca, Bharat B. Aggarwal, Maurizio Battino, Francesca Giampieri, Michaël Wink, Jean-Luc Wolfender, Jianbo Xiao, Andy Wai Kan Yeung, Gérard Lizard, Michael Popp, Michael Heinrich, Ioana Ilie, Marc Stadler, Maria Daglia, Robert Verpoorte, Claudiu T. Supuran |
| License | Open Access — see publisher for license terms |
| Citations | 4,763 |