G protein-coupled receptors: structure- and function-based drug discovery
Dehua Yang, Qingtong Zhou, Viktorija Labroska et al.
Research Article — Peer-Reviewed Source
Original research published by Yang et al. in Signal Transduction and Targeted Therapy. Redistributed under Open Access — see publisher for license terms. MedTech Research Group provides these references for informational purposes. We do not conduct original research. All studies are the work of their respective authors and institutions.
As one of the most successful therapeutic target families, G protein-coupled receptors (GPCRs) have experienced a transformation from random ligand screening to knowledge-driven drug design. We are eye-witnessing tremendous progresses made recently in the understanding of their structure-function relationships that facilitated drug development at an unprecedented pace. This article intends to provide a comprehensive overview of this important field to a broader readership that shares some common interests in drug discovery.
Abstract
As one of the most successful therapeutic target families, G protein-coupled receptors (GPCRs) have experienced a transformation from random ligand screening to knowledge-driven drug design. We are eye-witnessing tremendous progresses made recently in the understanding of their structure–function relationships that facilitated drug development at an unprecedented pace. This article intends to provide a comprehensive overview of this important field to a broader readership that shares some common interests in drug discovery.
Introduction
G protein-coupled receptors (GPCRs) represent the largest protein family encoded by the human genome. Located on the cell membrane, they transduce extracellular signals into key physiological effects. 1 Their endogenous ligands include odors, hormones, neurotransmitters, chemokines, etc., varying from photons, amines, carbohydrates, lipids, peptides to proteins. GPCRs have been implicated in a large number of diseases, such as type 2 diabetes mellitus (T2DM), obesity, depression, cancer, Alzheimer’s disease, and many others. 2 Activated by external signals through coupling to different G proteins or arrestins, GPCRs elicit cyclic adenosine 3,5-monophosphate (cAMP) response, calcium mobilization, or phosphorylation of extracellular regulated protein kinases 1/2 (pERK1/2). 3 The seven-transmembrane protein property endows them easy to access, while the diversified downstream signaling pathways make them attractive for drug development. 4 The human GPCR family is divided into classes A (rhodopsin), B (secretin and adhesion), C (glutamate), and F (Frizzled) subfamilies according to their amino acid sequences (Fig. 1 ). Of the 826 human GPCRs, approximately 350 non-olfactory members are regarded as druggable and 165 of them are validated drug targets (Fig. 1 and Table S1 ). 4 – 6 Latest statistical data indicate that 527 Food and Drug Administration (FDA)-approved drugs 4 and ∼60 drug candidates currently in clinical trials target GPCRs (Table S1 ). 5 Fig. 1 Phylogenetic tree of GPCRs as drug targets. Node represents GPCR named according to its gene name. Receptors with approved drugs on the market are highlighted by color. GPCRs are organized according to GPCR database. 4 Approved drug list was derived from previous publications, 4 , 11 complemented by additional search of newly approved entities at Drugs@FDA ( accessdata.fda.gov ) until June 2020. See Table S2 for details Started with crystal structure determination and accelerated by cryo-electron microscopy (cryo-EM) technology, three-dimensional (3D) structural studies on a variety of GPCRs in complex with ligands, G proteins/arrestins, or both 7 – 10 (involving 455 structures from 82 different receptors) significantly deepened our knowledge of molecular mechanisms of signal transduction. Novel insights into ligand recognition and receptor activation are gained from inactive, transitional, active, and apo states, thereby offering new opportunities for structure-based drug design (SBDD). 11 Pharmacological parameters such as cAMP accumulation, calcium flux, ERK phosphorylation, arrestin recruitment, and G protein interaction, 12 , 13 are commonly used to evaluate ligand action and biased signaling. Ligand-binding kinetics and signaling timing render another dimension for interpreting signal bias profiles and link in vitro bioactivities with in vivo effects. 14 In this process, a series of biased and allosteric modulators were discovered by rational design, ligand screening, and pharmacological assessment leading to the identification of novel binding sites or action modes. 15 , 16 Apart from crystallography and cryo-EM, the striking advancement in GPCR biology is also attributable to the deployment of powerful technologies such as nuclear magnetic resonance (NMR), hydrogen–deuterium exchange (HDX), fluorescence resonance energy transfer, bioluminescence resonance energy transfer, surface plasmon resonance, single molecule fluorescence, CRISPR/Cas9, artificial intelligence, etc. This review systematically summarizes the latest information on this important drug target family to cover both basic and translational sciences in the context of drug discovery and development.
GPCR as drug target
Class A Class A GPCRs, the so called “rhodopsin-like family” consisting of 719 members, are divided into several subgroups: aminergic, peptide, protein, lipid, melatonin, nucleotide, steroid, alicarboxylic acid, sensory, and orphan. 17 They have a conventional transmembrane domain (TMD) that forms ligand-binding pocket and additional eight helices with a palmitoylated cysteine at the C terminal. 18 , 19 Given the wide range of their physiological functions, this class of receptors is the most targeted therapeutically among all other classes. By manually curating Drugs@FDA original New Drug Application (NDA) and Biologic License Application (BLA) database (data extracted from August 2017 to June 2020) and cross-referencing with Drugbank, 20 IUPHAR and ChemBL databases, we were able to find the approved drugs associated with this class. Over 500 GPCR drugs target class A and many of them act at >1 receptor: 75% are made against aminergic receptors and 10% for peptidic ligand receptors with indications ranging from analgesics, allergies, cardiovascular diseases, hypertension, pulmonary diseases, depression, migraine, glaucoma, Parkinson’s disease to schizophrenia, cancer-related fatigue, etc. Approximately 500 novel drug candidates are in clinical trials. Of them, 134 are for peptide-activated GPCRs, while small molecules still occupy the majority. It is noted that 6% of class A members are sensory and alicarboxylic acid receptors that have broad untapped therapeutic potentials (Table S1 ). Chemokine, prostanoid and melanocortin receptors constitute >8% clinical trial targets in this class. In the past 3 years, about 20 NDAs were approved targeting mostly peptide and aminergic receptors (Table 1 ). Siponimod and ozanimod provide alternatives to fingolimod (approved in 2010) for treating relapsing forms of multiple sclerosis by modulating sphingosine-1-phosphate receptor. Two radiolabeled ligands, gallium 68 dotatoc and lutetium 177 dotatate, have been approved for neuroendocrine tumor and pancreatic gastrointestinal cancer diagnosis, respectively. Pitolisant, a selective inverse agonist of histamine receptor, is used to treat narcolepsy-related daytime sleepiness, while lemborexant, an orexin receptor antagonist, is used for insomnia management. Gilteritinib (ASP2215) is a small molecule inhibitor of tyrosine kinase. However, it also antagonizes serotonin receptors without any reported pharmacological consequences. Revefenacin is a long-acting antagonist of muscarinic acetylcholine receptors (mAChRs) indicated for chronic obstructive pulmonary disease. Amisulpride, trialed for antiemetic and schizophrenia, was finally approved for antiemetic in 2020. This molecule is acting as an antagonist against dopamine and serotonin receptors. Fosnetupitant, a prodrug of netupitant, was approved for chemotherapy-induced nausea and vomiting. Cysteamine treats radiation sickness via modifying action of neuropeptide Y receptor. Cannabidiol is one the active constituents of the Cannabis plant and was trialed for schizophrenia, graft versus host disease, and anticonvulsant. It was eventually approved in 2018 for the treatment of severe forms of epilepsy—Lennox–Gastaut syndrome and Dravet syndrome. Meanwhile, fostamatinib, indicated for chronic immune thrombocytopenia, targets >300 receptors and enzymes, including adenosine receptor A3. Table 1 Newly approved drugs targeting class A GPCRs in the past 3 years Drug Brand name (manufacturer) Indication Target GPCR class FDA approval date Gilteritinib Xospata (Astellas) Relapsed or refractory acute myeloid leukemia Serotonin receptors A, aminergic, 5-hydroxytryptamine 11/28/2018 Lasmiditan Reyvow (Eli Lilly) Migraine HTR1F A, aminergic, 5-hydroxytryptamine 10/11/2019 Revefenacin Yupelri (Mylan Ireland) Chronic obstructive pulmonary disease CHRM1-CHRM5 A, aminergic, acethylcholine 11/09/2018 Lumateperone Caplyta (Intra-Cellular) Schizophrenia HTR2A, DRD1, DRD2 A, aminergic, dopamine, 5-hydroxytryptamine 12/20/2019 Amisulpride Barhemsys (Acacia Pharma) Surgery-induced nausea and vomiting prevention DRD2, DRD3, HTR7, HTR2A A, aminergic, dopamine, 5-hydroxytryptamine 02/26/2020 Pitolisant Wakix (Harmony) Narcolepsy excessive daytime sleepiness HRH3 A, aminergic, histamine 08/14/2019 Angiotensin II Giapreza (La Jolla Pharma) Septic vasoconstrictor for adults AGTR1 A, peptide, angiotensin 12/21/2017 Macimorelin Macrilen (Novo Nordisk) Diagnosis of adult growth hormone deficiency GHSR A, peptide, ghrelin 12/20/2017 Elagolix Orilissa (Abbvie) Endometriosis-associated moderate-to-severe pain GNRHR A, peptide, gonadotropin 07/23/2018 Cysteamine Procysbi (Horizon Pharma) Radiation sickness NPY2R A, peptide, neuropeptide Y 02/14/2020 Lemborexant Dayvigo (Eisai) Insomnia HCRTR1 A, peptide, orexin 12/20/2019 Gallium 68 dotatoc NA (UIHC-PET Imaging Center) Diagnostic agent for neuroendocrine tumors SSTR2 A, peptide, somatostatin 08/21/2019 Lutetium 177 dotatate Lutathera (AAA USA) Gastro
Class B
This class of GPCRs is divided into two subfamilies: secretin (B1) and adhesion (B2), containing 15 and 33 members, respectively. 4 , 21 Secretin subfamily members are characteristic of large extracellular domains (ECDs) and bind to vasoactive intestinal peptide (VIP), pituitary adenylate cyclase-activating peptide (PACAP), corticotropin-releasing factor (CRF), parathyroid peptide hormone (PTH), growth hormone-releasing hormone (GHRH), calcitonin gene-related peptide (CGRP), glucagon, and glucagon-like peptides (GLPs), respectively. Adhesion subfamily has nine subgroups, possessing unique N-terminal motifs, such as epidermal growth factor, cadherin, and immunoglobulin domains. They are distinguished from other GPCRs due to their roles in cell adhesion and migration. 22 , 23 Apart from the long N-terminal domain, other unique features of the B2 subfamily are the GPCR autoproteolysis-inducing domain and the proteolysis site that are responsible for signaling activation through a Stachel sequence (a tethered agonist) and producing N-terminal fragment (NTF) and C-terminal fragment. The hallmarks of the B2 GPCR subfamily are a two-step activation model, the ligand–NTF interaction and the Stachel signaling/basal activity. Adhesion receptors can also signal independently of fragment dissociation and this has complicated pharmacological consequences. 22 , 24 , 25 In this class, receptors of glucagon family peptides, followed by CGRP, PTH, GHRH, CRF, VIP, and PACAP, constitute major targets for therapeutic intervention (Table S1 ) of various diseases, including obesity, T2DM, osteoporosis, migraine, depression, and anxiety. 26 , 27 To date, multiple GLP-1 receptor (GLP-1R) agonists have been developed by a combination of selective amino acid substitutions, enzymatic cleavage blockade, and conjugation to entities that increase binding to plasma proteins. These methods not only slow down fast renal clearance of the peptides but also extend their half-lives. Dose-dependent side effects such as nausea and gastrointestinal adverse events are the main drawbacks that are becoming more of a compliant with dose scaling. 28 , 29 For instance, one newly approved GLP-1R agonist, semaglutide, has a noticeable half-life of 168 h thereby allowing weekly subcutaneous administration, while oral semaglutide (approved in 2019) formulated using absorption enhancer shows a similar half-life but is dosed daily with reported side effects (Table 2 ). 30 , 31 Table 2 Newly approved drugs targeting class B GPCRs in the past 3 years Drug Brand name (manufacturer) Indication Target GPCR class FDA approval date Erenumab-aooe Aimovig (Amgen) Migraine (prevention) CALCRL B1, peptide, calcitonin 05/17/2018 Ubrogepant Ubrelvy (Allergan) Migraine CALCRL B1, peptide, calcitonin 12/23/2019 Rimegepant Nurtec ODT (Biohaven Pharm) Migraine CALCRL B1, peptide, calcitonin 02/27/2020 Eptinezumab-jjmr Vyepti (Lundbeck) Migraine (prevention) CALCRL B1, peptide, calcitonin 02/21/2020 Semaglutide (injection) Ozempic (Novo Nordisk) Type 2 diabetes mellitus GLP-1R B1, peptide, glucagon-like peptide-1 12/05/2017 Semaglutide (oral) Rybelsus (Novo Nordisk) Type 2 diabetes mellitus GLP-1R B1, peptide, glucagon-like peptide-1 09/20/2019 The drugs listed above were identified manually from Drugs@FDA original NDA and BLA database (data extracted from August 2017 to June 2020) and cross-referenced with Drugbank, 20 IUPHAR, and ChemBL databases CALCRL (CGRPR) calcitonin gene-related peptide type 1 receptor One of the latest approaches to develop more efficacious therapeutics against T2DM and obesity relates to dual- and tri-agonists targeting two or more of GLP-1R, glucagon receptor (GCGR), and glucose-dependent insulinotropic peptide receptor (GIPR). Many of them are currently in different phases of clinical trials (Table 3 ). 32 – 37 Of note, in this receptor family, GLP-2 stimulates intestinal growth and an approved GLP-2R agonist, teduglutide, is used to treat short bowel syndrome. 38 Table 3 Mono-, dual- and tri-agonists targeting GLP-1R, GCGR, and GIPR Receptor Drug Dose form Manufacturer Status GLP-1R mono-agonist Exenatide SC, twice daily AstraZeneca Approved Liraglutide SC, once daily Novo Nordisk Approved Exenatide SC, once weekly AstraZeneca Approved Lixisenatide SC, once daily Sanofi-Aventis Approved Albiglutide SC, once weekly GlaxoSmithKline Approved Dulaglutide SC, once weekly Eli Lilly Approved Semaglutide SC, once weekly Novo Nordisk Approved Semaglutide Oral, once daily Novo Nordisk Approved GLP-1R/GCGR dual-agonist HM12525A SC, once weekly Hamni Pharmaceuticals Phase 2 JNJ54728518 SC Janssen Pharmaceuticals Phase 1 MEDI0382 SC, once daily MedImmune Phase 2 MK8521 SC, once daily Merck Phase 2 NN9277 SC Novo Nordisk Phase 1 MOD6030 SC, once monthly Prolor Biotech, Opko Health Phase 1 SAR425899 SC, once daily Sanofi-Aventis Phase 2 VPD107 SC, once weekly Spitfire Pharma Preclinical TT401 SC, once weekly Transition Therapeutics Phase 2 ZP2929 SC, once daily
Classes C and F
Class C (glutamate) contains 22 receptors, which are further divided into 5 subfamilies including 1 calcium-sensing receptor (CaSR), 2 gamma-aminobutyric acid (GABA) type B receptors (GABA B1 and GABA B2 ), 3 taste 1 receptors (TS1R1–3), 8 metabotropic glutamate receptors (mGluR1–8), and 8 orphan GPCRs. 48 The distinctive features of glutamate subfamily are their large ECD and obligated constitutive dimer for receptor activation. 49 The structural information of ECD indicates the roles of conserved venus fly trap (VFT) and cysteine-rich domain (CRD) on the ligand-binding site. Two conserved disulfide bonds between VFT domains stabilize the homodimers or heterodimers of class F GPCRs. 50 The cryo-EM structures of the first full-length mGluR5 51 and more recently the GABA B Rs further revealed their assembly mechanism and overall architecture. 52 – 55 To date, 16 drugs have been approved by the FDA targeting 8 class C GPCRs. As archetypal receptors, mGluRs mediate the stimulus of agonists such as glutamate and their malfunction are implicated in various diseases, including cancer, schizophrenia, depression, and movement disorders. Acamprosate, an antagonist of mGluR5, was launched in 2004 as an anti-neoplastic agent. 56 In fact, mGluRs have been vigorously pursued as therapeutic targets and there are 15 drug candidates undergoing clinical trials at present for pain, migraine, Parkinson’s disease, Fragile X syndrome, etc. Although allosteric modulators of class C have attracted significant development efforts involving 8 clinical trial stage compounds [2 positive (PAM) and 6 negative (NAM) allosteric modulators], the only success is cinacalcet, a small molecule PAM of CaSR approved in 2004 for hyperparathyroidism and calcimimetics. 57 Only one class F GPCR (smoothed receptor SMO) has been validated as a drug target whose small molecule antagonists were approved as anti-neoplastic agents. 58 Other 10 members of this class are all Frizzled receptors (FZD1–10), which mediate Wnt signaling and are essential for embryonic development and adult organisms. FZDs together with cognate Hedgehog and Wnt signal are associated with a variety of diseases such as cancer, fibrosis, and neurodegeneration. 59 They share a conserved CRD in the extracellular part and ECD structures of SMO and FZD2/4/5/7/8 were determined. 60 However, only SMO, FZD4, and FZD5 have TMD structures. 61 – 63 Lack of full-length structures and complexity in signaling pathways impeded drug discovery initiatives. 60 Linking of Wnt with extracellular CRD would activate downstream signaling, while the dimerization process and the interaction between CRD and TMD remain elusive. 64 It is known that the downstream effectors of Wnt signaling consist of β-catenin, planar cell polarity, and Ca 2+ pathways, whereas receptor activation involves in Wnt, Norrin, FZD, LDL receptor-related protein 5/6, and many other co-factors. 64 Key breakthrough is thus required to advance our knowledge of these receptors.
Medicinal chemistry of GPCR
Agent type Agents targeting GPCRs continue to expand in the past decades. Among them, exogenous small molecules, including traditionally developed synthetic organics, natural products, and inorganics, still dominate with a total percentage of 64% (Fig. 2 ). Nevertheless, the proportion of small molecules declines since 2010. In addition to traditional ligand discovery, several new modalities appear, though currently at the stage of academic research. Covalent ligands, with the embedding of reactive moieties that can be covalently linked to receptors, significantly enhance the weak binding of unoptimized leads. 65 , 66 Photoactive ligands, developed by the introduction of photo-responsive groups to drug candidates, bring a new interdisciplinary field, photopharmacology. Albeit in its infancy, it has already found in vivo applications. 67 , 68 Fig. 2 Analysis on agents targeting GPCRs. Distribution of molecule type (left) and action mode (right). Positive, PAM; Negative, NAM In comparison, biologicals, such as peptides, antibodies, and metabolites, become more and more visible in the list. Particularly, the number of approved peptide drugs occupies approximately one third of the whole repertoire, with many more in different clinical stages as the pipeline 41 , 69 —most of them target classes A and B GPCRs. Naturally occurring peptides have been continually discovered from plants, animals, fungi, and bacteria. Although they act as efficient chemical messengers to modulate cellular functions, these peptides suffer from unfavorable pharmacokinetic and pharmacodynamics properties, such as very short plasma half-lives and low plasma protein binding. Therefore, chemical modifications are required to promote the membrane permeability, brain penetration, and oral bioavailability. 70 Available strategies include peptide cyclization, N -methylation, palmitoylation, unnatural amino acid insertion, peptide–small molecule conjugation, and peptide self-assembly. By the way, developing peptidic agents may offer a new approach to de-orphanize certain orphan GPCRs. 71 mAbs represent a promising alternative in GPCR drug discovery. 72 , 73 Over small molecules, mAbs possess obvious advantages of improved specificity, affinity, and other pharmacological properties. Thus they are being developed against cancers, inflammation, and metabolic disorders. To date, three GPCR-targeting mAbs were approved (mogamulizumab, erenumab, and eptinezumab) while bi-specific antibodies, nanobodies, antibody–drug conjugates, and antibody–peptide conjugation are also in the development stage. The emergence of many conceptually new molecular entities, such as RNA aptamer, provides not only powerful tool for biophysical study but also potential therapeutic candidates. 74 Usually, aptamer has great molecular diversity and little immunogenicity. 75 In addition, GPCRs are known to function by forming dimers (homodimers or heterodimers) and oligomers on the cell membrane. 76 Therefore, strategies to induce receptor dimerization and/or oligomerization have received attention using scaffolds based on DNA (aptamer), small molecule, and physical stimuli. 77
Structure–activity relationship (SAR)
Studies of SARs are critical to the identification of drug-like molecules, especially when the crystal or cryo-EM structure of a drug target is not available. Given that many 3D GPCR structures have been solved in the past decade, most approved drugs were discovered without relevant structural information. Two examples are reviewed below to show the importance of SAR analysis. Orexin-1 and orexin-2 receptors (also known as hypocretin receptors, OX 1 R and OX 2 R) are class A GPCRs for which two endogenous peptide ligands were identified, orexin A and orexin B (also known as hypocretin 1 and hypocretin 2). The orexin signaling system plays a crucial role in regulating the sleep/wake cycle—both OX 1 R and OX 2 R are involved while the precise contribution of each has yet to be defined. Therefore, dual antagonists were developed as potential treatment for insomnia. 78 Suvorexant (belsomra), the first-in-class dual orexin receptor antagonist, was launched in 2014. 79 The second, lemborexant/E2006 (dayvigo) developed by Eisai, was approved by the FDA in 2020. 80 It started from hit compound 1 ( 6 , Fig. 3 ) with modest binding affinity to OX 2 R ( K i = 8.7 µM) and no affinity for OX 1 R. 81 The first round of SAR studies revealed that changing the ketone group to an amide led to a remarkable enhancement (~1000-fold) of binding affinity at both OX 1 R and OX 2 R (compound 2 ). Substitution of the aniline group with a 2-amino-5-cyano pyridine (compound 3 ) maintained OX 2 R affinity and reduced OX 1 R activity, but physicochemical properties were improved compared to compound 2 . 81 Further SAR studies focused on the modification of all three aromatic substitutions in compound 3 . 82 Changing the di -OMe-phenyl substituent to a pyrimidine group resulted in a significant loss of binding affinity, as shown with compound 4 , but an improved overall profile due to reduced lipophilicity and enhanced solubility. Then replacing the cyano group to a fluorine regained the binding affinity for both receptors (compound 5 ), and finally adding a second fluorine to the benzene group significantly improved OX 1 R affinity and led to lemborexant. 82 Clearly, slight structural modifications may cause significant change of compound activity, and SAR studies coupled with optimization of physicochemical properties are useful steps to obtain druggable candidates. Fig. 3 SAR studies that led to the discovery of the dual orexin receptor antagonist lemborexant CGRP is a 37-amino acid neuropeptide and its receptor is implicated in migraine. 83 The benzodiazapinone compound 7 was identified as a hit compound with modest CGRP receptor binding affinity ( K i = 4.8 µM, Fig. 4 ). 84 Replacing the right-hand spirohydantoin structure with piperidyldihydroquinazolinone, a privileged structure for CGRP receptor antagonists, 85 an affinity boost of 100-fold was gained. 84 Further optimization of the benzodiazepinone core resulted in the caprolactam compound 9 , which showed a K i of 25 nM. 86 Changing the piperidyldihydroquinazolinone moiety to a piperidylazabenzimidazolone led to compound 10 , with a binding affinity of 11 nM. 87 Then by changing the N-substituent on the caprolactam and adding di -fluro substitutions on the lower benzene ring delivered compound MK-0974 ( 11 , K i = 0.77 nM, Fig. 4 ), which entered clinical trials. Compound 12 (BMS-846372) shares the same piperidylazabenzimidazolone and the lower diflurobenzene substructures with 11 but differs from the latter with a carbamate core structure and a pyridine-fused-cyclopentane in replacement of the caprolactam. 88 Compound 11 displayed high binding affinity while suffered from poor physicochemical properties, such as low solubility. To improve this, a hydroxyl group was attached to the cycloheptane ring and it was discovered that the ( S )-isomer 13 was more potent than the ( R )-OH compound 14 . The –OH was finally replaced with an -NH 2 group, which led to the clinical compound rimegepant. 89 The latter was further developed for better safety and efficacy profiles and obtained regulatory approval by the FDA in 2020. Fig. 4 SAR studies that resulted in the discovery of CGRP antagonists The above examples demonstrate that, starting from a modest affinity hit compound, systematic SAR studies could successfully lead to very potent GPCR ligands that qualify as clinical candidates. Slight modifications of chemical structures sometimes cause remarkable changes of binding affinity or potency, which could not always be accurately predicted by conventional methods, such as docking. Therefore, SAR studies will continue to play a critical role in drug discovery.
GPCR structure
The structure of GPCRs is a crucial determinant for understanding the molecular mechanisms underlying ligand recognition and receptor activation. It provides a foundation for drug discovery. The first crystal structure of inactive state rhodopsin purified from bovine eyes was solved in 2000. 90 Although tremendous efforts have been made, elucidation of GPCR structures remains challenging due to several bottlenecks, including low receptor expression level, difficulties in extraction, highly flexible conformation, lack of crystal contacts, etc. The first crystal structure of GPCR extracted from exogenously expressed host cells, the human β2-adrenergic receptor (β 2 AR, gene name: ADRB2 ) bound to an antagonist, was disclosed in 2007, representing a milestone in GPCR structural biology. 7 Several innovative methods, especially the incorporation of a soluble fusion partner and lipidic cubic phase (LCP) crystallization, facilitated subsequent studies. Further technological breakthroughs in protein expression and purification, 91 , 92 receptor engineering, 8 , 93 application of Fab fragment and nanobody, 94 , 95 and GPCR crystallization 96 led to an exponential growth of this field. The crystal structure of β 2 AR in complex with stimulatory G protein (G s ) solved in 2011 97 and rhodopsin bound to visual arrestin reported in 2015 98 revealed the molecular mechanism of GPCR interaction with G protein and arrestin, respectively. Notably, the wave of resolution revolution in the single-particle cryo-EM has brought a significant impact on the determination of GPCR complexes. 10 Over 90% of GPCR–transducer complex structures were solved using cryo-EM (Table 4 ). To date, a total of 455 structures from 82 GPCRs belonging to all classes except B2 have been reported (Table 4 ). Although GPCRs show extensive sequence diversity, they share a conversed structural architecture of a TMD composed of seven helices embedded in the cell membrane. The transmembrane (TM) helices, essential for signal transduction across the cell membrane, are linked by three extracellular loops (ECLs) and three intracellular loops (ICLs). However, distinct structural features exist among members from different classes despite their overall structural similarity. Table 4 List of GPCR structures Receptor Number of structures PDB code (GPCR structure without downstream effector) PDB code (GPCR structure with downstream effector) Class A ADORA1 3 5N2S, 5UEN 6D9H ADORA2A 49 2YDO, 2YDV, 3EML, 3PWH, 3QAK, 3REY, 3RFM, 3UZA, 3UZC, 3VG9, etc. 6GDG ADRA2B 2 6K41 , 6K42 ADRB1 27 2VT4, 2Y00, 2YCW, 3ZPQ, 4AMI, 4BVN, 4GPO, 5A8E, 5F8U, 6H7J, etc. 6TKO , 7JJO ADRB2 33 2R4R, 2RH1, 3D4S, 3KJ6, 3NY8, 3P0G, 3PDS, 4GBR, 4LDE, 5D5A, etc. 3SN6, 6NI3 AGTR1 6 4YAY, 4ZUD, 6DO1, 6OS1, 6OS2, 6OS0 AGTR2 5 5UNF, 5UNG, 5UNH, 5XJM, 6JOD, APLNR 1 5VBL C5AR1 3 5O9H, 6C1Q, 6C1R CCR2 3 5T1A, 6GPS, 6GPX CCR5 6 4MBS, 5UIW, 6AKX, 6AKY, 6MEO, 6MET CCR6 1 6WWZ CCR7 1 6QZH CCR9 1 5LWE CHRM1 3 5CXV, 6WJC 6OIJ CHRM2 11 3UON, 4MQS, 4MQT, 5YC8, 5ZK3, 5ZK8, 5ZKB, 5ZKC 6OIK , 6U1N , 6UP7 CHRM3 5 4DAJ, 4U14, 4U15, 4U16, 5ZHP CHRM4 1 5DSG CHRM5 1 6OL9 CNR1 7 5TGZ, 5U09, 5XR8, 5XRA, 6KQI 6N4B , 6KPG CNR2 4 5ZTY, 6KPC 6PT0 , 6KPF CXCR1 1 2LNL CXCR2 3 6LFL 6LFM , 6LFO CXCR4 6 3ODU, 3OE0, 3OE6, 3OE8, 3OE9, 4RWS CYSLTR1 2 6RZ4, 6RZ5 CYSLTR2 4 6RZ6, 6RZ7, 6RZ8, 6RZ9 DRD2 2 6CM4 6VMS DRD3 1 3PBL DRD4 3 5WIU, 5WIV, 6IQL EDNRB 8 5GLH, 5GLI, 5X93, 5XPR, 6IGK, 6IGL, 6K1Q, 6LRY F2R 1 3VW7 F2RL1 3 5NDD, 5NDZ, 5NJ6 FFAR1 4 4PHU, 5KW2, 5TZR, 5TZY FFAR2 2 6LW5 6OMM GPBAR1 2 7CFM , 7CFN GPR52 4 6LI1, 6LI2, 6LI0 6LI3 HCRTR1 11 4ZJ8, 4ZJC, 6TO7, 6TOD, 6TP3, 6TP4, 6TP6, 6TQ4, 6TQ6, 6TQ7, 6TQ9 HCRTR2 6 4S0V, 5WQC, 5WS3, 6TPG, 6TPJ, 6TPN HRH1 1 3RZE HTR1B 4 4IAQ, 4IAR, 5V54 6G79 HTR2A 5 6A93, 6A94, 6WH4, 6WGT 6WHA HTR2B 8 4IB4, 4NC3, 5TUD, 5TVN, 6DRX, 6DRY, 6DRZ, 6DS0 HTR2C 2 6BQG, 6BQH LPAR1 3 4Z34, 4Z35, 4Z36 MC4R 1 6W25 MTNR1A 5 6ME2, 6ME3, 6ME4, 6ME5, 6PS8 MTNR1B 4 6ME6, 6ME7, 6ME8, 6ME9 NPY1R 2 5ZBH, 5ZBQ NTSR1 11 3ZEV, 4BUO, 4BV0, 4BWB, 4GRV, 4XEE, 4XES, 5T04 6OS9 , 6OSA , 6PWC OPRD1 4 4EJ4, 4N6H, 4RWA, 4RWD OPRK1 2 4DJH, 6B73 OPRL1 3 4EA3, 5DHG, 5DHH OPRM1 4 4DKL, 5C1M 6DDE , 6DDF OXTR 1 6TPK P2RY1 2 4XNV, 4XNW P2RY12 3 4NTJ, 4PXZ, 4PY0 PTAFR 2 5ZKP, 5ZKQ PTGDR2 2 6D26, 6D27 PTGER3 2 6AK3, 6M9T PTGER4 2 5YHL, 5YWY RHO 55 1F88, 1GZM, 1HZX, 1L9H, 1LN6, 1U19, 2G87, 2HPY, 2I35, 2J4Y, etc. 4ZWJ, 5DGY, 5W0P, 6FUF, 6CMO , 6OY9 , 6OYA , 6QNO S1PR1 2 3V2W, 3V2Y TACR1 9 2KS9, 2KSA, 2KSB, 6HLL, 6HLO, 6HLP, 6J20, 6J21, 6E59 TBXA2R 2 6IIU, 6IIV US28 4 4XT1, 4XT3, 5WB1, 5WB2 Class B ADCYAP1R1 4 6P9Y , 6M1H , 6M1L , 6LPB CALCR 2 5UZ7 , 6NIY CALCRL 4 6E3Y , 6UVA , 6UUN , 6UUS CRHR1 4 4K5Y, 4Z9G 6PB0 , 6P9X CRHR2 1 6PB1 GCGR 9 4L6R, 5EE7, 5XEZ, 5XF1, 5YQZ 6LMK , 6LML , 6WHC , 6WPW GHRHR 1 7CZ5 GLP-1R 12 5NX2, 5VEW, 5VEX, 6KJV, 6KK1, 6KK7, 6LN2 5VAI , 6B3J , 6ORV , 7C2E , 6VCB GLP-2R 1 7D68 PTH1R 4 6FJ3 6NBF , 6NBH , 6NBI SCTR 2 6WZG , 6WI9 VIPR1 1 6VN7 Class C GABBR2 8 7C7S , 7C7Q , 6UO8 , 6VJM , 6UOA , 6UO9 , 6W2X , 6WIV GRM1 1
GPCR pharmacology
The explosion of 3D GPCR structures and computational simulations has revealed the dynamic conformations between inactive, intermediate, and active states of GPCRs. The detailed structural information illustrated that cholesterol, ion, lipids, and water also participate in receptor activation. 99 , 119 The flexibility of receptor-binding pocket endows the complex pharmacological mechanisms of ligand recognition and signal transduction. Biased signaling, allosteric modulation, and polypharmacology are helping us better understand how GPCRs bind to numerous ligands and how they transmit diverse signals to elicit physiological functions. Polypharmacology Ligand binding to multiple targets leads to antagonism, additive, or synergism pharmacological responses that could be positive or negative based on the mechanism of action. The paradigm of one drug vs. multiple targets has outpaced the time and cost associated with the conventional therapy. 120 Polypharmacology thus emerges to study acceptable degree of specificity toward multiple targets, interconnected signaling pathways that result in clinical benefit or cross-reactivity that may cause adverse events. 121 , 122 T2DM, obesity, cancer, and Alzheimer’s disease are major indications for GPCR modulators. 4 These polygenic diseases are not completely treatable by a single agent, while desirable efficacies may be achieved for certain respiratory conditions, central nervous system (CNS) disorders, and cardiovascular diseases through modulators directed against β 2 AR, DRD2, and AGTR1, 123 respectively. It was shown that 5-hydroxytryptamine receptor 2 (5-HT 2 ) binds to selective inverse (ritanserin) and highly promiscuous (ergotamine) agonists but the interaction with ergotamine is broad. 121 This feature allows the development of pan serotonin receptor modulators to treat different diseases. 124 – 127 For instance, zolmitriptan as an anti-migraine drug is also used for hyperesthesia via binding to off-target site, 120 and lorcaserin (Belviq) is used to treat obesity while its therapeutic potential for depression, schizophrenia, and drug addiction is being investigated. 128 , 129 However, off-target activity, hallucinations, 130 and cardiac valvulopathy related to 5-HT 2A and 5-HT 2B modulation 129 should be carefully monitored. Atypical antipsychotics are mainly targeting both dopamine and serotonin receptors, usually as antagonist for DRD2 and antagonist or inverse agonist for 5-HT 2A. 131 Exemplified by clozapine 120 and aripiprazole, 132 haloperidol, amoxapine, and asenapine 4 display a diverse spectrum of receptor interaction. Additionally, carazolol, a member of aminergic division exerts its effects by interacting with multiple adrenergic receptors as inverse agonist or allosteric antagonist. 19 , 129 Istradefylline combined with L-DOPA/dopamine simultaneously target A 2A R, DRD1 and DRD2 in animal model of Parkinson’s disease. 131 Amitryptyline, a tricyclic compound targeting muscarinic and histamine H1 receptors, 133 is used to treat depression and non-selective muscarinic receptor antagonists are trialed for bladder dysfunction. 4 Lorazepam, indicated for anxiety due to interaction with GABA A R, is also an allosteric modulator of the proton-sensing GPCR (GPR68) 134 and has been repurposed to treat pancreatic cancer. 2 6’-Guanidinonaltrindole (6’-GNTI) is an agonist with higher selectivity for δ/κ-opioid receptor heterodimer but not homodimer. Importantly, 6’-GNTI is an analgesic that offers additional benefit. In cardiovascular diseases, β blockers decrease catecholamine-induced heart rate elevation via interaction with valsartan (AT1R-mediated signaling). 135 It is of note that mono-, dual-, and tri-agonists for the glucagon family of receptors (GLP-1R, GCGR, and GIPR) have been developed and trialed for weight loss and glucose control (Table 3 ). Successful outcome will determine whether unimolecular polypharmacology is a practical approach to translate safety and efficacy of multiple agents into a single molecule. 136
Biased agonism
Activated GPCRs can recruit multiple transducers (such as heterotrimeric G proteins, GPCR kinases, and β-arrestin) and consequently produce distinct biological responses. Ligands that preferentially engage one signaling pathway over others are regarded as bias and may show improved therapeutic outcomes. 137 , 138 Biased signaling that has been applied to drug discovery involve AT2R, µ-OR, κ-OR, β-adrenergic receptors, DRD2, CTR, CCR, and adenosine receptors. µ-OR is the best studied receptor for biased agonism. 137 Compounds that stimulate Gα i coupling and cAMP production but not β-arrestin recruitment are preferable to retain analgesia and reduce opioid-related side effects. 139 This G protein bias was also demonstrated with widely used drug tramadol, 140 whose active metabolite, desmetramadol, elicited maximum cAMP production without affecting β-arrestin 2 recruitment compared to fentanyl and morphine. Safety profile is improved with less adverse effect such as respiratory depression. 140 Another µ-OR-biased ligand, oliceridine (TRV130, Olinvo TM ), passed phase III clinical trial but did not get the FDA approval for safety concerns. 141 The NDA for oliceridine was resubmitted and a new counterpart, TRV734, is not only suitable for oral administration but also safer due to reduced dependency. 142 A fourth µ-OR-biased ligand, PZM21, cross-reacts with κ-OR and failed to reduce respiratory depression in C57BL and CD-1 mice. 143 Whether this relates to its residual but marked effect on β-arrestin 2 recruitment, as opposed to oliceridine whose action is negligible, 144 remains to be further studied. Similar situation occurred with κ-OR as well whose agonists possess analgesic property and have a low risk of dependence and abuse but with adverse effects such as sedation, motor dysfunction, hallucination, and dysphoria. 145 G protein-biased agonists of κ-OR, 146 including RB-64, 145 mesyl salvinorin B, triazole 1.1, diphenethylamines and LOR17, 141 were reported to minimize the adverse effects in preclinical settings. One of such, nalfurafine, was approved in Japan (2015) as an anti-pruritic agent for patients with chronic liver diseases. 147 Carvedilol, known as a β1 and β2 adrenoceptor blocker, was found to be biased toward β-arrestin recruitment, G protein-coupled receptor kinase activation, and ERK1/2 phosphorylation. Joining its rank included alprenolol, bucindolol, and nebivolol, all are used to treat hypertension and congestive heart failure. 148 In the case of β3 adrenoceptor, CL316243 is cAMP-biased, whereas L748337 and SR59230 are ERK/p38 phosphorylation-biased. 149 , 150 Interestingly, CL316243 was also tested for treatment of obese mice. 151 , 152 However, none of them have advanced to the clinic. In contrast to µ-OR, arrestin bias is desirable for AT1R to improve cardiac performance. 153 Nonetheless, clinical development of AT1R modulators either resulted in a phase IIb trial failure (TRV027) in 2017 154 or never reached to clinical stage (SBpa, SVdF, SI, sarmesin, saralasin, and SII). 155 Of note is that biased molecules may show species preference. For instance, CL316243 is more active in mice than in humans, 156 whereas nalfurafine works better in humans vs. rodents. 157 A list of therapeutic agents with biased signaling approved or advanced to clinical trials is shown in Table 5 . Table 5 Therapeutic agents with biased signaling approved or in clinical trials Ligand Receptor (GPCR class) Signaling bias Indication Development status Reference Bromocriptine Serotonin receptors, adenosine receptors, dopamine receptors (class A) At 5HT 2 G q/11 ; at DRD2 β-arrestin Acromegaly, Parkinson’s disease, T2DM, idiopathic hyperprolactinemic disorder, neuroleptic malignant syndrome Approved 297 – 299 Pergolide 5HT 2 (class A) G q/11 Parkinson’s disease Approved 297 , 298 Ergotamine HTR2B (class A) β-arrestin Migraine Approved 297 , 300 Atropine CHRM3 (class A) Low efficacy agonist for G 12 , inverse agonist for Gq, antagonist for G i Organophosphorous poison antidote Approved 297 , 301 Pilocarpine CHRM3 (class A) β-arrestin and pERK1/2 Xerostomia Approved 297 , 302 Capadenoson ADRA1A (class A) cAMP Atrial fibrillation Phase 2 297 , 303 Alprenolol, bucindolol, carvedilol, nebivolol ADRB1 and ADRB2 (class A) β-arrestin Congestive heart failure Approved 148 , 304 Isoetharine ADRB (class A) β-arrestin Asthma Approved 304 , 305 Dihydrexidine (DAR-0100A) DRD2 (class A) Full agonists for G s but partial agonists for G olf signaling Schizotypal personality disorder Phase 2 297 , 306 Bifeprunox DRD2 (class A) Kinetic bias Bipolar disorder, depression, schizophrenia, psychosis Phase 3 297 , 307 Aripiprazole DRD2 (class A) Kinetic bias Psychosis Approved 297 , 307 TRV027 (TRV120027) AGTR1 (class A) β-arrestin Anti-hypertensive with cardio-protection Phase 2 5 , 304 TRV250 OPRD1 (class A) G protein Migraine Phase 1 5 , 137 Nalfurafine OPRK1 (class A) G protein Pruritus Approved 157 Tramadol OPRM1 (class A) G protein Pa
Allosteric modulation
In recent years, studies on allosteric GPCR modulators have gained unprecedented momentum. 158 – 161 An allosteric modulator is a ligand binding to a position other than the orthosteric site but can modify responses of a receptor to stimulus. Allosteric modulators that enhance agonist-mediated response are called PAMs, while those attenuate the response are called NAMs. This phenomenon is very common such that the Allosteric Database 2019 (ASD, http://mdl.shsmu.edu.cn/ASD ) 162 records 37520 allosteric modulations on 118 GPCR members, covering all four classes. Allosteric modulation is advantageous in terms of (i) using highly druggable pockets. In some cases, it is easier to design ligands at an allosteric site than the orthosteric site, such as class B GPCRs with orthosteric pockets wide open. For example, both PAM 163 and NAMs 107 binding to the same position at the TMD of GLP-1R were reported; (ii) improving selectivity. The orthosteric site and cognate ligand are often highly conserved, making it hard to discover very selective orthosteric binders. Meanwhile, non-conserved allosteric sites would be a better choice evidenced by discovery of many subtype selective allosteric modulators of acetylcholine 102 , 164 and cannabinoid receptors 165 , 166 ; (iii) introducing signal bias. Allosteric modulators with biased signaling were developed for prostaglandin F2α receptor 167 and chemokine receptor CXCR4. 168 Albeit still as an emerging concept, allosteric modulators have exhibited a great potential with some compounds being marketed or in clinical trials. 160 However, developing allosteric modulators of GPCRs remains challenging—molecules recorded in the ASD largely concentrate on two subfamilies, the mGluRs (8 members, 17,115 modulations), and mAChRs (5 members, 7666 modulations), accounting for nearly 2/3 of the total number. Some individual receptors also contribute a significant proportion, such as CB1 (1948 modulations), GABA B (1286 modulations), and follicle-stimulating hormone receptor (1233 modulations). Excluding these “easy cases,” allosteric modulators are few in number. Furthermore, the structural diversity of the allosteric modulators is quite low, for many derivatives would be included soon after a parent compound is identified. The difficulty in developing allosteric modulators is partly due to the limitation of detecting allosteric behavior: Not every newly discovered active compound could be tested for its effect on binding affinity or EC 50 of an orthosteric agonist, therefore some allosteric modulators were not correctly identified. For instance, BPTU in P2RY1, the first GPCR NAM solved in complex structure (PDB code: 4XNV), 169 was not considered allosteric until the structure was obtained. To make things worse, NAMs may weaken the binding of an endogenous ligand thus behaving like a competitor, such as NDT9513727 in C5AR1 170 (PDB code: 5O9H). 171 The most effective way to identify the binding site of an allosteric modulator on a GPCR is solving the complex structure. Crystallography is an effective technique, while rapidly deployment of cryo-EM has started to deliver its promise (PDB codes: 6OIK 172 and 6U1N 173 ). To date, 17 GPCRs have reported structures in complex with allosteric modulators. Detailed analysis of complex structures before October 2018 was reported previously, 161 and here we focus on insights provided by newly published results. The most unusual allosteric-binding sites on GPCRs are at the lipidic interface embedded in cell membrane. Five different positions were identified by crystal structures (Fig. 9 ): UP12, UP34, LOW34, LOW345, and LOW67. Four of them were recently reviewed. 161 The LOW34 site was reported in 2019 for ORG27569 in CB1 (PDB code: 6KQI 166 ; Fig. 10a ). Fig. 9 Schematic diagram of allosteric sites at the lipidic surface identified by complex structures. The binding sites are manually labeled on the crystal structure of β 2 AR (PDB code: 6OBA 180 ). Solid line, allosteric site at front side; dashed line, allosteric site at back side. UP, upper part aka close to the extracellular end; LOW, lower part aka close to the cytoplasmic end; numbers, main interacting transmembrane helices Fig. 10 Binding sites of allosteric modulators in GPCRs reported after October 2018, in comparison with related ligands. a NAM ORG27569 in CB1 (PDB code: 6KQI 166 ) in comparison with cholesterol (PDB code: 5XRA 179 ); b NAM AS408 (PDB code: 6OBA 180 ) and PAM Cmpd-6FA (PDB code: 6N48 181 ) in β 2 AR, in comparison with NDT9513727 in C5AR1 (PDB code: 6C1Q 182 ) and PAM AP8 (PDB code: 5TZY 183 ); c NAM maraviroc in CCR5 (PDB code: 4MBS 187 ) in comparison with chemokine analog antagonist [5P7]CCL5 (PDB code: 5UIW 188 ) and HIV envelope glycoprotein gp120 (PDB code: 6MEO 189 ); d PAM TT-OAD2 in GLP-1R (PDB code: 6ORV 190 ) in comparison with GLP-1 (PDB code: 5VAI 191 ) ORG27569 attracted much attention for its distinctive function: increasing the binding of orthosteric agon
Disease indication
GPCRs are involved in many human diseases and specific drug intervention is one of the most celebrating achievements in the pharmaceutical industry (Table S3 and Fig. S1 ). Among all available drugs targeting GPCRs, HRH1, DRD2, M1R, and ADRA1A are the most frequently addressed for indications such as hypertension, allergy, pain, and schizophrenia, and 33% of them have >1 indication with an overall average of 1.5. Although CNS diseases are still popular accounting for 26% of all approved indications, development focuses have now been shifted to T2DM, obesity, multiple sclerosis, smoking cessation, short bowel syndrome, and hypocalcemia. Repurposing of existing drugs for new indications also emerged to supplement discovery efforts.
Structure-based drug design
As two general types of computer-aided drug design techniques (Table 6 ), 205 SBDD and ligand-based drug design, exploit the structural information of protein targets and the knowledge of known ligands, respectively (Table 6 ). SBDD, on the basis of crystal/cryo-EM/NMR structures or homology models, first identifies key sites and important interactions responsible for target functions, then screens large virtual library/designed agents that disrupt or enhance such interactions to modulate relevant biological processes and/or signaling pathways by molecular docking, and finally discovers active leads with desired pharmacological properties. Clearly, the past decade is a golden age for SBDD on GPCR. With the year of 2011 (when LPC crystallization, 206 fusion proteins, 8 and other key techniques collaborated to launch the outbreak of GPCR structure determination including the landmark β 2 AR-G s ) for watershed, 207 SBDD of GPCR evolves two distinct stages: rhodopsin-based homology model and truly authentic structure of individual receptors. Boosted by the fast-increasing number of high-quality GPCR structures, improved accuracy of combinational computational approaches, and better understanding of activation mechanism and pharmacology, SBDD is developing rapidly with fruitful scientific reports and increasing GPCR-targeted drugs contributed by this approach. Considering the length of time required for a drug to be available on the market (10–15 years) and the chance of applying structural biology information to hit discovery and lead optimization in the first 2–3 years of a drug discovery program, it is probably too early to see the approval of GPCR-targeted drugs being developed with the aid of a structure, and such situation is likely to change as the tremendous efforts from both academia and industry start to bear the fruits of successes. The following is a brief account of recent advances in three main aspects of SBDD (chemical space, receptor dynamics, and pose evaluation) in the context of their application in GPCR pharmacology. Optimized virtual library Despite the vast chemical space (>10 63 drug-like molecules), only a nominal fraction has been explored by SBDD, where both the compound availability and insufficient diversity limited the number of screened ligands. To overcome these problems, ultra-large 208 – 210 and focused libraries 211 – 213 were employed. Lyu and colleagues 208 presented an excellent model of “bigger is better” in virtual drug screening. Based on the 130 well-characterized reactions, they generated 170 million make-on-demand compounds ( http://zinc15.docking.org/ ), the resulting library is remarkably diverse with >10.7 million scaffolds unavailable before. By docking 138 million molecules against DRD4, they discovered 81 new chemotypes (24% hit rate), 30 of them showed submicromolar activity, including a 180-pM subtype-selective and G i -biased DRD4 agonist. This ultra-large library docking study provides important information: (i) hit rate fell almost monotonically with docking score; (ii) hit rate vs. score curve of DRD4 predicted that 1 from every 873 compounds may have a minimum affinity of 1 μM; and (iii) human visual evaluation improved the selected compound with higher affinities, efficacies, and potencies but not the hit rate. A follow-up study on MT 1 by docking >150 million “lead-like” molecules 209 identified 15 active leads (39% hit rate) with potencies ranging from 470 pM to 6 μM. Alternatively, focused library 210 – 215 including scaffold library, natural products, dark chemical matter (i.e., chemicals that have never shown bioactivity tested in over 100 assays), and fragment- and lead-like libraries were introduced in virtual screening (VS) for dozens of receptors. Focused on compound library of traditional Chinese medicine (TCM), Liu et al. found that salvianolic acids A and C antagonized the activity of both P2RY1 and P2RY12 purinoceptors in the low µM range, while salvianolic acid B antagonized the P2RY12 purinoceptor. Remarkably, these three salvianolic acids are major active components of the broadly used hemorheologic TCM Danshen ( Salvia militorrhiza ). Taking NTSR1 as an example, Ranganathan et al. found that the fragment library tended to have higher hit rate than that of the lead-like library (19%) but the affinities were ∼100-fold weaker. Collectively, these results demonstrate the importance and advantages of ultra-large and tailored libraries in discovering potent GPCR modulators.
Receptor dynamics
Emerging evidence from crystallography, spectroscopy, and molecular dynamics (MD) simulations have demonstrated the crucial roles of GPCR dynamics involved in ligand recognition, receptor activation, and allosteric modulation. 216 – 218 To consider the protein flexibility during GPCR-related SBDD, many computational approaches 217 including rotamer sampling, induced-fit docking, and ensemble docking have been employed showing a great promise, especially in the search of biased, bi-topic, or allosteric modulators. During ensemble docking, 212 , 217 , 219 – 221 ligands are docked into multiple structures representing different possible conformational states rather than a single structure, where the targets could be multiple crystal structures or extracted from MD/Monte Carlo (MC) simulations or normal mode analysis (NMA). By evaluating the known ligand enrichment, as well as selectivity for agonists or antagonists on seven GPCR/ligand co-structures, Coudrat et al. found that small variations in structural features are responsible for their success in VS, while a combination of ligand/receptor interaction patterns and predicted interaction strength is associated with the predictive power of binding pockets in VS. 220 Compared to the Glide VS workflow, the combination of accelerated MD simulations and Glide induced fit docking of M2R by Miao et al. provided much-improved enrichment factors and identified four PAMs and one NAM with unprecedented chemical diversity. 221 For 5-HT 1A whose crystal structure is not available, Warszycki et al. applied MC and NMA to generate an ensemble of binding pockets with the input of a homology template and known active compounds and finally discovered two new active ligands through VS. 222
Pose evaluation
Correctly selecting and ranking poses of docked compounds in the ligand-binding pockets have been a challenge for SBDD, especially for GPCR that is embedded in the cell membrane with significant conformational adaptability. To address this problem, many physics-based scoring functions 223 – 226 integrated with some user-friendly computer programs (e.g., Dock, GOLD, AutoDock, Glide, and rDock) were routinely adopted in SBDD. Recently, precise computational approaches including free energy calculation methods like molecular mechanics/Poisson–Boltzmann surface area (MM/PB(GB)SA), 227 , 228 free energy perturbation (FEP), 229 quantum mechanical/MM calculations, 230 and fragment molecular orbital 231 , 232 have been employed with improved performance. Compared to the empirical scoring functions, MM/PB(GB)SA and FEP are physically more rigorous free-energy calculation methods with an increased computational cost and have been adopted in the studies of DNA–ligand, protein–ligand, and protein–protein interactions. 233 , 234 By introducing of the minimization-based MM/GBSA refining and rescoring of docked poses, Zhou et al. identified seven 5-HT 2B antagonists with novel chemical scaffolds and the most potent one has an IC 50 of 27.3 nM in a cellular assay. 227 Lenselink et al. used FEP to design A 2A R antagonists and identified a highly potent molecule with K i of 1.2 nM. 232 However, computational investigation across 20 class A crystal structures and 934 known ligands demonstrated that the correlations between predicted binding free energy by MM/PBSA and experimental data varied significantly. The observed variations exist between individual receptors and are highly system specific, 235 indicating that successful application of MM/PBSA may require additional efforts in validation of experimental data and optimization of simulation/calculation parameters. Alternatively, protein–ligand interaction fingerprints exacted from available crystal structures 225 , 236 fueled docking score with protein–ligand-binding mode information and resulted in improved VS virtual hit rates for β 2 AR (53%) and HRH1 (73%) with up to nM affinities and potencies. 225 , 236 Collectively, innovation in VS and pose evaluation, along with evolution of computational hardware, has significantly advanced SBDD and is expected to lift the discovery efficiency to a new height, since GPCRs have multiple downstream signaling pathways responsible for distinct functions or consequences. The high degree of sequence and pocket similarities between different subtypes demands for novel ligands with superior specificity and selectivity. In this regard, allosteric and biased modulators may offer additional pharmacological benefits.
Subtype selectivity
It is known that GPCR subtypes share high sequence identities in orthosteric sites with distinct distribution and downstream signaling profiles. Cross-reactivity among subtypes could cause undesired side effects. For example, five MR subtypes display different G protein coupling features (G q/11 : M1R, M3R and M5R; G i/o : M2R and M4R) and organ distribution (CNS, M1R; peripheral tissues such as heart and colon, M2R), while their sequence identify (64–82%) and similarity (82–92%) in TMD are quite conserved. 172 Similar observations were seen among dopamine receptors (DRD1 to DRD5), histamine receptors (HRH1 to HRH4), and adenosine receptors (A 1 AR, A 2A R, A 2B R, and A 3 AR). Guided by structural information, rational design of subtype selective compounds progresses steadily. Using an extended DRD2-specific binding pocket from the haloperidol-bound DRD2 crystal structure, Fan et al. discovered two highly selective DRD2 agonists (O4SE6 and O8LE6) that specifically activate DRD2 (EC 50 = 1 µM) after screening of 320 non-olfactory GPCRs. 237 Through VS of 3.1 million molecules against M2 and M3, Kruse et al. identified a partial M3 agonist without measurable M2 agonism, capable of stimulating insulin release from a mouse β-cell line. 238 Wei et al. reported a multistage VS of the ChemDiv library (1,492,362 compounds) toward A 1 AR and discovered four novel antagonists with good affinity and selectively (>100-fold) over A 2A R. 239
Biased signaling
Recently, Suomivuori et al. performed extensive MD simulations to identify two major signaling conformations that couple effectively to arrestin or G protein, respectively. They then designed ligands via minor chemical modification resulting in strong arrestin-biased or G q -biased signal transduction. 240 Meanwhile, McCorvy and colleagues discovered that specific ECL2–ligand contacts are associated with β-arrestin recruitment, whereas blockage of TM5 interaction reduces the G i/o signaling. An arrestin-biased DRD2 modulator was thus made exhibiting a calculated bias factor of 20 relative to quinpirole. 241 In addition, Mannel et al. conducted a VS of a tailored virtual library bearing 2,3-dichlorophenylpiperazine for DRD2 and found that 18 compounds occupy both orthosteric and allosteric sites, and 4 of them stimulated β-arrestin recruitment (EC 50 = 320 nM, E max = 16%) without detectable G protein signaling. 214
Allosterism
In the past 5 years, an increasing number of receptor–allosteric modulator complex structures revealed diversified positions of allosteric sites and a variety of binding modes, thereby deepening our understanding of allosteric modulation in terms of underlying mechanisms and structural basis. Further to conventional orthosteric ligands, allosteric modulators affect receptor function in different ways. While PAM may enhance maximal efficacy, NAM could reduce agonist signaling strength. 159 , 160 , 242 It was reported that a PAM of M2R is located above the orthosteric site and interacts with ECLs. Korczynska et al. screened 4.6 million molecules against the allosteric sites of M2R and identified a PAM that potentiated the action of antagonist N -methyl scopolamine (NMS). Subsequent optimization led to a subtype-selective compound 628 that increased NMS binding with a co-operativity factor of 5.5 and a K B of 1.1 µM. 16 Alternatively, Lückmann et al. carried out MD simulations of agonist-removed FFAR1 and found that closure of a potential allosteric site is associated with agonist binding—compounds that bind to this site to prevent the closure functions as allosteric agonists. 243 Obviously, aided by >400 structures from 82 receptors, SBDD is now entering into a new era with substantial knowledge of GPCR signaling 103 , 110 , 244 and drug candidate attributes. 245
Novel screening technology
As GPCRs represent the most prominent family of therapeutic targets, 132 innumerable efforts have been made in both industry and academia to screen for novel ligands that can modulate the activity of a specified GPCR and serve as lead compounds for drug development. A diverse array of experimental technologies suitable for assaying protein–ligand interactions have been directly applied or tailored to GPCR-targeted ligand screening, and they can be classified into three main categories: binding-based, stability-based, and cell signaling-based assays (Table 7 ). Binding-based assays monitor the physical interactions between a GPCR protein typically in a purified recombinant form with individual test compounds. Cell signaling-based assays measure downstream effectors (e.g., cAMP, Ca 2+ , IP1/IP3) of specific intracellular signaling pathways known to be mediated by GPCR, which reflect the functional outcome of ligand binding to the receptor. Stability-based assays assess the variation of thermal stability for a purified protein when treated by test compounds. These different techniques vary in the ligand screening throughput and binding characteristics (Table 7 ). In the lead discovery stage, both binding- and signaling/activity-based assays are implemented in a parallel or sequential manner, as the multipronged use of complementary techniques would reduce the overall false-positive and false-negative rates. 246 Table 6 Comparison of the advantages and disadvantages of various computer-aided drug design approaches Approach Advantage Disadvantage Ligand-based drug design (LBDD) Quantitative structure–activity relationship 314 , 315 (QSAR) Understanding interactions between functional groups, convenient, does not require the structural information of a target Descriptor selection, false correlations, enough known ligands Pharmacophore modeling 316 , 317 Effective model, convenient, does not require the structural information of a target Known ligands, less novelty; missing conformations Structure-based drug design (SBDD) Virtual screening of ultra-large libraries 224 – 226 Novel scaffolds and chemotypes, higher chances of finding potent ligands Substantial computational resources, compound synthesis Virtual screening of focused libraries 227 – 229 Less demand for computational resources, compound easy to purchase, specific scaffolds or origins Reduced diversity, less novelty Ensemble docking 235 – 237 Considering protein flexibility, improved enrichment factors, rescue of false-negative ligands Increased computational burden, pose evaluation, false positive Energy-based pose evaluation 248 – 252 Improved scoring and ranking ability Substantial computational burden, method validation target dependency Here we summarize about 20 experimental screening technologies adapted to GPCR ligand discovery (Table 7 ) and highlight the most recent development of binding-based approaches. Notably, an update of assays assessing GPCR activation and signaling has been provided in a previous review 247 and will not be elaborated here. The structure-based VS is covered in the above section. Structural elucidation technologies (e.g., X-ray crystallography, single-particle cryo-EM, NMR, and HDX-MS) not suitable for high-throughput screening (HTS) are also excluded. DNA-encoded library (DEL) Impressive technological advances have been made for binding-based ligand screening over the past decade. Specifically, DEL has emerged as a powerful approach to drug discovery. 248 – 251 Created by split and pool synthesis, DEL usually contains hundreds of thousands to billions of distinct small molecule–DNA conjugates. A majority of DEL-based HTS reported to date involve incubation of an immobilized target protein with the library before the protein–ligand complexes are isolated. Encoding DNA tags associated with the immobilized target are then amplified and sequenced to assign relevant chemical structures. 250 , 251 Although DEL was predominantly applied to ligand screening against soluble proteins such as enzymes, successful adaptation of this technique to GPCRs was reported in a few cases. 252 – 255 Lefkowitz group reported the discovery of a NAM for β 2 AR by screening a DEL of 190 million. 252 This NAM not only has a unique chemotype but also exhibits low μM affinity and inhibits cAMP production as well as β-arrestin recruitment. Later on, the same team discovered the first small molecule PAM for β 2 AR through HTS of >500 million DEL compounds. 253 Both NAM and PAM demonstrated high selectivity. Of note is that the NAM was found using unliganded β 2 AR, whereas the PAM was unmasked via intentional application of β 2 AR with its orthosteric site occupied by an agonist thereby shifting the receptor to the active state. 252 , 253 These two studies elegantly demonstrated a proof-of-concept strategy for binding-based screening of allosteric modulators targeting different conformational states. 253 The power of DEL in the discovery of alloste
Affinity selection MS
Due to the high sensitivity and high selectivity of modern MS for both protein and small molecule analysis, versatile MS-based technologies have been developed in the past two decades for screening ligands of a given protein target or characterization of ligand-binding properties (Table 7 ). Almost all of them were originally developed for measuring ligand interactions with soluble proteins, 256 – 261 and recently they have been adapted to more challenging GPCR drug discovery. The majority of MS-based technologies (e.g., automated ligand identification system (ALIS), ultrafiltration–liquid chromatography/MS, frontal affinity chromatography–MS, membrane-based affinity MS, and competitive MS binding) employ a methodology very similar to DEL as they all capture and detect ligands physically associated with a given GPCR, 262 – 267 except that native MS analyzes the entire ligand-bound receptor complexes. 268 , 269 In general, these methods have several advantages over ligand binding or cell signaling assays: (i) unbiased and direct detection of ligand–receptor binding facilitates the identification of both orthosteric and allosteric modulators; (ii) confirmation of ligand identity with accurate mass measurement; (iii) no chemical labeling or DNA encoding of test compounds; and (iv) quantitative MS analysis enables ranking of ligand affinity or evaluation of binding characteristics. Table 7 Diverse GPCR ligand screening technologies classified into three categories Category Method Assay principle Readout Activity characterization Throughput Binding-based assay Radiolabeled ligand binding Detect binding of a radioisotope-labeled ligand to a target in competition with a test compound Radioactivity K i , K on , K off Medium DEL DNA-encoded compounds bound to a target are affinity selected and their structures revealed by DNA sequencing DNA sequence Affinity ranking Ultra-high SPR Detect changes in the refractive index of the gold film surface when ligands interact with a target immobilized on the chip surface Refractive index (mass on surface) K d , stoichiometry ( n ), K on , K off Medium MST Detect directed movement of molecules through a temperature gradient using covalently attached or intrinsic fluorophores Fluorescence intensity K d , stoichiometry ( n ) Medium TR-FRET Detect fluorescence resonance energy transfer caused by interaction between target and ligand labeled with specific fluorophores Fluorescence intensity K i Medium ALIS Target–ligand complexes are isolated by fast SEC, and dissociated ligands are identified by high-res MS m / z and MS intensity Affinity ranking, K d , ACE 50 High Membrane-based affinity MS Target–ligand complexes in the cell membrane are separated from solution by filtration prior to ligand identification by high-res MS m / z and MS intensity Affinity ranking High UF-LC/MS Target–ligand complexes are separated from solution by ultrafiltration prior to ligand identification by LC-MS m / z and MS intensity Affinity ranking, K d High FAC-MS Detect ligands flowing through a protein-immobilized column based on breakthrough curves determined by MS m / z and MS intensity Affinity ranking, K d Medium Competitive MS binding Detect binding of a non-radioactive ligand to a target in competition with a test compound by MRM-based MS analysis m / z and MS intensity K i , K on , K off Low Native MS Detect intact protein–ligand complexes in the gas phase by MS m / z and MS intensity K d , stoichiometry ( n ) Low Stability-based assay DSF Detect changes in protein fluorescence over a temperature gradient Fluorescence intensity T m Medium DLS Measure changes in the protein aggregate size based on static light scattering properties over a temperature gradient Light scattering intensity T agg Medium Cell signaling assay GTPγS Detect 35 S-GTPγS binding to GPCR-expressing cell membranes as a result of receptor activation Radioactivity EC 50 , IC 50 High cAMP Detect cellular levels of cAMP coupled to Gα s or Gα i activation Luminescence/fluorescence EC 50 , IC 50 High Ca 2+ Detect cellular levels of free Ca 2+ coupled to Gα q/11 or Gα 15/16 activation Fluorescence EC 50 , IC 50 High IP3/IP1 Detect cellular levels of IP3 coupled to Gα q or Gα i activation Fluorescence EC 50 , IC 50 High Luciferase reporter gene Measure reporter gene transcriptional activity downstream of receptor activation Luminescence EC 50 , IC 50 High β-arrestin recruitment Detect cellular levels of β-arrestin along with receptor endocytosis Luminescence EC 50 , IC 50 Medium–high DEL DNA-encoded library, SPR surface plasmon resonance, MST microscale thermophoresis, ALIS automated ligand identification system, UF ultrafiltration, FAC frontal affinity chromatography, DSF differential scanning fluorimetry, DLS dynamin light scattering ALIS is currently the most prevailing MS-based technique employed in pharmaceutical companies for HTS of large-scale synthetic compound libraries. 256 , 261 , 270 – 273 This system integrates size exclus
Emerging opportunities and prospects
Recent scientific and technological advancements in GPCR biology have provided an enormous amount of information that will benefit our current and future efforts in rational drug design. Integration and refinement of massive data by artificial intelligence is a clear direction to guide both virtual and experimental screening of efficacious therapeutic agents with new scaffolds and of novel chemotypes for all classes of GPCRs. However, as described in this review, factors that influence GPCR drug discovery include, but not limited to, therapeutic target, chemical diversity, mechanism of signaling, ligand-binding site, mode of action, clinical indication, polypharmacology, etc. Future opportunities may arise from: (i) de-orphanization of orphan GPCRs to provide novel targets; (ii) new indication for drug intervention via discovery and/or repurposing efforts; (iii) development of lead compounds targeting classes B2 and F GPCRs to address unmet medical needs; and (iv) validation of polypharmacology may lead to improved drug therapies.
Supplementary information
The online version of this article (10.1038/s41392-020-00435-w) contains supplementary material, which is available to authorized users.
| DOI | 10.1038/s41392-020-00435-w |
| PubMed ID | 33414387 |
| PMC ID | PMC7790836 |
| Journal | Signal Transduction and Targeted Therapy |
| Year | 2021 |
| Authors | Dehua Yang, Qingtong Zhou, Viktorija Labroska, Shanshan Qin, Sanaz Darbalaei, Yiran Wu, Elita Yuliantie, Linshan Xie, Houchao Tao, Jianjun Cheng, Qing Liu, Suwen Zhao, Wenqing Shui, Yi Jiang, Ming‐Wei Wang |
| License | Open Access — see publisher for license terms |
| Citations | 614 |