Cancer remains a leading cause of mortality worldwide and calls for novel therapeutic targets. Membrane proteins are key players in various cancer types but present unique challenges compared to... Show moreCancer remains a leading cause of mortality worldwide and calls for novel therapeutic targets. Membrane proteins are key players in various cancer types but present unique challenges compared to soluble proteins. The advent of computational drug discovery tools offers a promising approach to address these challenges, allowing for the prioritization of "wet-lab" experiments. In this review, we explore the applications of computational approaches in membrane protein oncological characterization, particularly focusing on three prominent membrane protein families: receptor tyrosine kinases (RTKs), G protein-coupled receptors (GPCRs), and solute carrier proteins (SLCs). We chose these families due to their varying levels of understanding and research data availability, which leads to distinct challenges and opportunities for computational analysis. We discuss the utilization of multi-omics data, machine learning, and structure-based methods to investigate aberrant protein functionalities associated with cancer progression within each family. Moreover, we highlight the importance of considering the broader cellular context and, in particular, cross-talk between proteins. Despite existing challenges, computational tools hold promise in dissecting membrane protein dysregulation in cancer. With advancing computational capabilities and data resources, these tools are poised to play a pivotal role in identifying and prioritizing membrane proteins as personalized anticancer targets. Show less
Developments in computational omics technologies have provided new means to access the hidden diversity of natural products, unearthing new potential for drug discovery. In parallel, artificial... Show moreDevelopments in computational omics technologies have provided new means to access the hidden diversity of natural products, unearthing new potential for drug discovery. In parallel, artificial intelligence approaches such as machine learning have led to exciting developments in the computational drug design field, facilitating biological activity prediction and de novo drug design for molecular targets of interest. Here, we describe current and future synergies between these developments to effectively identify drug candidates from the plethora of molecules produced by nature. We also discuss how to address key challenges in realizing the potential of these synergies, such as the need for high-quality datasets to train deep learning algorithms and appropriate strategies for algorithm validation. Show less
Gorostiola González, M.; Broek, R.L. van den; Braun, T.G.M.; Chatzopoulou Chatzi A.; Jespers, W.; IJzerman, A.P.; ... ; Westen, G.J.P. van 2023
Proteochemometric (PCM) modelling is a powerful computational drug discovery tool used in bioactivity prediction of potential drug candidates relying on both chemical and protein information. In... Show moreProteochemometric (PCM) modelling is a powerful computational drug discovery tool used in bioactivity prediction of potential drug candidates relying on both chemical and protein information. In PCM features are computed to describe small molecules and proteins, which directly impact the quality of the predictive models. State-of-the-art protein descriptors, however, are calculated from the protein sequence and neglect the dynamic nature of proteins. This dynamic nature can be computationally simulated with molecular dynamics (MD). Here, novel 3D dynamic protein descriptors (3DDPDs) were designed to be applied in bioactivity prediction tasks with PCM models. As a test case, publicly available G protein-coupled receptor (GPCR) MD data from GPCRmd was used. GPCRs are membrane-bound proteins, which are activated by hormones and neurotransmitters, and constitute an important target family for drug discovery. GPCRs exist in different conformational states that allow the transmission of diverse signals and that can be modified by ligand interactions, among other factors. To translate the MD-encoded protein dynamics two types of 3DDPDs were considered: one-hot encoded residue-specific (rs) and embedding-like protein-specific (ps) 3DDPDs. The descriptors were developed by calculating distributions of trajectory coordinates and partial charges, applying dimensionality reduction, and subsequently condensing them into vectors per residue or protein, respectively. 3DDPDs were benchmarked on several PCM tasks against state-of-the-art non-dynamic protein descriptors. Our rs- and ps3DDPDs outperformed non-dynamic descriptors in regression tasks using a temporal split and showed comparable performance with a random split and in all classification tasks. Combinations of non-dynamic descriptors with 3DDPDs did not result in increased performance. Finally, the power of 3DDPDs to capture dynamic fluctuations in mutant GPCRs was explored. The results presented here show the potential of including protein dynamic information on machine learning tasks, specifically bioactivity prediction, and open opportunities for applications in drug discovery, including oncology. Show less
Beerkens, B.L.H.; Snijders, I.M.; Snoeck, J.; Liu, R.; Tool, A.T.J.; Le Dévédec, S.E.; ... ; Es, D. van der 2023
The adenosine A3 receptor (A3AR) is a G protein-coupled receptor (GPCR) that exerts immunomodulatory effects in pathophysiological conditions such as inflammation and cancer. Thus far, studies... Show moreThe adenosine A3 receptor (A3AR) is a G protein-coupled receptor (GPCR) that exerts immunomodulatory effects in pathophysiological conditions such as inflammation and cancer. Thus far, studies toward the downstream effects of A3AR activation have yielded contradictory results, thereby motivating the need for further investigations. Various chemical and biological tools have been developed for this purpose, ranging from fluorescent ligands to antibodies. Nevertheless, these probes are limited by their reversible mode of binding, relatively large size, and often low specificity. Therefore, in this work, we have developed a clickable and covalent affinity-based probe (AfBP) to target the human A3AR. Herein, we show validation of the synthesized AfBP in radioligand displacement, SDS-PAGE, and confocal microscopy experiments as well as utilization of the AfBP for the detection of endogenous A3AR expression in flow cytometry experiments. Ultimately, this AfBP will aid future studies toward the expression and function of the A3AR in pathologies. Show less
Schoenmaker, L.; Béquignon, O.J.M.; Jespers, W.; Westen, G.J.P. van 2023
Generative deep learning models have emerged as a powerful approach for de novo drug design as they aid researchers in finding new molecules with desired properties. Despite continuous improvements... Show moreGenerative deep learning models have emerged as a powerful approach for de novo drug design as they aid researchers in finding new molecules with desired properties. Despite continuous improvements in the field, a subset of the outputs that sequence-based de novo generators produce cannot be progressed due to errors. Here, we propose to fix these invalid outputs post hoc. In similar tasks, transformer models from the field of natural language processing have been shown to be very effective. Therefore, here this type of model was trained to translate invalid Simplified Molecular-Input Line-Entry System (SMILES) into valid representations. The performance of this SMILES corrector was evaluated on four representative methods of de novo generation: a recurrent neural network (RNN), a target-directed RNN, a generative adversarial network (GAN), and a variational autoencoder (VAE). This study has found that the percentage of invalid outputs from these specific generative models ranges between 4 and 89%, with different models having different error-type distributions. Post hoc correction of SMILES was shown to increase model validity. The SMILES corrector trained with one error per input alters 60-90% of invalid generator outputs and fixes 35-80% of them. However, a higher error detection and performance was obtained for transformer models trained with multiple errors per input. In this case, the best model was able to correct 60-95% of invalid generator outputs. Further analysis showed that these fixed molecules are comparable to the correct molecules from the de novo generators based on novelty and similarity. Additionally, the SMILES corrector can be used to expand the amount of interesting new molecules within the targeted chemical space. Introducing different errors into existing molecules yields novel analogs with a uniqueness of 39% and a novelty of approximately 20%. The results of this research demonstrate that SMILES correction is a viable post hoc extension and can enhance the search for better drug candidates. Show less
Béquignon, O.J.M.; Bongers, B.J.; Jespers, W.; IJzerman, A.P.; Water, B. van de; Westen, G.J.P. van 2023
With the ongoing rapid growth of publicly available ligand-protein bioactivity data, there is a trove of valuable data that can be used to train a plethora of machine-learning algorithms. However,... Show moreWith the ongoing rapid growth of publicly available ligand-protein bioactivity data, there is a trove of valuable data that can be used to train a plethora of machine-learning algorithms. However, not all data is equal in terms of size and quality and a significant portion of researchers' time is needed to adapt the data to their needs. On top of that, finding the right data for a research question can often be a challenge on its own. To meet these challenges, we have constructed the Papyrus dataset. Papyrus is comprised of around 60 million data points. This dataset contains multiple large publicly available datasets such as ChEMBL and ExCAPE-DB combined with several smaller datasets containing high-quality data. The aggregated data has been standardised and normalised in a manner that is suitable for machine learning. We show how data can be filtered in a variety of ways and also perform some examples of quantitative structure-activity relationship analyses and proteochemometric modelling. Our ambition is that this pruned data collection constitutes a benchmark set that can be used for constructing predictive models, while also providing an accessible data source for research. Show less
Bongers, B.J.; Gorostiola González, M.; Wang, X.; Vlijmen, H. van; Jespers, W.; Gutiérrez-de-Terán, H.; ... ; Westen, G.J.P. van 2022
G Protein-coupled receptors (GPCRs) are the most frequently exploited drug target family, moreover they are often found mutated in cancer. Here we used a dataset of mutations found in patient... Show moreG Protein-coupled receptors (GPCRs) are the most frequently exploited drug target family, moreover they are often found mutated in cancer. Here we used a dataset of mutations found in patient samples derived from the Genomic Data Commons and compared it to the natural human variance as exemplified by data from the 1000 genomes project. We explored cancer-related mutation patterns in all GPCR classes combined and individually. While the location of the mutations across the protein domains did not differ significantly in the two datasets, a mutation enrichment in cancer patients was observed among class-specific conserved motifs in GPCRs such as the Class A "DRY" motif. A Two-Entropy Analysis confirmed the correlation between residue conservation and cancer-related mutation frequency. We subsequently created a ranking of high scoring GPCRs, using a multi-objective approach (Pareto Front Ranking). Our approach was confirmed by re-discovery of established cancer targets such as the LPA and mGlu receptor families, but also discovered novel GPCRs which had not been linked to cancer before such as the P2Y Receptor 10 (P2RY10). Overall, this study presents a list of GPCRs that are amenable to experimental follow up to elucidate their role in cancer. Show less
The adenosine A(2A) receptor (A(2A)AR) is a class A G-protein-coupled receptor (GPCR). It is an immune checkpoint in the tumor micro-environment and has become an emerging target for cancer... Show moreThe adenosine A(2A) receptor (A(2A)AR) is a class A G-protein-coupled receptor (GPCR). It is an immune checkpoint in the tumor micro-environment and has become an emerging target for cancer treatment. In this study, we aimed to explore the effects of cancer-patient-derived A(2A)AR mutations on ligand binding and receptor functions. The wild-type A(2A)AR and 15 mutants identified by Genomic Data Commons (GDC) in human cancers were expressed in HEK293T cells. Firstly, we found that the binding affinity for agonist NECA was decreased in six mutants but increased for the V275A mutant. Mutations A165V and A265V decreased the binding affinity for antagonist ZM241385. Secondly, we found that the potency of NECA (EC50) in an impedance-based cell-morphology assay was mostly correlated with the binding affinity for the different mutants. Moreover, S132L and H278N were found to shift the A(2A)AR towards the inactive state. Importantly, we found that ZM241385 could not inhibit the activation of V275A and P285L stimulated by NECA. Taken together, the cancer-associated mutations of A(2A)AR modulated ligand binding and receptor functions. This study provides fundamental insights into the structure-activity relationship of the A(2A)AR and provides insights for A(2A)AR-related personalized treatment in cancer. Show less
Overexpression of the adenosine A1 receptor (A1AR) has been detected in various cancer cell lines. However, the role of A1AR in tumor development is still unclear. Thirteen A1AR mutations were... Show moreOverexpression of the adenosine A1 receptor (A1AR) has been detected in various cancer cell lines. However, the role of A1AR in tumor development is still unclear. Thirteen A1AR mutations were identified in the Cancer Genome Atlas from cancer patient samples. We have investigated the pharmacology of the mutations located at the 7-transmembrane domain using a yeast system. Concentration-growth curves were obtained with the full agonist CPA and compared to the wild type hA1AR. H78L3.23 and S246T6.47 showed increased constitutive activity, while only the constitutive activity of S246T6.47 could be reduced to wild type levels by the inverse agonist DPCPX. Decreased constitutive activity was observed on five mutant receptors, among which A52V2.47 and W188C5.46 showed a diminished potency for CPA. Lastly, a complete loss of activation was observed in five mutant receptors. A selection of mutations was also investigated in a mammalian system, showing comparable effects on receptor activation as in the yeast system, except for residues pointing toward the membrane. Taken together, this study will enrich the view of the receptor structure and function of A1AR, enlightening the consequences of these mutations in cancer. Ultimately, this may provide an opportunity for precision medicine for cancer patients with pathological phenotypes involving these mutations. Show less
G protein-coupled receptors (GPCRs) are known to be involved in tumor progression and metastasis. The adenosine A1 receptor (A1 AR) has been detected to be over-expressed in various cancer cell... Show moreG protein-coupled receptors (GPCRs) are known to be involved in tumor progression and metastasis. The adenosine A1 receptor (A1 AR) has been detected to be over-expressed in various cancer cell lines. However, the role of A1 AR in tumor development is not yet well characterized. A series of A1 AR mutations were identified in the Cancer Genome Atlas from cancer patient samples. In this study, we have investigated the pharmacology of mutations located outside of the 7-transmembrane domain by using a "single-GPCR-one-G protein" yeast system. Concentration-growth curves were obtained with the full agonist CPA for 12 mutant receptors and compared to the wild-type hA1 AR. Most mutations located at the extracellular loops (EL) reduced the levels of constitutive activity of the receptor and agonist potency. For mutants at the intracellular loops (ILs) of the receptor, an increased constitutive activity was found for mutant receptor L211R5.69 , while a decreased constitutive activity and agonist response were found for mutant receptor L113F34.51 . Lastly, mutations identified on the C-terminus did not significantly influence the pharmacological function of the receptor. A selection of mutations was also investigated in a mammalian system. Overall, similar effects on receptor activation compared to the yeast system were found with mutations located at the EL, but some contradictory effects were observed for mutations located at the IL. Taken together, this study will enrich the insight of A1 AR structure and function, enlightening the consequences of these mutations in cancer. Ultimately, this may provide potential precision medicine in cancer treatment. Show less
Beerkens, B.L.H.; Wang, X.; Avgeropoulou, M.; Adistia, L.N.; Veldhoven, J.P.D. van; Jespers, W.; ... ; Es, D. van der 2022
Transmembranal G Protein-Coupled Receptors (GPCRs) transduce extracellular chemical signals to the cell, via conformational change from a resting (inactive) to an active (canonically bound to a G... Show moreTransmembranal G Protein-Coupled Receptors (GPCRs) transduce extracellular chemical signals to the cell, via conformational change from a resting (inactive) to an active (canonically bound to a G-protein) conformation. Receptor activation is normally modulated by extracellular ligand binding, but mutations in the receptor can also shift this equilibrium by stabilizing different conformational states. In this work, we built structure-energetic relationships of receptor activation based on original thermodynamic cycles that represent the conformational equilibrium of the prototypical A2A adenosine receptor (AR). These cycles were solved with efficient free energy perturbation (FEP) protocols, allowing to distinguish the pharmacological profile of different series of A2AAR agonists with different efficacies. The modulatory effects of point mutations on the basal activity of the receptor or on ligand efficacies could also be detected. This methodology can guide GPCR ligand design with tailored pharmacological properties, or allow the identification of mutations that modulate receptor activation with potential clinical implications. Show less
Nicotinamide N-methyltransferase (NNMT) methylates nicotinamide (vitamin B3) to generate 1-methylnicotinamide (MNA). NNMT overexpression has been linked to a variety of diseases, most prominently... Show moreNicotinamide N-methyltransferase (NNMT) methylates nicotinamide (vitamin B3) to generate 1-methylnicotinamide (MNA). NNMT overexpression has been linked to a variety of diseases, most prominently human cancers, indicating its potential as a therapeutic target. The development of small-molecule NNMT inhibitors has gained interest in recent years, with the most potent inhibitors sharing structural features based on elements of the nicotinamide substrate and the S-adenosyl-l-methionine (SAM) cofactor. We here report the development of new bisubstrate inhibitors that include electron-deficient aromatic groups to mimic the nicotinamide moiety. In addition, a trans-alkene linker was found to be optimal for connecting the substrate and cofactor mimics in these inhibitors. The most potent NNMT inhibitor identified exhibits an IC50 value of 3.7 nM, placing it among the most active NNMT inhibitors reported to date. Complementary analytical techniques, modeling studies, and cell-based assays provide insights into the binding mode, affinity, and selectivity of these inhibitors. Show less
The four adenosine receptors (ARs) A1AR, A2AAR, A2BAR, and A3AR are G protein-coupled receptors (GPCRs) for which an exceptional amount of experimental and structural data is available. Still,... Show moreThe four adenosine receptors (ARs) A1AR, A2AAR, A2BAR, and A3AR are G protein-coupled receptors (GPCRs) for which an exceptional amount of experimental and structural data is available. Still, limited success has been achieved in getting new chemical modulators on the market. As such, there is a clear interest in the design of novel selective chemical entities for this family of receptors. In this work, we investigate the selective recognition of ISAM-140, a recently reported A2BAR reference antagonist. A combination of semipreparative chiral HPLC, circular dichroism and X-ray crystallography was used to separate and unequivocally assign the configuration of each enantiomer. Subsequently affinity evaluation for both A2A and A2B receptors demonstrate the stereospecific and selective recognition of (S)-ISAM140 to the A2BAR. The molecular modeling suggested that the structural determinants of this selectivity profile would be residue V2506.51 in A2BAR, which is a leucine in all other ARs including the closely related A2AAR. This was herein confirmed by radioligand binding assays and rigorous free energy perturbation (FEP) calculations performed on the L249V6.51 mutant A2AAR receptor. Taken together, this study provides further insights in the binding mode of these A2BAR antagonists, paving the way for future ligand optimization. Show less
In an attempt to exploit the hydrolytic mechanism by which β-lactamases degrade cephalosporins, we designed and synthesized a series of novel cephalosporin prodrugs aimed at delivering thiol-based... Show moreIn an attempt to exploit the hydrolytic mechanism by which β-lactamases degrade cephalosporins, we designed and synthesized a series of novel cephalosporin prodrugs aimed at delivering thiol-based inhibitors of metallo-β-lactamases (MBLs) in a spatiotemporally controlled fashion. While enzymatic hydrolysis of the β-lactam ring was observed, it was not accompanied by inhibitor release. Nonetheless, the cephalosporin prodrugs, especially thiomandelic acid conjugate (8), demonstrated potent inhibition of IMP-type MBLs. In addition, conjugate 8 was also found to greatly reduce the minimum inhibitory concentration of meropenem against IMP-producing bacteria. The results of kinetic experiments indicate that these prodrugs inhibit IMP-type MBLs by acting as slowly turned-over substrates. Structure–activity relationship studies revealed that both phenyl and carboxyl moieties of 8 are crucial for its potency. Furthermore, modeling studies indicate that productive interactions of the thiomandelic acid moiety of 8 with Trp28 within the IMP active site may contribute to its potency and selectivity. Show less
TrendsRecent technological advances in membrane protein crystallization have resulted in a nearly exponential increase of available receptor structures. The AR family is an important example in... Show moreTrendsRecent technological advances in membrane protein crystallization have resulted in a nearly exponential increase of available receptor structures. The AR family is an important example in this respect. Crystal structures of antagonist- and agonist-bound adenosine A2A receptor have recently been supplemented by a fully activated conformation in complex with a G-protein mimic, and by antagonist bound structures of the A1 receptor.SDM experiments have been essential to identify residues involved in molecular interactions between ARs and their ligands. Leveraging on recent crystal structures, this vast amount of data can now be systematically classified and interconnected with chemical and structural information of ligands and receptors.The mapping of mutational data onto crystal structures provides new understanding of molecular interactions involved in ligand recognition. Together with computational modeling, this can be used as a roadmap to create novel hypotheses and assist in the design of more systematic mutagenesis studies to answer remaining structural and functional questions.The four adenosine receptors (ARs), A1, A2A, A2B, and A3, constitute a subfamily of G protein-coupled receptors (GPCRs) with exceptional foundations for structure-based ligand design. The vast amount of mutagenesis data, accumulated in the literature since the 1990s, has been recently supplemented with structural information, currently consisting of several inactive and active structures of the A2A and inactive conformations of the A1 ARs. We provide the first integrated view of the pharmacological, biochemical, and structural data available for this receptor family, by mapping onto the relevant crystal structures all site-directed mutagenesis data, curated and deposited at the GPCR database (available through http://www.gpcrdb.org). This analysis provides novel insights into ligand binding, allosteric modulation, and signaling of the AR family.Keywords: G protein-coupled receptor, adenosine receptor, mutagenesis, chemical modulationShow less