Compounds that are candidates for drug repurposing can be ranked by leveraging knowledge available in the biomedical literature and databases. This knowledge, spread across a variety of sources,... Show moreCompounds that are candidates for drug repurposing can be ranked by leveraging knowledge available in the biomedical literature and databases. This knowledge, spread across a variety of sources, can be integrated within a knowledge graph, which thereby comprehensively describes known relationships between biomedical concepts, such as drugs, diseases, genes, etc. Our work uses the semantic information between drug and disease concepts as features, which are extracted from an existing knowledge graph that integrates 200 different biological knowledge sources. RepoDB, a standard drug repurposing database which describes drug-disease combinations that were approved or that failed in clinical trials, is used to train a random forest classifier. The 10-times repeated 10-fold cross-validation performance of the classifier achieves a mean area under the receiver operating characteristic curve (AUC) of 92.2%. We apply the classifier to prioritize 21 preclinical drug repurposing candidates that have been suggested for Autosomal Dominant Polycystic Kidney Disease (ADPKD). Mozavaptan, a vasopressin V2 receptor antagonist is predicted to be the drug most likely to be approved after a clinical trial, and belongs to the same drug class as tolvaptan, the only treatment for ADPKD that is currently approved. We conclude that semantic properties of concepts in a knowledge graph can be exploited to prioritize drug repurposing candidates for testing in clinical trials. Show less
Medication for nonalcoholic fatty liver disease (NAFLD) is an unmet need. Glucocorticoid (GC) stress hormones drive fat metabolism in the liver, but both full blockade and full stimulation of GC... Show moreMedication for nonalcoholic fatty liver disease (NAFLD) is an unmet need. Glucocorticoid (GC) stress hormones drive fat metabolism in the liver, but both full blockade and full stimulation of GC signaling aggravate NAFLD pathology. We investigated the efficacy of selective glucocorticoid receptor (GR) modulator CORT118335, which recapitulates only a subset of GC actions, in reducing liver lipid accumulation in mice. Male C57BL/6J mice received a low-fat diet or high-fat diet mixed with vehicle or CORT118335. Livers were analyzed histologically and for genome-wide mRNA expression. Functionally, hepatic long-chain fatty acid (LCFA) composition was determined by gas chromatography. We determined very-low-density lipoprotein (VLDL) production by treatment with a lipoprotein lipase inhibitor after which blood was collected to isolate radiolabeled VLDL particles and apoB proteins. CORT118335 strongly prevented and reversed hepatic lipid accumulation. Liver transcriptome analysis showed increased expression of GR target genes involved in VLDL production. Accordingly, CORT118335 led to increased lipidation of VLDL particles, mimicking physiological GC action. Independent pathway analysis revealed that CORT118335 lacked induction of GC-responsive genes involved in cholesterol synthesis and LCFA uptake, which was indeed reflected in unaltered hepatic LCFA uptake in vivo. Our data thus reveal that the robust hepatic lipid-lowering effect of CORT118335 is due to a unique combination of GR-dependent stimulation of lipid (VLDL) efflux from the liver, with a lack of stimulation of GR-dependent hepatic fatty acid uptake. Our findings firmly demonstrate the potential use of CORT118335 in the treatment of NAFLD and underscore the potential of selective GR modulation in metabolic disease. Show less
Toonen, L.J.A.; Overzier, M.; Evers, M.M.; Leon, L.G.; Zeeuw, S.A.J. van der; Mei, H.L.; ... ; Roon-Mom, W.M.C. van 2018
Background: Spinocerebellar ataxia type 3 (SCA3) is a progressive neurodegenerative disorder caused by expansion of the polyglutamine repeat in the ataxin-3 protein. Expression of mutant ataxin-3... Show moreBackground: Spinocerebellar ataxia type 3 (SCA3) is a progressive neurodegenerative disorder caused by expansion of the polyglutamine repeat in the ataxin-3 protein. Expression of mutant ataxin-3 is known to result in transcriptional dysregulation, which can contribute to the cellular toxicity and neurodegeneration. Since the exact causative mechanisms underlying this process have not been fully elucidated, gene expression analyses in brains of transgenic SCA3 mouse models may provide useful insights.Methods: Here we characterised the MJD84.2 SCA3 mouse model expressing the mutant human ataxin-3 gene using a multi-omics approach on brain and blood. Gene expression changes in brainstem, cerebellum, striatum and cortex were used to study pathological changes in brain, while blood gene expression and metabolites/lipids levels were examined as potential biomarkers for disease.Results: Despite normal motor performance at 17.5 months of age, transcriptional changes in brain tissue of the SCA3 mice were observed. Most transcriptional changes occurred in brainstem and striatum, whilst cerebellum and cortex were only modestly affected. The most significantly altered genes in SCA3 mouse brain were Tmc3, Zfp488, Cart, and Chdh. Based on the transcriptional changes, a-adrenergic and CREB pathways were most consistently altered for combined analysis of the four brain regions. When examining individual brain regions, axon guidance and synaptic transmission pathways were most strongly altered in striatum, whilst brainstem presented with strongest alterations in the pi-3 k cascade and cholesterol biosynthesis pathways. Similar to other neurodegenerative diseases, reduced levels of tryptophan and increased levels of ceramides, di- and triglycerides were observed in SCA3 mouse blood.Conclusions: The observed transcriptional changes in SCA3 mouse brain reveal parallels with previous reported neuropathology in patients, but also shows brain region specific effects as well as involvement of adrenergic signalling and CREB pathway changes in SCA3. Importantly, the transcriptional changes occur prior to onset of motor- and coordination deficits. Show less
Mastrokolias, A.; Pool, R.; Mina, E.; Hettne, K.M.; Duijn, E. van; Mast, R.C. van der; ... ; Roon-Mom, W. van 2016
Introduction Metabolic changes have been frequently associated with Huntington's disease (HD). At the same time peripheral blood represents aminimally invasive sampling avenue with little distress... Show moreIntroduction Metabolic changes have been frequently associated with Huntington's disease (HD). At the same time peripheral blood represents aminimally invasive sampling avenue with little distress to Huntington's disease patients especially when brain or other tissue samples are difficult to collect.Objectives We investigated the levels of 163 metabolites in HD patient and control serum samples in order to identify disease related changes. Additionally, we integrated the metabolomics data with our previously published next generation sequencing-based gene expression data from the same patients in order to interconnect the metabolomics changes with transcriptional alterations. Methods This analysis was performed using targeted metabolomics and flow injection electrospray ionization tandem mass spectrometry in 133 serum samples from 97 Huntington's disease patients (29 pre-symptomatic and 68 symptomatic) and 36 controls.Results By comparing HD mutation carriers with controls we identified 3 metabolites significantly changed in HD (serine and threonine and one phosphatidylcholine-PC ae C36:0) and an additional 8 phosphatidylcholines (PC aa C38:6, PC aa C36:0, PC ae C38:0, PC aa C38:0, PC ae C38:6, PC ae C42:0, PC aa C36:5 and PC ae C36:0) that exhibited a significant association with disease severity. Using workflow based exploitation of pathway databases and by integrating our metabolomics data with our gene expression data from the same patients we identified 4 deregulated phosphatidylcholine metabolism related genes (ALDH1B1, MBOAT1, MTRR and PLB1) that showed significant association with the changes in metabolite concentrations.Conclusion Our results support the notion that phosphatidylcholine metabolism is deregulated in HD blood and that these metabolite alterations are associated with specific gene expression changes. Show less
Mina, E.; Roon-Mom, W. van; Hettne, K.M.; Zwet, E. van; Goeman, J.; Neri, C.; ... ; Roos, M. 2016
Background: Huntington's disease (HD) is a devastating brain disorder with no effective treatment or cure available. The scarcity of brain tissue makes it hard to study changes in the brain and... Show moreBackground: Huntington's disease (HD) is a devastating brain disorder with no effective treatment or cure available. The scarcity of brain tissue makes it hard to study changes in the brain and impossible to perform longitudinal studies. However, peripheral pathology in HD suggests that it is possible to study the disease using peripheral tissue as a monitoring tool for disease progression and/or efficacy of novel therapies. In this study, we investigated if blood can be used to monitor disease severity and progression in brain. Since previous attempts using only gene expression proved unsuccessful, we compared blood and brain Huntington's disease signatures in a functional context.Methods: Microarray HD gene expression profiles from three brain regions were compared to the transcriptome of HD blood generated by next generation sequencing. The comparison was performed with a combination of weighted gene co-expression network analysis and literature based functional analysis (Concept Profile Analysis). Uniquely, our comparison of blood and brain datasets was not based on (the very limited) gene overlap but on the similarity between the gene annotations in four different semantic categories: "biological process", "cellular component", "molecular function" and "disease or syndrome".Results: We identified signatures in HD blood reflecting a broad pathophysiological spectrum, including alterations in the immune response, sphingolipid biosynthetic processes, lipid transport, cell signaling, protein modification, spliceosome, RNA splicing, vesicle transport, cell signaling and synaptic transmission. Part of this spectrum was reminiscent of the brain pathology. The HD signatures in caudate nucleus and BA4 exhibited the highest similarity with blood, irrespective of the category of semantic annotations used. BA9 exhibited an intermediate similarity, while cerebellum had the least similarity. We present two signatures that were shared between blood and brain: immune response and spinocerebellar ataxias.Conclusions: Our results demonstrate that HD blood exhibits dysregulation that is similar to brain at a functional level, but not necessarily at the level of individual genes. We report two common signatures that can be used to monitor the pathology in brain of HD patients in a non-invasive manner. Our results are an exemplar of how signals in blood data can be used to represent brain disorders. Our methodology can be used to study disease specific signatures in diseases where heterogeneous tissues are involved in the pathology. Show less
Background: One of the main challenges for biomedical research lies in the computer-assisted integrative study of large and increasingly complex combinations of data in order to understand... Show moreBackground: One of the main challenges for biomedical research lies in the computer-assisted integrative study of large and increasingly complex combinations of data in order to understand molecular mechanisms. The preservation of the materials and methods of such computational experiments with clear annotations is essential for understanding an experiment, and this is increasingly recognized in the bioinformatics community. Our assumption is that offering means of digital, structured aggregation and annotation of the objects of an experiment will provide necessary meta-data for a scientist to understand and recreate the results of an experiment. To support this we explored a model for the semantic description of a workflow-centric Research Object (RO), where an RO is defined as a resource that aggregates other resources, e. g., datasets, software, spreadsheets, text, etc. We applied this model to a case study where we analysed human metabolite variation by workflows.Results: We present the application of the workflow-centric RO model for our bioinformatics case study. Three workflows were produced following recently defined Best Practices for workflow design. By modelling the experiment as an RO, we were able to automatically query the experiment and answer questions such as "which particular data was input to a particular workflow to test a particular hypothesis?", and "which particular conclusions were drawn from a particular workflow?".Conclusions: Applying a workflow-centric RO model to aggregate and annotate the resources used in a bioinformatics experiment, allowed us to retrieve the conclusions of the experiment in the context of the driving hypothesis, the executed workflows and their input data. The RO model is an extendable reference model that can be used by other systems as well.Availability: The Research Object is available at http://www.myexperiment.org/packs/428 The Wf4Ever Research Object Model is available at http://wf4ever.github.io/ro Show less
Background: Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) that associate with clinical phenotypes, but these SNPs usually explain just a... Show moreBackground: Genome-wide association studies (GWAS) have identified many common single nucleotide polymorphisms (SNPs) that associate with clinical phenotypes, but these SNPs usually explain just a small part of the heritability and have relatively modest effect sizes. In contrast, SNPs that associate with metabolite levels generally explain a higher percentage of the genetic variation and demonstrate larger effect sizes. Still, the discovery of SNPs associated with metabolite levels is challenging since testing all metabolites measured in typical metabolomics studies with all SNPs comes with a severe multiple testing penalty. We have developed an automated workflow approach that utilizes prior knowledge of biochemical pathways present in databases like KEGG and BioCyc to generate a smaller SNP set relevant to the metabolite. This paper explores the opportunities and challenges in the analysis of GWAS of metabolomic phenotypes and provides novel insights into the genetic basis of metabolic variation through the re-analysis of published GWAS datasets.Results: Re-analysis of the published GWAS dataset from Illig et al. (Nature Genetics, 2010) using a pathway-based workflow (http://www.myexperiment.org/packs/319.html), confirmed previously identified hits and identified a new locus of human metabolic individuality, associating Aldehyde dehydrogenase family1 L1 (ALDH1L1) with serine/glycine ratios in blood. Replication in an independent GWAS dataset of phospholipids (Demirkan et al., PLoS Genetics, 2012) identified two novel loci supported by additional literature evidence: GPAM (Glycerol-3 phosphate acyltransferase) and CBS (Cystathionine beta-synthase). In addition, the workflow approach provided novel insight into the affected pathways and relevance of some of these gene-metabolite pairs in disease development and progression.Conclusions: We demonstrate the utility of automated exploitation of background knowledge present in pathway databases for the analysis of GWAS datasets of metabolomic phenotypes. We report novel loci and potential biochemical mechanisms that contribute to our understanding of the genetic basis of metabolic variation and its relationship to disease development and progression. Show less
Most methods for the interpretation of gene expression profiling experiments rely on the categorization of genes, as provided by the Gene Ontology (GO) and pathway databases. Due to the manual... Show moreMost methods for the interpretation of gene expression profiling experiments rely on the categorization of genes, as provided by the Gene Ontology (GO) and pathway databases. Due to the manual curation process, such databases are never up-to-date and tend to be limited in focus and coverage. Automated literature mining tools provide an attractive, alternative approach. We review how they can be employed for the interpretation of gene expression profiling experiments. We illustrate that their comprehensive scope aids the interpretation of data from domains poorly covered by GO or alternative databases, and allows for the linking of gene expression with diseases, drugs, tissues and other types of concepts. A framework for proper statistical evaluation of the associations between gene expression values and literature concepts was lacking and is now implemented in a weighted extension of global test. The weights are the literature association scores and reflect the importance of a gene for the concept of interest. In a direct comparison with classical GO-based gene sets, we show that use of literature-based associations results in the identification of much more specific GO categories. We demonstrate the possibilities for linking of gene expression data to patient survival in breast cancer and the action and metabolism of drugs. Coupling with online literature mining tools ensures transparency and allows further study of the identified associations. Literature mining tools are therefore powerful additions to the toolbox for the interpretation of high-throughput genomics data. Show less