We evaluate the shared genetic regulation of mRNA molecules, proteins and metabolites derived from whole blood from 3029 human donors. We find abundant allelic heterogeneity, where multiple variants regulate a particular molecular phenotype, and pleiotropy, where a single variant associates with multiple molecular phenotypes over multiple genomic regions. The highest proportion of shared genetic regulation is detected between gene expression and proteins (66.6%), with further median shared genetic associations across 49 different tissues of 78.3% and 62.4% between plasma proteins and gene expression. We represent the genetic and molecular associations in networks including 2828 known GWAS variants, showing that GWAS variants are more often connected to gene expression in trans than other molecular phenotypes in the network. Our work provides a roadmap to understanding molecular networks and deriving the underlying mechanism of action of GWAS variants using different molecular phenotypes in an accessible tissue.
The application of multiple omics technologies in biomedical cohorts has the potential to reveal patient-level disease characteristics and individualized response to treatment. However, the scale and heterogeneous nature of multi-modal data make integration and inference a non-trivial task. We developed a deep-learning-based framework, multi-omics variational autoencoders (MOVE), to integrate such data and applied it to a cohort of 789 people with newly diagnosed type 2 diabetes with deep multi-omics phenotyping from the DIRECT consortium. Using in silico perturbations, we identified drug-omics associations across the multi-modal datasets for the 20 most prevalent drugs given to people with type 2 diabetes with substantially higher sensitivity than univariate statistical tests. Among these, we identified novel associations between metformin and the gut microbiota as well as opposite molecular responses for the two statins, simvastatin and atorvastatin. We used the associations to quantify drug-drug similarities, assess the degree of polypharmacy and conclude that drug effects are distributed across the multi-omics modalities.
Background: NAFLD affects nearly 25% of the global population. Cardiovascular disease (CVD) is the most common cause of death among patients with NAFLD, in line with highly prevalent dyslipidemia in this population. Increased plasma triglyceride (TG)-rich lipoprotein (TRL) concentrations, an important risk factor for CVD, are closely linked with hepatic TG content. Therefore, it is of great interest to identify regulatory mechanisms of hepatic TRL production and remnant uptake in the setting of hepatic steatosis. Approach and Results: To identify liver-regulated pathways linking intrahepatic and plasma TG metabolism, we performed transcriptomic analysis of liver biopsies from two independent cohorts of obese patients. Hepatic expression of the gene encoding apolipoprotein F (APOF) showed the fourth-strongest negative correlation with hepatic steatosis and the strongest negative correlation with plasma TG levels. The effects of adenoviral-mediated human ApoF (hApoF) overexpression on plasma and hepatic TG were assessed in C57BL6/J mice. Surprisingly, hApoF overexpression increased both hepatic very low density lipoprotein (VLDL)-TG secretion and hepatic lipoprotein remnant clearance, associated with a ~25% reduction in plasma TG levels. Conversely, reducing endogenous ApoF expression reduced VLDL secretion in vivo, and reduced hepatocyte VLDL uptake by ~15% in vitro.
Transcriptomic analysis of APOF-overexpressing mouse livers revealed a gene signature related to enhanced ApoB-lipoprotein clearance, including increased expression of Ldlr and Lrp1, among others. Conclusion: These data reveal a previously undescribed role for ApoF in the control of plasma and hepatic lipoprotein metabolism by favoring VLDL-TG secretion and hepatic lipoprotein remnant particle clearance.
Atabaki-Pasdar, N.; Ohlsson, M.; Vinuela, A.; Frau, F.; Pomares-Millan, H.; Haid, M.; ... ; Franks, P.W. 2020
Background: Non-alcoholic fatty liver disease (NAFLD) is highly prevalent and causes serious health complications in individuals with and without type 2 diabetes (T2D). Early diagnosis of NAFLD is important, as this can help prevent irreversible damage to the liver and, ultimately, hepatocellular carcinomas. We sought to expand etiological understanding and develop a diagnostic tool for NAFLD using machine learning. Methods and findings: We utilized the baseline data from IMI DIRECT, a multicenter prospective cohort study of 3,029 European-ancestry adults recently diagnosed with T2D (n = 795) or at high risk of developing the disease (n = 2,234). Multi-omics (genetic, transcriptomic, proteomic, and metabolomic) and clinical (liver enzymes and other serological biomarkers, anthropometry, measures of beta-cell function, insulin sensitivity, and lifestyle) data comprised the key input variables. The models were trained on MRI-image-derived liver fat content (<5% or >=5%) available for 1,514 participants. We applied LASSO (least absolute shrinkage and selection operator) to select features from the different layers of omics data and random forest analysis to develop the models. The prediction models included clinical and omics variables separately or in combination. A model including all omics and clinical variables yielded a cross-validated receiver operating characteristic area under the curve (ROCAUC) of 0.84 (95% CI 0.82, 0.86; p < 0.001), which compared with a ROCAUC of 0.82 (95% CI 0.81, 0.83; p < 0.001) for a model including 9 clinically accessible variables. The IMI DIRECT prediction models outperformed existing noninvasive NAFLD prediction tools.
One limitation is that these analyses were performed in adults of European ancestry residing in northern Europe, and it is unknown how well these findings will translate to people of other ancestries and those exposed to environmental risk factors that differ from those of the present cohort. Another key limitation of this study is that the prediction was done on a binary outcome of liver fat quantity (<5% or >=5%) rather than a continuous one. Conclusions: In this study, we developed several models with different combinations of clinical and omics data and identified biological features that appear to be associated with liver fat accumulation. In general, the clinical variables showed better prediction ability than the complex omics variables. However, the combination of omics and clinical variables yielded the highest accuracy. We have incorporated the developed clinical models into a web interface (see:) and made it available to the community.