We evaluate the shared genetic regulation of mRNA molecules, proteins and metabolites derived from whole blood from 3029 human donors. We find abundant allelic heterogeneity, where multiple variants regulate a particular molecular phenotype, and pleiotropy, where a single variant associates with multiple molecular phenotypes over multiple genomic regions. The highest proportion of shared genetic regulation is detected between gene expression and proteins (66.6%), with further median shared genetic associations across 49 different tissues of 78.3% and 62.4% between plasma proteins and gene expression. We represent the genetic and molecular associations in networks including 2828 known GWAS variants, showing that GWAS variants are more often connected to gene expression in trans than other molecular phenotypes in the network. Our work provides a roadmap to understanding molecular networks and deriving the underlying mechanism of action of GWAS variants using different molecular phenotypes in an accessible tissue.
We identify biomarkers for disease progression in three type 2 diabetes cohorts encompassing 2,973 individuals across three molecular classes, metabolites, lipids and proteins. Homocitrulline, isoleucine and 2-aminoadipic acid, eight triacylglycerol species, and lowered sphingomyelin 42:2;2 levels are predictive of faster progression towards insulin requirement. Of ~1,300 proteins examined in two cohorts, levels of GDF15/MIC-1, IL-18Ra, CRELD1, NogoR, FAS, and ENPP7 are associated with faster progression, whilst SMAC/DIABLO, SPOCK1 and HEMK2 predict lower progression rates. In an external replication, proteins and lipids are associated with diabetes incidence and prevalence. NogoR/RTN4R injection improved glucose tolerance in high fat-fed male mice but impaired it in male db/db mice. High NogoR levels led to islet cell apoptosis, and IL-18R antagonised inflammatory IL-18 signalling towards nuclear factor kappa-B in vitro. This comprehensive, multi-disciplinary approach thus identifies biomarkers with potential prognostic utility, provides evidence for possible disease mechanisms, and identifies potential therapeutic avenues to slow diabetes progression.
Type 2 diabetes is a multifactorial disease with multiple underlying aetiologies. To address this heterogeneity, investigators of a previous study clustered people with diabetes according to five diabetes subtypes. The aim of the current study is to investigate the aetiology of these clusters by comparing their molecular signatures. In three independent cohorts, in total 15,940 individuals were clustered based on five clinical characteristics. In a subset, genetic (N = 12,828), metabolomic (N = 2,945), lipidomic (N = 2,593), and proteomic (N = 1,170) data were obtained in plasma. For each data type, each cluster was compared with the other four clusters as the reference. The insulin-resistant cluster showed the most distinct molecular signature, with higher branched-chain amino acid, diacylglycerol, and triacylglycerol levels; its aberrant protein levels in plasma were enriched for proteins in the intracellular PI3K/Akt pathway. The obese cluster showed higher levels of cytokines. The mild diabetes cluster with high HDL showed the most beneficial molecular profile, with effects opposite to those seen in the insulin-resistant cluster. This study shows that clustering people with type 2 diabetes can identify underlying molecular mechanisms related to pancreatic islets, liver, and adipose tissue metabolism. This provides novel biological insights into the diverse aetiological processes that would not be evident when type 2 diabetes is viewed as a homogeneous disease.
Aims/hypothesis: Five clusters based on clinical characteristics have been suggested as diabetes subtypes: one autoimmune and four subtypes of type 2 diabetes. In the current study we replicate and cross-validate these type 2 diabetes clusters in three large cohorts using variables readily measured in the clinic. Methods: In three independent cohorts, in total 15,940 individuals were clustered based on age, BMI, HbA1c, random or fasting C-peptide, and HDL-cholesterol. Clusters were cross-validated against the original clusters based on HOMA measures. In addition, between cohorts, clusters were cross-validated by re-assigning people based on each cohort's cluster centres. Finally, we compared the time to insulin requirement for each cluster. Results: Five distinct type 2 diabetes clusters were identified and mapped back to the original four All New Diabetics in Scania (ANDIS) clusters. Using C-peptide and HDL-cholesterol instead of HOMA2-B and HOMA2-IR, three of the clusters mapped with high sensitivity (80.6-90.7%) to the previously identified severe insulin-deficient diabetes (SIDD), severe insulin-resistant diabetes (SIRD) and mild obesity-related diabetes (MOD) clusters. The previously described ANDIS mild age-related diabetes (MARD) cluster could be mapped to the two milder groups in our study: one characterised by high HDL-cholesterol (mild diabetes with high HDL-cholesterol [MDH] cluster), and the other not having any extreme characteristic (mild diabetes [MD]). When these two milder groups were combined, they mapped well to the previously labelled MARD cluster (sensitivity 79.1%). In the cross-validation between cohorts, particularly the SIDD and MDH clusters cross-validated well, with sensitivities ranging from 73.3% to 97.1%.
SIRD and MD showed a lower sensitivity, ranging from 36.1% to 92.3%, where individuals shifted from SIRD to MD and vice versa. People belonging to the SIDD cluster showed the fastest progression towards insulin requirement, while the MDH cluster showed the slowest progression. Conclusions/interpretation: Clusters based on C-peptide instead of HOMA2 measures resemble those based on HOMA2 measures, especially for SIDD, SIRD and MOD. By adding HDL-cholesterol, the MARD cluster based upon HOMA2 measures resulted in the current clustering into two clusters, with one cluster having high HDL levels. Cross-validation between cohorts showed generally good resemblance between cohorts. Together, our results show that the clustering based on clinical variables readily measured in the clinic (age, HbA1c, HDL-cholesterol, BMI and C-peptide) results in informative clusters that are representative of the original ANDIS clusters and stable across cohorts. Adding HDL-cholesterol to the clustering resulted in the identification of a cluster with very slow glycaemic deterioration.
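The between-cohort cross-validation described in this abstract (re-assigning people using another cohort's cluster centres, then reporting per-cluster sensitivity) can be illustrated with a minimal sketch. It is not the authors' code: the use of Euclidean distance on standardised variables, the function names `nearest_centre` and `sensitivity_per_cluster`, and all numeric values are assumptions made for illustration only.

```python
import math


def nearest_centre(point, centres):
    """Return the label of the centre closest to `point` (Euclidean distance)."""
    return min(centres, key=lambda label: math.dist(point, centres[label]))


def sensitivity_per_cluster(points, own_labels, other_centres):
    """Fraction of each cluster's members that the other cohort's centres
    assign to the same cluster label."""
    hits, totals = {}, {}
    for point, label in zip(points, own_labels):
        totals[label] = totals.get(label, 0) + 1
        if nearest_centre(point, other_centres) == label:
            hits[label] = hits.get(label, 0) + 1
    return {label: hits.get(label, 0) / totals[label] for label in totals}


# Made-up standardised values for two variables (assumption, not study data).
cohort_a_points = [(-1.0, 0.2), (-0.8, -0.1), (1.1, 0.9), (0.9, 1.2), (0.1, 0.1)]
cohort_a_labels = ["SIDD", "SIDD", "MDH", "MDH", "MDH"]
# Hypothetical cluster centres taken from a second cohort.
cohort_b_centres = {"SIDD": (-0.9, 0.0), "MDH": (1.0, 1.0)}

sens = sensitivity_per_cluster(cohort_a_points, cohort_a_labels, cohort_b_centres)
print(sens)
```

In this toy input one MDH member lies closer to the other cohort's SIDD centre, so MDH sensitivity falls below 1 while SIDD stays at 1. This is the kind of shift between neighbouring clusters that the abstract reports for SIRD and MD.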
Atabaki-Pasdar, N.; Ohlsson, M.; Vinuela, A.; Frau, F.; Pomares-Millan, H.; Haid, M.; ... ; Franks, P.W. 2020
Background: Non-alcoholic fatty liver disease (NAFLD) is highly prevalent and causes serious health complications in individuals with and without type 2 diabetes (T2D). Early diagnosis of NAFLD is important, as this can help prevent irreversible damage to the liver and, ultimately, hepatocellular carcinomas. We sought to expand etiological understanding and develop a diagnostic tool for NAFLD using machine learning. Methods and findings: We utilized the baseline data from IMI DIRECT, a multicenter prospective cohort study of 3,029 European-ancestry adults recently diagnosed with T2D (n = 795) or at high risk of developing the disease (n = 2,234). Multi-omics (genetic, transcriptomic, proteomic, and metabolomic) and clinical (liver enzymes and other serological biomarkers, anthropometry, measures of beta-cell function, insulin sensitivity, and lifestyle) data comprised the key input variables. The models were trained on MRI-image-derived liver fat content (<5% or ≥5%) available for 1,514 participants. We applied LASSO (least absolute shrinkage and selection operator) to select features from the different layers of omics data and random forest analysis to develop the models. The prediction models included clinical and omics variables separately or in combination. A model including all omics and clinical variables yielded a cross-validated receiver operating characteristic area under the curve (ROCAUC) of 0.84 (95% CI 0.82, 0.86; p < 0.001), which compared with a ROCAUC of 0.82 (95% CI 0.81, 0.83; p < 0.001) for a model including 9 clinically accessible variables. The IMI DIRECT prediction models outperformed existing noninvasive NAFLD prediction tools.
One limitation is that these analyses were performed in adults of European ancestry residing in northern Europe, and it is unknown how well these findings will translate to people of other ancestries and to those exposed to environmental risk factors that differ from those of the present cohort. Another key limitation of this study is that the prediction was done on a binary outcome of liver fat quantity (<5% or ≥5%) rather than a continuous one. Conclusions: In this study, we developed several models with different combinations of clinical and omics data and identified biological features that appear to be associated with liver fat accumulation. In general, the clinical variables showed better prediction ability than the complex omics variables. However, the combination of omics and clinical variables yielded the highest accuracy. We have incorporated the developed clinical models into a web interface (see:) and made it available to the community.
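As a rough illustration of the evaluation metric named in this abstract, the sketch below dichotomises liver fat at 5% and computes a ROC AUC from predicted risks. It does not reproduce the authors' LASSO and random forest pipeline or their cross-validation; the pairwise (Mann-Whitney style) AUC formula is a standard equivalent definition, and the function names and all values are hypothetical.

```python
def binarise_liver_fat(percent, threshold=5.0):
    """Map liver fat content (%) to the binary outcome: 1 if >= threshold, else 0."""
    return 1 if percent >= threshold else 0


def roc_auc(labels, scores):
    """ROC AUC as the probability that a randomly chosen positive case scores
    higher than a randomly chosen negative case (ties count one half)."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative case")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Made-up liver fat percentages and model risk scores (assumption, not study data).
liver_fat_percent = [2.1, 7.4, 4.9, 12.0, 5.0, 3.3]
predicted_risk = [0.10, 0.80, 0.40, 0.90, 0.35, 0.20]

labels = [binarise_liver_fat(p) for p in liver_fat_percent]
auc = roc_auc(labels, predicted_risk)
print(labels, round(auc, 3))
```

Because the outcome is thresholded, a participant at 4.9% and one at 5.0% fall on opposite sides of the label even though their liver fat is nearly identical, which is the limitation of a binary outcome that the abstract notes.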