Background: Genetic variants within nearly 1000 loci are known to contribute to modulation of blood lipid levels. However, the biological pathways underlying these associations are frequently... Show moreBackground: Genetic variants within nearly 1000 loci are known to contribute to modulation of blood lipid levels. However, the biological pathways underlying these associations are frequently unknown, limiting understanding of these findings and hindering downstream translational efforts such as drug target discovery. Results: To expand our understanding of the underlying biological pathways and mechanisms controlling blood lipid levels, we leverage a large multi-ancestry meta-analysis (N=1,654,960) of blood lipids to prioritize putative causal genes for 2286 lipid associations using six gene prediction approaches. Using phenome-wide association (PheWAS) scans, we identify relationships of genetically predicted lipid levels to other diseases and conditions. We confirm known pleiotropic associations with cardiovascular phenotypes and determine novel associations, notably with cholelithiasis risk. We perform sex-stratified GWAS meta-analysis of lipid levels and show that 3-5% of autosomal lipid-associated loci demonstrate sex-biased effects. Finally, we report 21 novel lipid loci identified on the X chromosome. Many of the sex-biased autosomal and X chromosome lipid loci show pleiotropic associations with sex hormones, emphasizing the role of hormone regulation in lipid metabolism. Conclusions: Taken together, our findings provide insights into the biological mechanisms through which associated variants lead to altered lipid levels and potentially cardiovascular disease risk. Show less
A major challenge of genome-wide association studies (GWASs) is to translate phenotypic associations into biological insights. Here, we integrate a large GWAS on blood lipids involving 1.6 million... Show moreA major challenge of genome-wide association studies (GWASs) is to translate phenotypic associations into biological insights. Here, we integrate a large GWAS on blood lipids involving 1.6 million individuals from five ancestries with a wide array of functional genomic datasets to discover regulatory mechanisms underlying lipid associations. We first prioritize lipid-associated genes with expression quantitative trait locus (eQTL) colocalizations and then add chromatin interaction data to narrow the search for functional genes. Polygenic enrichment analysis across 697 annotations from a host of tissues and cell types confirms the central role of the liver in lipid levels and highlights the selective enrichment of adipose-specific chromatin marks in high-density lipoprotein cholesterol and triglycerides. Overlapping transcription factor (TF) binding sites with lipid-associated loci identifies TFs relevant in lipid biology. In addition, we present an integrative framework to prioritize causal variants at GWAS loci, producing a comprehensive list of candidate causal genes and variants with multiple layers of functional evidence. We highlight two of the prioritized genes, CREBRF and RRBP1, which show convergent evidence across functional datasets supporting their roles in lipid biology. Show less
The presentation and underlying pathophysiology of type 2 diabetes (T2D) is complex and heterogeneous. Recent studies attempted to stratify T2D into distinct subgroups using data-driven approaches,... Show moreThe presentation and underlying pathophysiology of type 2 diabetes (T2D) is complex and heterogeneous. Recent studies attempted to stratify T2D into distinct subgroups using data-driven approaches, but their clinical utility may be limited if categorical representations of complex phenotypes are suboptimal. We apply a soft-clustering (archetype) method to characterize newly diagnosed T2D based on 32 clinical variables. We assign quantitative clustering scores for individuals and investigate the associations with glycemic deterioration, genetic risk scores, circulating omics biomarkers, and phenotypic stability over 36 months. Four archetype profiles represent dysfunction patterns across combinations of T2D etiological processes and correlate with multiple circulating biomarkers. One archetype associated with obesity, insulin resistance, dyslipidemia, and impaired 1 beta cell glucose sensitivity corresponds with the fastest disease progression and highest demand for anti-diabetic treatment. We demonstrate that clinical heterogeneity in T2D can be mapped to heterogeneity in individual etiological processes, providing a potential route to personalized treatments. Show less
Common single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions... Show moreCommon single-nucleotide polymorphisms (SNPs) are predicted to collectively explain 40-50% of phenotypic variation in human height, but identifying the specific variants and associated regions requires huge sample sizes1. Here, using data from a genome-wide association study of 5.4 million individuals of diverse ancestries, we show that 12,111 independent SNPs that are significantly associated with height account for nearly all of the common SNP-based heritability. These SNPs are clustered within 7,209 non-overlapping genomic segments with a mean size of around 90 kb, covering about 21% of the genome. The density of independent associations varies across the genome and the regions of increased density are enriched for biologically relevant genes. In out-of-sample estimation and prediction, the 12,111 SNPs (or all SNPs in the HapMap 3 panel2) account for 40% (45%) of phenotypic variance in populations of European ancestry but only around 10-20% (14-24%) in populations of other ancestries. Effect sizes, associated regions and gene prioritization are similar across ancestries, indicating that reduced prediction accuracy is likely to be explained by linkage disequilibrium and differences in allele frequency within associated regions. Finally, we show that the relevant biological pathways are detectable with smaller sample sizes than are needed to implicate causal genes and variants. Overall, this study provides a comprehensive map of specific genomic regions that contain the vast majority of common height-associated variants. Although this map is saturated for populations of European ancestry, further research is needed to achieve equivalent saturation in other ancestries. Show less
Increased blood lipid levels are heritable risk factors of cardiovascular disease with varied prevalence worldwide owing to different dietary patterns and medication use(1). Despite advances in... Show moreIncreased blood lipid levels are heritable risk factors of cardiovascular disease with varied prevalence worldwide owing to different dietary patterns and medication use(1). Despite advances in prevention and treatment, in particular through reducing low-density lipoprotein cholesterol levels(2), heart disease remains the leading cause of death worldwide(3). Genome-wideassociation studies (GWAS) of blood lipid levels have led to important biological and clinical insights, as well as new drug targets, for cardiovascular disease. However, most previous GWAS(4-23) have been conducted in European ancestry populations and may have missed genetic variants that contribute to lipid-level variation in other ancestry groups. These include differences in allele frequencies, effect sizes and linkage-disequilibrium patterns(24). Here we conduct a multi-ancestry, genome-wide genetic discovery meta-analysis of lipid levels in approximately 1.65 million individuals, including 350,000 of non-European ancestries. We quantify the gain in studying non-European ancestries and provide evidence to support the expansion of recruitment of additional ancestries, even with relatively small sample sizes. We find that increasing diversity rather than studying additional individuals of European ancestry results in substantial improvements in fine-mapping functional variants and portability of polygenic prediction (evaluated in approximately 295,000 individuals from 7 ancestry groupings). Modest gains in the number of discovered loci and ancestry-specific variants were also achieved. As GWAS expand emphasis beyond the identification of genes and fundamental biology towards the use of genetic variants for preventive and precision medicine(25), we anticipate that increased diversity of participants will lead to more accurate and equitable(26) application of polygenic scores in clinical practice. Show less
OBJECTIVEWe investigated the processes underlying glycemic deterioration in type 2 diabetes (T2D).RESEARCH DESIGN AND METHODSA total of 732 recently diagnosed patients with T2D from the Innovative... Show moreOBJECTIVEWe investigated the processes underlying glycemic deterioration in type 2 diabetes (T2D).RESEARCH DESIGN AND METHODSA total of 732 recently diagnosed patients with T2D from the Innovative Medicines Initiative Diabetes Research on Patient Stratification (IMI DIRECT) study were extensively phenotyped over 3 years, including measures of insulin sensitivity (OGIS), beta-cell glucose sensitivity (GS), and insulin clearance (CLIm) from mixed meal tests, liver enzymes, lipid profiles, and baseline regional fat from MRI. The associations between the longitudinal metabolic patterns and HbA(1c) deterioration, adjusted for changes in BMI and in diabetes medications, were assessed via stepwise multivariable linear and logistic regression.RESULTSFaster HbA(1c) progression was independently associated with faster deterioration of OGIS and GS and increasing CLIm; visceral or liver fat, HDL-cholesterol, and triglycerides had further independent, though weaker, roles (R-2 = 0.38). A subgroup of patients with a markedly higher progression rate (fast progressors) was clearly distinguishable considering these variables only (discrimination capacity from area under the receiver operating characteristic = 0.94). The proportion of fast progressors was reduced from 56% to 8-10% in subgroups in which only one trait among OGIS, GS, and CLIm was relatively stable (odds ratios 0.07-0.09). T2D polygenic risk score and baseline pancreatic fat, glucagon-like peptide 1, glucagon, diet, and physical activity did not show an independent role.CONCLUSIONSDeteriorating insulin sensitivity and beta-cell function, increasing insulin clearance, high visceral or liver fat, and worsening of the lipid profile are the crucial factors mediating glycemic deterioration of patients with T2D in the initial phase of the disease. Stabilization of a single trait among insulin sensitivity, beta-cell function, and insulin clearance may be relevant to prevent progression. Show less
AimSubclasses of different glycaemic disturbances could explain the variation in characteristics of individuals with type 2 diabetes (T2D). We aimed to examine the association between subgroups... Show moreAimSubclasses of different glycaemic disturbances could explain the variation in characteristics of individuals with type 2 diabetes (T2D). We aimed to examine the association between subgroups based on their glucose curves during a five-point mixed-meal tolerance test (MMT) and metabolic traits at baseline and glycaemic deterioration in individuals with T2D.MethodsThe study included 787 individuals with newly diagnosed T2D from the Diabetes Research on Patient Stratification (IMI-DIRECT) Study. Latent class trajectory analysis (LCTA) was used to identify distinct glucose curve subgroups during a five-point MMT. Using general linear models, these subgroups were associated with metabolic traits at baseline and after 18 months of follow up, adjusted for potential confounders.ResultsAt baseline, we identified three glucose curve subgroups, labelled in order of increasing glucose peak levels as subgroup 1-3. Individuals in subgroup 2 and 3 were more likely to have higher levels of HbA1c, triglycerides and BMI at baseline, compared to those in subgroup 1. At 18 months (n = 651), the beta coefficients (95% CI) for change in HbA1c (mmol/mol) increased across subgroups with 0.37 (-0.18-1.92) for subgroup 2 and 1.88 (-0.08-3.85) for subgroup 3, relative to subgroup 1. The same trend was observed for change in levels of triglycerides and fasting glucose.ConclusionsDifferent glycaemic profiles with different metabolic traits and different degrees of subsequent glycaemic deterioration can be identified using data from a frequently sampled mixed-meal tolerance test in individuals with T2D. Subgroups with the highest peaks had greater metabolic risk. Show less
Obura, M.; Beulens, J.W.J.; Slieker, R.; Koopman, A.D.M.; Hoekstra, T.; Nijpels, G.; ... ; Rutters, F. 2020
Aim To examine the hypothesis that, based on their glucose curves during a seven-point oral glucose tolerance test, people at elevated type 2 diabetes risk can be divided into subgroups with... Show moreAim To examine the hypothesis that, based on their glucose curves during a seven-point oral glucose tolerance test, people at elevated type 2 diabetes risk can be divided into subgroups with different clinical profiles at baseline and different degrees of subsequent glycaemic deterioration.Methods We included 2126 participants at elevated type 2 diabetes risk from the Diabetes Research on Patient Stratification (IMI-DIRECT) study. Latent class trajectory analysis was used to identify subgroups from a seven-point oral glucose tolerance test at baseline and follow-up. Linear models quantified the associations between the subgroups with glycaemic traits at baseline and 18 months.Results At baseline, we identified four glucose curve subgroups, labelled in order of increasing peak levels as 1-4. Participants in Subgroups 2-4, were more likely to have higher insulin resistance (homeostatic model assessment) and a lower Matsuda index, than those in Subgroup 1. Overall, participants in Subgroups 3 and 4, had higher glycaemic trait values, with the exception of the Matsuda and insulinogenic indices. At 18 months, change in homeostatic model assessment of insulin resistance was higher in Subgroup 4 (beta = 0.36, 95% CI 0.13-0.58), Subgroup 3 (beta = 0.30; 95% CI 0.10-0.50) and Subgroup 2 (beta = 0.18; 95% CI 0.04-0.32), compared to Subgroup 1. The same was observed for C-peptide and insulin. Five subgroups were identified at follow-up, and the majority of participants remained in the same subgroup or progressed to higher peak subgroups after 18 months.Conclusions Using data from a frequently sampled oral glucose tolerance test, glucose curve patterns associated with different clinical characteristics and different rates of subsequent glycaemic deterioration can be identified. Show less
Atabaki-Pasdar, N.; Ohlsson, M.; Vinuela, A.; Frau, F.; Pomares-Millan, H.; Haid, M.; ... ; Franks, P.W. 2020
BackgroundNon-alcoholic fatty liver disease (NAFLD) is highly prevalent and causes serious health complications in individuals with and without type 2 diabetes (T2D). Early diagnosis of NAFLD is... Show moreBackgroundNon-alcoholic fatty liver disease (NAFLD) is highly prevalent and causes serious health complications in individuals with and without type 2 diabetes (T2D). Early diagnosis of NAFLD is important, as this can help prevent irreversible damage to the liver and, ultimately, hepatocellular carcinomas. We sought to expand etiological understanding and develop a diagnostic tool for NAFLD using machine learning.Methods and findingsWe utilized the baseline data from IMI DIRECT, a multicenter prospective cohort study of 3,029 European-ancestry adults recently diagnosed with T2D (n= 795) or at high risk of developing the disease (n= 2,234). Multi-omics (genetic, transcriptomic, proteomic, and metabolomic) and clinical (liver enzymes and other serological biomarkers, anthropometry, measures of beta-cell function, insulin sensitivity, and lifestyle) data comprised the key input variables. The models were trained on MRI-image-derived liver fat content (<5% or >= 5%) available for 1,514 participants. We applied LASSO (least absolute shrinkage and selection operator) to select features from the different layers of omics data and random forest analysis to develop the models. The prediction models included clinical and omics variables separately or in combination. A model including all omics and clinical variables yielded a cross-validated receiver operating characteristic area under the curve (ROCAUC) of 0.84 (95% CI 0.82, 0.86;p <0.001), which compared with a ROCAUC of 0.82 (95% CI 0.81, 0.83;p <0.001) for a model including 9 clinically accessible variables. The IMI DIRECT prediction models outperformed existing noninvasive NAFLD prediction tools. One limitation is that these analyses were performed in adults of European ancestry residing in northern Europe, and it is unknown how well these findings will translate to people of other ancestries and exposed to environmental risk factors that differ from those of the present cohort. Another key limitation of this study is that the prediction was done on a binary outcome of liver fat quantity (<5% or >= 5%) rather than a continuous one.ConclusionsIn this study, we developed several models with different combinations of clinical and omics data and identified biological features that appear to be associated with liver fat accumulation. In general, the clinical variables showed better prediction ability than the complex omics variables. However, the combination of omics and clinical variables yielded the highest accuracy. We have incorporated the developed clinical models into a web interface (see:) and made it available to the community. Show less