Introduction: Educational attainment, widely used in epidemiologic studies as a surrogate for socioeconomic status, is a predictor of cardiovascular health outcomes. Methods: A two-stage genome-wide meta-analysis of low-density lipoprotein cholesterol (LDL), high-density lipoprotein cholesterol (HDL), and triglyceride (TG) levels was performed while accounting for gene-educational attainment interactions in up to 226,315 individuals from five population groups. We considered two educational attainment variables: "Some College" (yes/no, for any education beyond high school) and "Graduated College" (yes/no, for completing a 4-year college degree). Genome-wide significant (p < 5 x 10^-8) and suggestive (p < 1 x 10^-6) variants were identified in Stage 1 (in up to 108,784 individuals) through genome-wide analysis, and those variants were followed up in Stage 2 studies (in up to 117,531 individuals). Results: In combined analysis of Stages 1 and 2, we identified 18 novel lipid loci (nine for LDL, seven for HDL, and two for TG) by two degree-of-freedom (2 DF) joint tests of main and interaction effects. Four loci showed significant interaction with educational attainment. Two loci were significant only in cross-population analyses. Several loci include genes with known or suggested roles in adipose (FOXP1, MBOAT4, SKP2, STIM1, STX4), brain (BRI3, FILIP1, FOXP1, LINC00290, LMTK2, MBOAT4, MYO6, SENP6, SRGAP3, STIM1, TMEM167A, TMEM30A), and liver (BRI3, FOXP1) biology, highlighting the potential importance of brain-adipose-liver communication in the regulation of lipid metabolism. An investigation of the potential druggability of genes in identified loci resulted in five gene targets shown to interact with drugs approved by the Food and Drug Administration, including genes with roles in adipose and brain tissue. Discussion: Genome-wide interaction analysis of educational attainment identified novel lipid loci not previously detected by analyses limited to main genetic effects.
Zanti, M.; O'Mahony, D.G.; Parsons, M.T.; Li, H.Y.; Dennis, J.; Aittomäki, K.; ... ; GC-HBOC Study Collaborators 2023
A large number of variants identified through clinical genetic testing in disease susceptibility genes are of uncertain significance (VUS). Following the recommendations of the American College of Medical Genetics and Genomics (ACMG) and Association for Molecular Pathology (AMP), the frequency in case-control datasets (PS4 criterion) can inform their interpretation. We present a novel case-control likelihood ratio-based method that incorporates gene-specific age-related penetrance. We demonstrate the utility of this method in the analysis of simulated and real datasets. In the analysis of simulated data, the likelihood ratio method was more powerful than other methods. Likelihood ratios were calculated for a case-control dataset of BRCA1 and BRCA2 variants from the Breast Cancer Association Consortium (BCAC) and compared with logistic regression results. A larger number of variants reached evidence in favor of pathogenicity, and a substantial number of variants had evidence against pathogenicity; these findings would not have been reached using other case-control analysis methods. Our novel method provides greater power to classify rare variants compared with classical case-control methods. As an initiative from the ENIGMA Analytical Working Group, we provide user-friendly scripts and preformatted Excel calculators for implementation of the method for rare variants in BRCA1, BRCA2, and other high-risk genes with known penetrance.
Background Low-frequency variants play an important role in breast cancer (BC) susceptibility. Gene-based methods can increase power by combining multiple variants in the same gene and help identify target genes. Methods We evaluated the potential of gene-based aggregation in the Breast Cancer Association Consortium cohorts including 83,471 cases and 59,199 controls. Low-frequency variants were aggregated for individual genes' coding and regulatory regions. Association results in European ancestry samples were compared to single-marker association results in the same cohort. Gene-based associations were also combined in meta-analysis across individuals with European, Asian, African, and Latin American and Hispanic ancestry. Results In European ancestry samples, 14 genes were significantly associated (q < 0.05) with BC. Of those, two genes, FMNL3 (P = 6.11 x 10^-6) and AC058822.1 (P = 1.47 x 10^-4), represent new associations. High FMNL3 expression has previously been linked to poor prognosis in several other cancers. Meta-analysis of samples with diverse ancestry discovered further associations including established candidate genes ESR1 and CBLB. Furthermore, literature review and database query found further support for a biologically plausible link with cancer for genes CBLB, FMNL3, FGFR2, LSP1, MAP3K1, and SRGAP2C. Conclusions Using extended gene-based aggregation tests including coding and regulatory variation, we report identification of plausible target genes for previously identified single-marker associations with BC as well as the discovery of novel genes implicated in BC development. Including multi-ancestral cohorts in this study enabled the identification of otherwise missed disease associations such as ESR1 (P = 1.31 x 10^-5), demonstrating the importance of diversifying study cohorts.
Background: Genetic variants within nearly 1000 loci are known to contribute to modulation of blood lipid levels. However, the biological pathways underlying these associations are frequently unknown, limiting understanding of these findings and hindering downstream translational efforts such as drug target discovery. Results: To expand our understanding of the underlying biological pathways and mechanisms controlling blood lipid levels, we leverage a large multi-ancestry meta-analysis (N=1,654,960) of blood lipids to prioritize putative causal genes for 2286 lipid associations using six gene prediction approaches. Using phenome-wide association (PheWAS) scans, we identify relationships of genetically predicted lipid levels to other diseases and conditions. We confirm known pleiotropic associations with cardiovascular phenotypes and determine novel associations, notably with cholelithiasis risk. We perform sex-stratified GWAS meta-analysis of lipid levels and show that 3-5% of autosomal lipid-associated loci demonstrate sex-biased effects. Finally, we report 21 novel lipid loci identified on the X chromosome. Many of the sex-biased autosomal and X chromosome lipid loci show pleiotropic associations with sex hormones, emphasizing the role of hormone regulation in lipid metabolism. Conclusions: Taken together, our findings provide insights into the biological mechanisms through which associated variants lead to altered lipid levels and potentially cardiovascular disease risk.
A major challenge of genome-wide association studies (GWASs) is to translate phenotypic associations into biological insights. Here, we integrate a large GWAS on blood lipids involving 1.6 million individuals from five ancestries with a wide array of functional genomic datasets to discover regulatory mechanisms underlying lipid associations. We first prioritize lipid-associated genes with expression quantitative trait locus (eQTL) colocalizations and then add chromatin interaction data to narrow the search for functional genes. Polygenic enrichment analysis across 697 annotations from a host of tissues and cell types confirms the central role of the liver in lipid levels and highlights the selective enrichment of adipose-specific chromatin marks in high-density lipoprotein cholesterol and triglycerides. Overlapping transcription factor (TF) binding sites with lipid-associated loci identifies TFs relevant in lipid biology. In addition, we present an integrative framework to prioritize causal variants at GWAS loci, producing a comprehensive list of candidate causal genes and variants with multiple layers of functional evidence. We highlight two of the prioritized genes, CREBRF and RRBP1, which show convergent evidence across functional datasets supporting their roles in lipid biology.
Objectives Physical inactivity and sedentary behaviour are associated with higher breast cancer risk in observational studies, but ascribing causality is difficult. Mendelian randomisation (MR) assesses causality by simulating randomised trial groups using genotype. We assessed whether lifelong physical activity or sedentary time, assessed using genotype, may be causally associated with breast cancer risk overall, pre/post-menopause, and by case-groups defined by tumour characteristics. Methods We performed two-sample inverse-variance-weighted MR using individual-level Breast Cancer Association Consortium case-control data from 130 957 European-ancestry women (69 838 invasive cases), and published UK Biobank data (n=91 105-377 234). Genetic instruments were single nucleotide polymorphisms (SNPs) associated in UK Biobank with wrist-worn accelerometer-measured overall physical activity (n(snps)=5) or sedentary time (n(snps)=6), or accelerometer-measured (n(snps)=1) or self-reported (n(snps)=5) vigorous physical activity. Results Greater genetically-predicted overall activity was associated with lower overall breast cancer risk (OR=0.59; 95% confidence interval (CI) 0.42 to 0.83 per-standard deviation (SD; ~8 milligravities acceleration)) and for most case-groups. Genetically-predicted vigorous activity was associated with lower risk of pre/perimenopausal breast cancer (OR=0.62; 95% CI 0.45 to 0.87, >= 3 vs. 0 self-reported days/week), with consistent estimates for most case-groups. Greater genetically-predicted sedentary time was associated with higher hormone-receptor-negative tumour risk (OR=1.77; 95% CI 1.07 to 2.92 per-SD (~7% time spent sedentary)), with elevated estimates for most case-groups. Results were robust to sensitivity analyses examining pleiotropy (including weighted-median-MR, MR-Egger). Conclusion Our study provides strong evidence that greater overall physical activity, greater vigorous activity, and lower sedentary time are likely to reduce breast cancer risk. More widespread adoption of active lifestyles may reduce the burden from the most common cancer in women.
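The inverse-variance-weighted MR estimate mentioned in the abstract above can be illustrated with a minimal sketch. It assumes the standard fixed-effect IVW estimator computed from per-SNP summary statistics; the function name `ivw_mr` and the input numbers are hypothetical and are not taken from the study, whose own analysis used individual-level data and additional sensitivity methods.

```python
import math


def ivw_mr(beta_exposure, beta_outcome, se_outcome):
    """Fixed-effect inverse-variance-weighted MR estimate (illustrative sketch).

    beta_exposure: per-SNP effects on the exposure (e.g. activity, per SD)
    beta_outcome:  per-SNP effects on the outcome (log odds ratio)
    se_outcome:    standard errors of the per-SNP outcome effects
    Returns (estimate, standard_error) on the log-odds scale.
    """
    if not (len(beta_exposure) == len(beta_outcome) == len(se_outcome)):
        raise ValueError("all inputs must have the same length")
    # Each SNP is weighted by the inverse variance of its outcome effect.
    weights = [1.0 / se ** 2 for se in se_outcome]
    numerator = sum(w * bx * by
                    for w, bx, by in zip(weights, beta_exposure, beta_outcome))
    denominator = sum(w * bx * bx for w, bx in zip(weights, beta_exposure))
    estimate = numerator / denominator
    standard_error = math.sqrt(1.0 / denominator)
    return estimate, standard_error


# Hypothetical summary statistics for two SNPs (not from the study).
est, se = ivw_mr([0.1, 0.2], [-0.05, -0.10], [0.01, 0.01])
odds_ratio = math.exp(est)  # causal odds ratio per unit of exposure
```

Exponentiating the estimate gives an odds ratio per unit (here per SD) of the genetically predicted exposure, which is the scale on which the abstract reports its results.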
Background Genome-wide association studies (GWAS) have identified multiple common breast cancer susceptibility variants. Many of these variants have differential associations by estrogen receptor (ER) status, but how these variants relate to other tumor features and intrinsic molecular subtypes is unclear. Methods Among 106,571 invasive breast cancer cases and 95,762 controls of European ancestry with data on 173 breast cancer variants identified in previous GWAS, we used novel two-stage polytomous logistic regression models to evaluate variants in relation to multiple tumor features (ER, progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2) and grade) adjusting for each other, and to intrinsic-like subtypes. Results Eighty-five of 173 variants were associated with at least one tumor feature (false discovery rate < 5%), most commonly ER and grade, followed by PR and HER2. Models for intrinsic-like subtypes found nearly all of these variants (83 of 85) associated at p < 0.05 with risk for at least one luminal-like subtype, and approximately half (41 of 85) of the variants were associated with risk of at least one non-luminal subtype, including 32 variants associated with triple-negative (TN) disease. Ten variants were associated with risk of all subtypes in different magnitude. Five variants were associated with risk of luminal A-like and TN subtypes in opposite directions. Conclusion This report demonstrates a high level of complexity in the etiologic heterogeneity of breast cancer susceptibility variants and can inform investigations of subtype-specific risk prediction.
Increased blood lipid levels are heritable risk factors of cardiovascular disease with varied prevalence worldwide owing to different dietary patterns and medication use(1). Despite advances in prevention and treatment, in particular through reducing low-density lipoprotein cholesterol levels(2), heart disease remains the leading cause of death worldwide(3). Genome-wide association studies (GWAS) of blood lipid levels have led to important biological and clinical insights, as well as new drug targets, for cardiovascular disease. However, most previous GWAS(4-23) have been conducted in European ancestry populations and may have missed genetic variants that contribute to lipid-level variation in other ancestry groups. These include differences in allele frequencies, effect sizes and linkage-disequilibrium patterns(24). Here we conduct a multi-ancestry, genome-wide genetic discovery meta-analysis of lipid levels in approximately 1.65 million individuals, including 350,000 of non-European ancestries. We quantify the gain in studying non-European ancestries and provide evidence to support the expansion of recruitment of additional ancestries, even with relatively small sample sizes. We find that increasing diversity rather than studying additional individuals of European ancestry results in substantial improvements in fine-mapping functional variants and portability of polygenic prediction (evaluated in approximately 295,000 individuals from 7 ancestry groupings). Modest gains in the number of discovered loci and ancestry-specific variants were also achieved. As GWAS expand emphasis beyond the identification of genes and fundamental biology towards the use of genetic variants for preventive and precision medicine(25), we anticipate that increased diversity of participants will lead to more accurate and equitable(26) application of polygenic scores in clinical practice.
Background Despite a modest association between tobacco smoking and breast cancer risk reported by recent epidemiological studies, it is still equivocal whether smoking is causally related to breast cancer risk. Methods We applied Mendelian randomisation (MR) to evaluate a potential causal effect of cigarette smoking on breast cancer risk. Both individual-level data and summary statistics for 164 single-nucleotide polymorphisms (SNPs) reported in genome-wide association studies of lifetime smoking index (LSI) or cigarettes per day (CPD) were used to obtain MR effect estimates. Data from 108,420 invasive breast cancer cases and 87,681 controls were used for the LSI analysis; the CPD analysis was conducted among ever-smokers, comprising 26,147 cancer cases and 26,072 controls. Sensitivity analyses were conducted to address pleiotropy. Results Genetically predicted LSI was associated with increased breast cancer risk (OR 1.18 per SD, 95% CI: 1.07-1.30, P = 0.11 x 10^-2), but there was no evidence of association for genetically predicted CPD (OR 1.02, 95% CI: 0.78-1.19, P = 0.85). The sensitivity analyses yielded similar results and showed no strong evidence of pleiotropic effect. Conclusion Our MR study provides supportive evidence for a potential causal association with breast cancer risk for lifetime smoking exposure but not cigarettes per day among smokers.
A combination of genetic and functional approaches has identified three independent breast cancer risk loci at 2q35. A recent fine-scale mapping analysis to refine these associations resulted in 1 (signal 1), 5 (signal 2), and 42 (signal 3) credible causal variants at these loci. We used publicly available in silico DNase I and ChIP-seq data with in vitro reporter gene and CRISPR assays to annotate signals 2 and 3. We identified putative regulatory elements that enhanced cell-type-specific transcription from the IGFBP5 promoter at both signals (30- to 40-fold increased expression by the putative regulatory element at signal 2, 2- to 3-fold by the putative regulatory element at signal 3). We further identified one of the five credible causal variants at signal 2, a 1.4 kb deletion (esv3594306), as the likely causal variant; the deletion allele of this variant was associated with an average additional increase in IGFBP5 expression of 1.3-fold (MCF-7) and 2.2-fold (T-47D). We propose a model in which the deletion allele of esv3594306 juxtaposes two transcription factor binding regions (annotated by estrogen receptor alpha ChIP-seq peaks) to generate a single extended regulatory element. This regulatory element increases cell-type-specific expression of the tumor suppressor gene IGFBP5 and, thereby, reduces risk of estrogen receptor-positive breast cancer (odds ratio = 0.77, 95% CI 0.74-0.81, p = 3.1 x 10^-31).
Wang, H.M.; Noordam, R.; Cade, B.E.; Schwander, K.; Winkler, T.W.; Lee, J.; ... ; Heemst, D. van 2021
Long and short sleep duration are associated with elevated blood pressure (BP), possibly through effects on molecular pathways that influence neuroendocrine and vascular systems. To gain new insights into the genetic basis of sleep-related BP variation, we performed genome-wide gene by short or long sleep duration interaction analyses on four BP traits (systolic BP, diastolic BP, mean arterial pressure, and pulse pressure) across five ancestry groups in two stages using a 2 degree of freedom (df) joint test followed by a 1df test of interaction effects. Primary multi-ancestry analysis in 62,969 individuals in stage 1 identified three novel gene by sleep interactions that were replicated in an additional 59,296 individuals in stage 2 (stage 1 + 2 P-joint < 5 x 10^-8), including rs7955964 (FIGNL2/ANKRD33) that increases BP among long sleepers, and rs73493041 (SNORA26/C9orf170) and rs10406644 (KCTD15/LSM14A) that increase BP among short sleepers (P-int < 5 x 10^-8). Secondary ancestry-specific analysis identified another novel gene by long sleep interaction at rs111887471 (TRPC3/KIAA1109) in individuals of African ancestry (P-int = 2 x 10^-6). Combined stage 1 and 2 analyses additionally identified significant gene by long sleep interactions at 10 loci including MKLN1 and RGL3/ELAVL3 previously associated with BP, and significant gene by short sleep interactions at 10 loci including C2orf43 previously associated with BP (P-int < 10^-3). The 2df test also identified novel loci for BP after modeling sleep that have known functions in sleep-wake regulation and in the nervous and cardiometabolic systems. This study indicates that sleep and primary mechanisms regulating BP may interact to elevate BP level, suggesting novel insights into sleep-related BP regulation.
Background: It is not known whether modifiable lifestyle factors that predict survival after invasive breast cancer differ by subtype. Methods: We analyzed data for 121,435 women diagnosed with breast cancer from 67 studies in the Breast Cancer Association Consortium with 16,890 deaths (8,554 breast cancer specific) over 10 years. Cox regression was used to estimate associations between risk factors and 10-year all-cause mortality and breast cancer-specific mortality overall, by estrogen receptor (ER) status, and by intrinsic-like subtype. Results: There was no evidence of heterogeneous associations between risk factors and mortality by subtype (P-adj > 0.30). The strongest associations were between all-cause mortality and BMI >= 30 versus 18.5-25 kg/m^2 [HR (95% confidence interval (CI)), 1.19 (1.06-1.34)]; current versus never smoking [1.37 (1.27-1.47)]; high versus low physical activity [0.43 (0.21-0.86)]; age >= 30 years versus < 20 years at first pregnancy [0.79 (0.72-0.86)]; >0-< 5 years versus >= 10 years since last full-term birth [1.31 (1.11-1.55)]; ever versus never use of oral contraceptives [0.91 (0.87-0.96)]; ever versus never use of menopausal hormone therapy, including current estrogen-progestin therapy [0.61 (0.54-0.69)]. Similar associations with breast cancer mortality were weaker; for example, 1.11 (1.02-1.21) for current versus never smoking. Conclusions: We confirm associations between modifiable lifestyle factors and 10-year all-cause mortality. There was no strong evidence that associations differed by ER status or intrinsic-like subtype. Impact: Given the large dataset and lack of evidence that associations between modifiable risk factors and 10-year mortality differed by subtype, these associations could be cautiously used in prognostication models to inform patient-centered care.
Background Epidemiological studies provide strong evidence for a role of endogenous sex hormones in the aetiology of breast cancer. The aim of this analysis was to identify genetic variants that are associated with urinary sex-hormone levels and breast cancer risk. Methods We carried out a genome-wide association study of urinary oestrone-3-glucuronide and pregnanediol-3-glucuronide levels in 560 premenopausal women, with additional analysis of progesterone levels in 298 premenopausal women. To test for the association with breast cancer risk, we carried out follow-up genotyping in 90,916 cases and 89,893 controls from the Breast Cancer Association Consortium. All women were of European ancestry. Results For pregnanediol-3-glucuronide, there were no genome-wide significant associations; for oestrone-3-glucuronide, we identified a single peak mapping to the CYP3A locus, annotated by rs45446698. The minor rs45446698-C allele was associated with lower oestrone-3-glucuronide (-49.2%, 95% CI -56.1% to -41.1%, P = 3.1 x 10^-18); in follow-up analyses, rs45446698-C was also associated with lower progesterone (-26.7%, 95% CI -39.4% to -11.6%, P = 0.001) and reduced risk of oestrogen and progesterone receptor-positive breast cancer (OR = 0.86, 95% CI 0.82-0.91, P = 6.9 x 10^-8). Conclusions The CYP3A7*1C allele is associated with reduced risk of hormone receptor-positive breast cancer possibly mediated via an effect on the metabolism of endogenous sex hormones in premenopausal women.
Previous research has shown that polygenic risk scores (PRSs) can be used to stratify women according to their risk of developing primary invasive breast cancer. This study aimed to evaluate the association between a recently validated PRS of 313 germline variants (PRS313) and contralateral breast cancer (CBC) risk. We included 56,068 women of European ancestry diagnosed with first invasive breast cancer from 1990 onward with follow-up from the Breast Cancer Association Consortium. Metachronous CBC risk (N = 1,027) according to the distribution of PRS313 was quantified using Cox regression analyses. We assessed PRS313 interaction with age at first diagnosis, family history, morphology, ER status, PR status, HER2 status, and (neo)adjuvant therapy. In studies of Asian women, with limited follow-up, CBC risk associated with PRS313 was assessed using logistic regression for 340 women with CBC compared with 12,133 women with unilateral breast cancer. Higher PRS313 was associated with increased CBC risk: hazard ratio per standard deviation (SD) = 1.25 (95%CI = 1.18-1.33) for Europeans, and an OR per SD = 1.15 (95%CI = 1.02-1.29) for Asians. The absolute lifetime risks of CBC, accounting for death as competing risk, were 12.4% for European women at the 10th percentile and 20.5% at the 90th percentile of PRS313. We found no evidence of confounding by or interaction with individual characteristics, characteristics of the primary tumor, or treatment. The C-index for the PRS313 alone was 0.563 (95%CI = 0.547-0.586). In conclusion, PRS313 is an independent factor associated with CBC risk and can be incorporated into CBC risk prediction models to help improve stratification and optimize surveillance and treatment strategies.
In breast cancer, high levels of homeobox protein Hox-B13 (HOXB13) have been associated with disease progression of ER-positive breast cancer patients and resistance to tamoxifen treatment. Since HOXB13 p.G84E is a prostate cancer risk allele, we evaluated the association between HOXB13 germline mutations and breast cancer risk in a previous study consisting of 3,270 familial non-BRCA1/2 breast cancer cases and 2,327 controls from the Netherlands. Although neither of the recurrent HOXB13 mutations p.G84E and p.R217C was associated with breast cancer risk, the risk estimation for p.R217C was not very precise. To provide more conclusive evidence regarding the role of HOXB13 in breast cancer susceptibility, we here evaluated the association between HOXB13 mutations and increased breast cancer risk within 81 studies of the international Breast Cancer Association Consortium containing 68,521 invasive breast cancer patients and 54,865 controls. Neither HOXB13 p.G84E nor p.R217C was associated with the development of breast cancer in European women, either in the overall analysis (OR = 1.035, 95% CI = 0.859-1.246, P = 0.718 and OR = 0.798, 95% CI = 0.482-1.322, P = 0.381 respectively) or in specific high-risk subgroups or breast cancer subtypes. Thus, although involved in breast cancer progression, HOXB13 is not a material breast cancer susceptibility gene.
Genome-wide analysis identifies 32 loci associated with breast cancer susceptibility, accounting for estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 status and tumor grade. Breast cancer susceptibility variants frequently show heterogeneity in associations by tumor subtype(1-3). To identify novel loci, we performed a genome-wide association study including 133,384 breast cancer cases and 113,789 controls, plus 18,908 BRCA1 mutation carriers (9,414 with breast cancer) of European ancestry, using both standard and novel methodologies that account for underlying tumor heterogeneity by estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 status and tumor grade. We identified 32 novel susceptibility loci (P < 5.0 x 10^-8), 15 of which showed evidence for associations with at least one tumor feature (false discovery rate < 0.05). Five loci showed associations (P < 0.05) in opposite directions between luminal and non-luminal subtypes. In silico analyses showed that these five loci contained cell-specific enhancers that differed between normal luminal and basal mammary cells. The genetic correlations between five intrinsic-like subtypes ranged from 0.35 to 0.80. The proportion of genome-wide chip heritability explained by all known susceptibility loci was 54.2% for luminal A-like disease and 37.6% for triple-negative disease. The odds ratios of polygenic risk scores, which included 330 variants, for the highest 1% of quantiles compared with middle quantiles were 5.63 and 3.02 for luminal A-like and triple-negative disease, respectively. These findings provide an improved understanding of genetic predisposition to breast cancer subtypes and will inform the development of subtype-specific polygenic risk scores.
Fuentes, L. de las; Sung, Y.J.; Noordam, R.; Winkler, T.; Feitosa, M.F.; Schwander, K.; ... ; Lifelines Cohort Study 2020
Educational attainment is widely used as a surrogate for socioeconomic status (SES). Low SES is a risk factor for hypertension and high blood pressure (BP). To identify novel BP loci, we performed multi-ancestry meta-analyses accounting for gene-educational attainment interactions using two variables, "Some College" (yes/no) and "Graduated College" (yes/no). Interactions were evaluated using both a 1 degree of freedom (DF) interaction term and a 2DF joint test of genetic and interaction effects. Analyses were performed for systolic BP, diastolic BP, mean arterial pressure, and pulse pressure. We pursued genome-wide interrogation in Stage 1 studies (N = 117 438) and follow-up on promising variants in Stage 2 studies (N = 293 787) in five ancestry groups. Through combined meta-analyses of Stages 1 and 2, we identified 84 known and 18 novel BP loci at genome-wide significance level (P < 5 x 10^-8). Two novel loci were identified based on the 1DF test of interaction with educational attainment, while the remaining 16 loci were identified through the 2DF joint test of genetic and interaction effects. Ten novel loci were identified in individuals of African ancestry. Several novel loci show strong biological plausibility since they involve physiologic systems implicated in BP regulation. They include genes involved in the central nervous system-adrenal signaling axis (ZDHHC17, CADPS, PIK3C2G), vascular structure and function (GNB3, CDON), and renal function (HAS2 and HAS2-AS1, SLIT3). Collectively, these findings suggest a role of educational attainment or SES in further dissection of the genetic architecture of BP.
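The 1DF interaction test and the 2DF joint test named in the abstract above can be sketched as Wald tests on the estimated genetic main effect and gene-by-education interaction effect. This is a minimal illustration under assumed inputs, not the studies' actual pipeline: the function names and numbers are hypothetical, and the code relies only on the facts that a chi-square variable with 2 degrees of freedom has survival function exp(-x/2) and one with 1 degree of freedom has survival function erfc(sqrt(x/2)).

```python
import math


def interaction_1df_test(beta_gxe, var_gxe):
    """1DF Wald test of the interaction effect alone (illustrative sketch)."""
    stat = beta_gxe ** 2 / var_gxe
    # Chi-square survival function with 1 degree of freedom.
    p_value = math.erfc(math.sqrt(stat / 2.0))
    return stat, p_value


def joint_2df_test(beta_g, beta_gxe, var_g, var_gxe, cov):
    """2DF joint Wald test of main and interaction effects (illustrative sketch).

    The statistic is b' V^-1 b, where b = (beta_g, beta_gxe) and V is their
    2x2 covariance matrix; the 2x2 inverse is written out explicitly.
    """
    det = var_g * var_gxe - cov ** 2
    if det <= 0:
        raise ValueError("covariance matrix must be positive definite")
    stat = (beta_g ** 2 * var_gxe
            - 2.0 * beta_g * beta_gxe * cov
            + beta_gxe ** 2 * var_g) / det
    # Chi-square survival function with 2 degrees of freedom.
    p_value = math.exp(-stat / 2.0)
    return stat, p_value


# Hypothetical estimates for one variant (not from the study).
stat_joint, p_joint = joint_2df_test(0.2, 0.1, 0.01, 0.01, 0.0)
stat_int, p_int = interaction_1df_test(0.1, 0.01)
```

The joint test can reach significance when the main effect and interaction effect are each modest, which is consistent with the abstract's report that most novel loci were identified through the 2DF test rather than the 1DF test.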
Previous transcriptome-wide association studies (TWAS) have identified breast cancer risk genes by integrating data from expression quantitative trait loci and genome-wide association studies (GWAS), but analyses of breast cancer subtype-specific associations have been limited. In this study, we conducted a TWAS using gene expression data from GTEx and summary statistics from the hitherto largest GWAS meta-analysis conducted for breast cancer overall, and by estrogen receptor subtypes (ER+ and ER-). We further compared associations with ER+ and ER- subtypes, using a case-only TWAS approach. We also conducted multigene conditional analyses in regions with multiple TWAS associations. Two genes, STXBP4 and HIST2H2BA, were specifically associated with ER+ but not with ER- breast cancer. We further identified 30 TWAS-significant genes associated with overall breast cancer risk, including four that were not identified in previous studies. Conditional analyses identified a single independent breast cancer gene in three of six regions harboring multiple TWAS-significant genes. Our study provides new information on breast cancer genetics and biology, particularly about genomic differences between ER+ and ER- breast cancer.
Fine-mapping of causal variants and integration of epigenetic and chromatin conformation data identify likely target genes for 150 breast cancer risk regions. Genome-wide association studies have identified breast cancer risk variants in over 150 genomic regions, but the mechanisms underlying risk remain largely unknown. These regions were explored by combining association analysis with in silico genomic feature annotations. We defined 205 independent risk-associated signals with the set of credible causal variants in each one. In parallel, we used a Bayesian approach (PAINTOR) that combines genetic association, linkage disequilibrium and enriched genomic features to determine variants with high posterior probabilities of being causal. Potentially causal variants were significantly over-represented in active gene regulatory regions and transcription factor binding sites. We applied our INQUISIT pipeline for prioritizing genes as targets of those potentially causal variants, using gene expression (expression quantitative trait loci), chromatin interaction and functional annotations. Known cancer drivers, transcription factors and genes in the developmental, apoptosis, immune system and DNA integrity checkpoint gene ontology pathways were over-represented among the highest-confidence target genes.