Previous genome-wide association studies (GWASs) of stroke - the second leading cause of death worldwide - were conducted predominantly in populations of European ancestry(1,2). Here, in cross... Show morePrevious genome-wide association studies (GWASs) of stroke - the second leading cause of death worldwide - were conducted predominantly in populations of European ancestry(1,2). Here, in cross-ancestry GWAS meta-analyses of 110,182 patients who have had a stroke (five ancestries, 33% non-European) and 1,503,898 control individuals, we identify association signals for stroke and its subtypes at 89 (61 new) independent loci: 60 in primary inverse-variance-weighted analyses and 29 in secondary meta-regression and multitrait analyses. On the basis of internal cross-ancestry validation and an independent follow-up in 89,084 additional cases of stroke (30% non-European) and 1,013,843 control individuals, 87% of the primary stroke risk loci and 60% of the secondary stroke risk loci were replicated (P < 0.05). Effect sizes were highly correlated across ancestries. Cross-ancestry fine-mapping, in silico mutagenesis analysis(3), and transcriptome-wide and proteome-wide association analyses revealed putative causal genes (such as SH3PXD2A and FURIN) and variants (such as at GRK5 and NOS3). Using a three-pronged approach(4), we provide genetic evidence for putative drug effects, highlighting F11, KLKB1, PROC, GP1BA, LAMC2 and VCAM1 as possible targets, with drugs already under investigation for stroke for F11 and PROC. A polygenic score integrating cross-ancestry and ancestry-specific stroke GWASs with vascular-risk factor GWASs (integrative polygenic scores) strongly predicted ischaemic stroke in populations of European, East Asian and African ancestry(5). Stroke genetic risk scores were predictive of ischaemic stroke independent of clinical risk factors in 52,600 clinical-trial participants with cardiometabolic disease. Our results provide insights to inform biology, reveal potential drug targets and derive genetic risk prediction tools across ancestries. Show less
Ruth, K.S.; Day, F.R.; Hussain, J.; Martinez-Marchal, A.; Aiken, C.E.; Azad, A.; ... ; 23 Me Res Team 2021
Reproductive longevity is essential for fertility and influences healthy ageing in women(1,2), but insights into its underlying biological mechanisms and treatments to preserve it are limited. Here... Show moreReproductive longevity is essential for fertility and influences healthy ageing in women(1,2), but insights into its underlying biological mechanisms and treatments to preserve it are limited. Here we identify 290 genetic determinants of ovarian ageing, assessed using normal variation in age at natural menopause in approximately 200,000 women of European ancestry. These common alleles were associated with clinical extremes of age at natural menopause; women in the top 1% of genetic susceptibility have an equivalent risk of premature ovarian insufficiency to those carrying monogenic FMR1 premutations(3). The identified loci implicate a broad range of DNA damage response (DDR) processes and include loss-of-function variants in key DDR-associated genes. Integration with experimental models demonstrates that these DDR processes act across the life-course to shape the ovarian reserve and its rate of depletion. Furthermore, we demonstrate that experimental manipulation of DDR pathways highlighted by human genetics increases fertility and extends reproductive life in mice. Causal inference analyses using the identified genetic variants indicate that extending reproductive life in women improves bone health and reduces risk of type 2 diabetes, but increases the risk of hormone-sensitive cancers. These findings provide insight into the mechanisms that govern ovarian ageing, when they act, and how they might be targeted by therapeutic approaches to extend fertility and prevent disease. Show less
Previous transcriptome-wide association studies (TWAS) have identified breast cancer risk genes by integrating data from expression quantitative loci and genome-wide association studies (GWAS), but... Show morePrevious transcriptome-wide association studies (TWAS) have identified breast cancer risk genes by integrating data from expression quantitative loci and genome-wide association studies (GWAS), but analyses of breast cancer subtype-specific associations have been limited. In this study, we conducted a TWAS using gene expression data from GTEx and summary statistics from the hitherto largest GWAS meta-analysis conducted for breast cancer overall, and by estrogen receptor subtypes (ER+ and ER-). We further compared associations with ER+ and ER- subtypes, using a case-only TWAS approach. We also conducted multigene conditional analyses in regions with multiple TWAS associations. Two genes, STXBP4 and HIST2H2BA, were specifically associated with ER+ but not with ER- breast cancer. We further identified 30 TWAS-significant genes associated with overall breast cancer risk, including four that were not identified in previous studies. Conditional analyses identified single independent breast-cancer gene in three of six regions harboring multiple TWAS-significant genes. Our study provides new information on breast cancer genetics and biology, particularly about genomic differences between ER+ and ER- breast cancer. Show less
Venous thromboembolism (VTE) is a significant contributor to morbidity and mortality. To advance our understanding of the biology contributing to VTE, we conducted a genome-wide association study ... Show moreVenous thromboembolism (VTE) is a significant contributor to morbidity and mortality. To advance our understanding of the biology contributing to VTE, we conducted a genome-wide association study (GWAS) of VTE and a transcriptome-wide association study (TWAS) based on imputed gene expression from whole blood and liver. Wemeta-analyzedGWAS data from18 studies for 30 234 VTE cases and 172 122 controls and assessed the association between 12 923 718 genetic variants and VTE. We generated variant prediction scores of gene expression from whole blood and liver tissue and assessed them for association with VTE. Mendelian randomization analyses were conducted for traits genetically associated with novel VTE loci. We identified 34 independent genetic signals for VTE risk from GWAS meta-analysis, of which 14 are newly reported associations. This included 11 newly associated genetic loci (C1orf198, PLEK, OSMR-AS1, NUGGC/SCARA5, GRK5, MPHOSPH9, ARID4A, PLCG2, SMG6, EIF5A, and STX10) of which 6 replicated, and 3 new independent signals in 3 known genes. Further, TWAS identified 5 additional genetic loci with imputed gene expression levels differing between cases and controls in whole blood (SH2B3, SPSB1, RP11-747H7.3, RP4-737E23.2) and in liver (ERAP1). At some GWAS loci, we found suggestive evidence that the VTE association signal for novel and previously known regions colocalized with expression quantitative trait locus signals. Mendelian randomization analyses suggested that blood traits may contribute to the underlying risk of VTE. To conclude, we identified 16 novel susceptibility loci for VTE; for some loci, the association signals are likely mediated through gene expression of nearby genes. Show less
Fanconi anemia (FA) is a genetically heterogeneous disorder with 22 disease-causing genes reported to date. In some FA genes, monoallelic mutations have been found to be associated with breast... Show moreFanconi anemia (FA) is a genetically heterogeneous disorder with 22 disease-causing genes reported to date. In some FA genes, monoallelic mutations have been found to be associated with breast cancer risk, while the risk associations of others remain unknown. The gene for FA type C, FANCC, has been proposed as a breast cancer susceptibility gene based on epidemiological and sequencing studies. We used the Oncoarray project to genotype two truncating FANCC variants (p.R185X and p.R548X) in 64,760 breast cancer cases and 49,793 controls of European descent. FANCC mutations were observed in 25 cases (14 with p.R185X, 11 with p.R548X) and 26 controls (18 with p.R185X, 8 with p.R548X). There was no evidence of an association with the risk of breast cancer, neither overall (odds ratio 0.77, 95% CI 0.44-1.33, p = 0.4) nor by histology, hormone receptor status, age or family history. We conclude that the breast cancer risk association of these two FANCC variants, if any, is much smaller than for BRCA1, BRCA2 or PALB2 mutations. If this applies to all truncating variants in FANCC it would suggest there are differences between FA genes in their roles on breast cancer risk and demonstrates the merit of large consortia for clarifying risk associations of rare variants. Show less
Background: In addition to the established association between general obesity and breast cancer risk, central obesity and circulating fasting insulin and glucose have been linked to the... Show moreBackground: In addition to the established association between general obesity and breast cancer risk, central obesity and circulating fasting insulin and glucose have been linked to the development of this common malignancy. Findings from previous studies, however, have been inconsistent, and the nature of the associations is unclear.Methods: We conducted Mendelian randomization analyses to evaluate the association of breast cancer risk, using genetic instruments, with fasting insulin, fasting glucose, 2-h glucose, body mass index (BMI) and BMI-adjusted waist-hip-ratio (WHRadj BMI). We first confirmed the association of these instruments with type 2 diabetes risk in a large diabetes genome-wide association study consortium. We then investigated their associations with breast cancer risk using individual-level data obtained from 98 842 cases and 83 464 controls of European descent in the Breast Cancer Association Consortium.Results: All sets of instruments were associated with risk of type 2 diabetes. Associations with breast cancer risk were found for genetically predicted fasting insulin [odds ratio (OR) = 1.71 per standard deviation (SD) increase, 95% confidence interval (CI) = 1.26-2.31, p = 5.09 x 10(-4)], 2-h glucose (OR = 1.80 per SD increase, 95% CI = 1.3 0-2.49, p = 4.02 x 10(-4)), BMI (OR = 0.70 per 5-unit increase, 95% CI = 0.65-0.76, p = 5.05 x 10(-19)) and WHRadj BMI (OR = 0.85, 95% CI = 0.79-0.91, p = 9.22 x 10(-6)). Stratified analyses showed that genetically predicted fasting insulin was more closely related to risk of estrogen-receptor [ER]-positive cancer, whereas the associations with instruments of 2h glucose, BMI and WHRadj BMI were consistent regardless of age, menopausal status, estrogen receptor status and family history of breast cancer.Conclusions: We confirmed the previously reported inverse association of genetically predicted BMI with breast cancer risk, and showed a positive association of genetically predicted fasting insulin and 2-h glucose and an inverse association of WHRadj BMI with breast cancer risk. Our study suggests that genetically determined obesity and glucose/insulin-related traits have an important role in the aetiology of breast cancer. Show less
Genome-wide association studies (GWAS) have identified more than 170 breast cancer susceptibility loci. Here we hypothesize that some risk-associated variants might act in non-breast tissues,... Show moreGenome-wide association studies (GWAS) have identified more than 170 breast cancer susceptibility loci. Here we hypothesize that some risk-associated variants might act in non-breast tissues, specifically adipose tissue and immune cells from blood and spleen. Using expression quantitative trait loci (eQTL) reported in these tissues, we identify 26 previously unreported, likely target genes of overall breast cancer risk variants, and 17 for estrogen receptor (ER)-negative breast cancer, several with a known immune function. We determine the directional effect of gene expression on disease risk measured based on single and multiple eQTL. In addition, using a gene-based test of association that considers eQTL from multiple tissues, we identify seven (and four) regions with variants associated with overall (and ER-negative) breast cancer risk, which were not reported in previous GWAS. Further investigation of the function of the implicated genes in breast and immune cells may provide insights into the etiology of breast cancer. Show less
BACKGROUND: We examined the associations between germline variants and breast cancer mortality using a large meta-analysis of women of European ancestry.METHODS: Meta-analyses included summary... Show moreBACKGROUND: We examined the associations between germline variants and breast cancer mortality using a large meta-analysis of women of European ancestry.METHODS: Meta-analyses included summary estimates based on Cox models of twelve datasets using similar to 10.4 million variants for 96,661 women with breast cancer and 7697 events (breast cancer-specific deaths). Oestrogen receptor (ER)-specific analyses were based on 64,171 ER-positive (4116) and 16,172 ER-negative (2125) patients. We evaluated the probability of a signal to be a true positive using the Bayesian false discovery probability (BFDP).RESULTS: We did not find any variant associated with breast cancer-specific mortality at P<5 x 10(-8). For ER-positive disease, the most significantly associated variant was chr7:rs4717568 (BFDP = 7%, P = 1.28 x 10(-7), hazard ratio [HR] = 0.88, 95% confidence interval [ CI] = 0.84-0.92); the closest gene is AUTS2. For ER-negative disease, the most significant variant was chr7: rs67918676 (BFDP = 11%, P = 1.38 x 10(-7), HR = 1.27, 95% CI = 1.16-1.39); located within a long intergenic non-coding RNA gene (AC004009.3), close to the HOXA gene cluster.CONCLUSIONS: We uncovered germline variants on chromosome 7 at BFDP <15% close to genes for which there is biological evidence related to breast cancer outcome. However, the paucity of variants associated with mortality at genome-wide significance underpins the challenge in providing genetic-based individualised prognostic information for breast cancer patients. Show less
Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for... Show moreStratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57-1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628-0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs. Show less