Background Genome-wide association studies (GWAS) have identified multiple common breast cancer susceptibility variants. Many of these variants have differential associations by estrogen receptor ... Show moreBackground Genome-wide association studies (GWAS) have identified multiple common breast cancer susceptibility variants. Many of these variants have differential associations by estrogen receptor (ER) status, but how these variants relate with other tumor features and intrinsic molecular subtypes is unclear. Methods Among 106,571 invasive breast cancer cases and 95,762 controls of European ancestry with data on 173 breast cancer variants identified in previous GWAS, we used novel two-stage polytomous logistic regression models to evaluate variants in relation to multiple tumor features (ER, progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2) and grade) adjusting for each other, and to intrinsic-like subtypes. Results Eighty-five of 173 variants were associated with at least one tumor feature (false discovery rate < 5%), most commonly ER and grade, followed by PR and HER2. Models for intrinsic-like subtypes found nearly all of these variants (83 of 85) associated at p < 0.05 with risk for at least one luminal-like subtype, and approximately half (41 of 85) of the variants were associated with risk of at least one non-luminal subtype, including 32 variants associated with triple-negative (TN) disease. Ten variants were associated with risk of all subtypes in different magnitude. Five variants were associated with risk of luminal A-like and TN subtypes in opposite directions. Conclusion This report demonstrates a high level of complexity in the etiology heterogeneity of breast cancer susceptibility variants and can inform investigations of subtype-specific risk prediction. Show less
BACKGROUNDGenetic testing for breast cancer susceptibility is widely used, but for many genes, evidence of an association with breast cancer is weak, underlying risk estimates are imprecise, and... Show moreBACKGROUNDGenetic testing for breast cancer susceptibility is widely used, but for many genes, evidence of an association with breast cancer is weak, underlying risk estimates are imprecise, and reliable subtype-specific risk estimates are lacking.METHODSWe used a panel of 34 putative susceptibility genes to perform sequencing on samples from 60,466 women with breast cancer and 53,461 controls. In separate analyses for protein-truncating variants and rare missense variants in these genes, we estimated odds ratios for breast cancer overall and tumor subtypes. We evaluated missense-variant associations according to domain and classification of pathogenicity.RESULTSProtein-truncating variants in 5 genes (ATM, BRCA1, BRCA2, CHEK2, and PALB2) were associated with a risk of breast cancer overall with a P value of less than 0.0001. Protein-truncating variants in 4 other genes (BARD1, RAD51C, RAD51D, and TP53) were associated with a risk of breast cancer overall with a P value of less than 0.05 and a Bayesian false-discovery probability of less than 0.05. For protein-truncating variants in 19 of the remaining 25 genes, the upper limit of the 95% confidence interval of the odds ratio for breast cancer overall was less than 2.0. For protein-truncating variants in ATM and CHEK2, odds ratios were higher for estrogen receptor (ER)-positive disease than for ER-negative disease; for protein-truncating variants in BARD1, BRCA1, BRCA2, PALB2, RAD51C, and RAD51D, odds ratios were higher for ER-negative disease than for ER-positive disease. Rare missense variants (in aggregate) in ATM, CHEK2, and TP53 were associated with a risk of breast cancer overall with a P value of less than 0.001. For BRCA1, BRCA2, and TP53, missense variants (in aggregate) that would be classified as pathogenic according to standard criteria were associated with a risk of breast cancer overall, with the risk being similar to that of protein-truncating variants.CONCLUSIONSThe results of this study define the genes that are most clinically useful for inclusion on panels for the prediction of breast cancer risk, as well as provide estimates of the risks associated with protein-truncating variants, to guide genetic counseling. (Funded by European Union Horizon 2020 programs and others.) Show less
Genome-wide analysis identifies 32 loci associated with breast cancer susceptibility, accounting for estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 status and... Show moreGenome-wide analysis identifies 32 loci associated with breast cancer susceptibility, accounting for estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 status and tumor grade.Breast cancer susceptibility variants frequently show heterogeneity in associations by tumor subtype(1-3). To identify novel loci, we performed a genome-wide association study including 133,384 breast cancer cases and 113,789 controls, plus 18,908 BRCA1 mutation carriers (9,414 with breast cancer) of European ancestry, using both standard and novel methodologies that account for underlying tumor heterogeneity by estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 status and tumor grade. We identified 32 novel susceptibility loci (P < 5.0 x 10(-8)), 15 of which showed evidence for associations with at least one tumor feature (false discovery rate < 0.05). Five loci showed associations (P < 0.05) in opposite directions between luminal and non-luminal subtypes. In silico analyses showed that these five loci contained cell-specific enhancers that differed between normal luminal and basal mammary cells. The genetic correlations between five intrinsic-like subtypes ranged from 0.35 to 0.80. The proportion of genome-wide chip heritability explained by all known susceptibility loci was 54.2% for luminal A-like disease and 37.6% for triple-negative disease. The odds ratios of polygenic risk scores, which included 330 variants, for the highest 1% of quantiles compared with middle quantiles were 5.63 and 3.02 for luminal A-like and triple-negative disease, respectively. These findings provide an improved understanding of genetic predisposition to breast cancer subtypes and will inform the development of subtype-specific polygenic risk scores. Show less
Fine-mapping of causal variants and integration of epigenetic and chromatin conformation data identify likely target genes for 150 breast cancer risk regions.Genome-wide association studies have... Show moreFine-mapping of causal variants and integration of epigenetic and chromatin conformation data identify likely target genes for 150 breast cancer risk regions.Genome-wide association studies have identified breast cancer risk variants in over 150 genomic regions, but the mechanisms underlying risk remain largely unknown. These regions were explored by combining association analysis with in silico genomic feature annotations. We defined 205 independent risk-associated signals with the set of credible causal variants in each one. In parallel, we used a Bayesian approach (PAINTOR) that combines genetic association, linkage disequilibrium and enriched genomic features to determine variants with high posterior probabilities of being causal. Potentially causal variants were significantly over-represented in active gene regulatory regions and transcription factor binding sites. We applied our INQUSIT pipeline for prioritizing genes as targets of those potentially causal variants, using gene expression (expression quantitative trait loci), chromatin interaction and functional annotations. Known cancer drivers, transcription factors and genes in the developmental, apoptosis, immune system and DNA integrity checkpoint gene ontology pathways were over-represented among the highest-confidence target genes. Show less
BACKGROUND: We examined the associations between germline variants and breast cancer mortality using a large meta-analysis of women of European ancestry.METHODS: Meta-analyses included summary... Show moreBACKGROUND: We examined the associations between germline variants and breast cancer mortality using a large meta-analysis of women of European ancestry.METHODS: Meta-analyses included summary estimates based on Cox models of twelve datasets using similar to 10.4 million variants for 96,661 women with breast cancer and 7697 events (breast cancer-specific deaths). Oestrogen receptor (ER)-specific analyses were based on 64,171 ER-positive (4116) and 16,172 ER-negative (2125) patients. We evaluated the probability of a signal to be a true positive using the Bayesian false discovery probability (BFDP).RESULTS: We did not find any variant associated with breast cancer-specific mortality at P<5 x 10(-8). For ER-positive disease, the most significantly associated variant was chr7:rs4717568 (BFDP = 7%, P = 1.28 x 10(-7), hazard ratio [HR] = 0.88, 95% confidence interval [ CI] = 0.84-0.92); the closest gene is AUTS2. For ER-negative disease, the most significant variant was chr7: rs67918676 (BFDP = 11%, P = 1.38 x 10(-7), HR = 1.27, 95% CI = 1.16-1.39); located within a long intergenic non-coding RNA gene (AC004009.3), close to the HOXA gene cluster.CONCLUSIONS: We uncovered germline variants on chromosome 7 at BFDP <15% close to genes for which there is biological evidence related to breast cancer outcome. However, the paucity of variants associated with mortality at genome-wide significance underpins the challenge in providing genetic-based individualised prognostic information for breast cancer patients. Show less
Quantifying the genetic correlation between cancers can provide important insights into themechanisms driving cancer etiology. Using genome-wide association study summary sta-tistics across six... Show moreQuantifying the genetic correlation between cancers can provide important insights into themechanisms driving cancer etiology. Using genome-wide association study summary sta-tistics across six cancer types based on a total of 296,215 cases and 301,319 controls ofEuropean ancestry, here we estimate the pair-wise genetic correlations between breast,colorectal, head/neck, lung, ovary and prostate cancer, and between cancers and 38 otherdiseases. We observed statistically significant genetic correlations between lung and head/neck cancer (rg = 0.57, p = 4.6 × 10−8), breast and ovarian cancer (rg = 0.24, p = 7 × 10−5 ),breast and lung cancer (rg = 0.18, p =1.5 × 10−6) and breast and colorectal cancer (rg = 0.15,p = 1.1 × 10−4 ). We also found that multiple cancers are genetically correlated with non-cancer traits including smoking, psychiatric diseases and metabolic characteristics. Functionalenrichment analysis revealed a significant excess contribution of conserved and regulatoryregions to cancer heritability. Our comprehensive analysis of cross-cancer heritability sug-gests that solid tumors arising across tissues share in part a common germline genetic basis. Show less
Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for... Show moreStratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57-1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628-0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs. Show less