Background Genome-wide association studies (GWAS) have identified multiple common breast cancer susceptibility variants. Many of these variants have differential associations by estrogen receptor ... Show moreBackground Genome-wide association studies (GWAS) have identified multiple common breast cancer susceptibility variants. Many of these variants have differential associations by estrogen receptor (ER) status, but how these variants relate with other tumor features and intrinsic molecular subtypes is unclear. Methods Among 106,571 invasive breast cancer cases and 95,762 controls of European ancestry with data on 173 breast cancer variants identified in previous GWAS, we used novel two-stage polytomous logistic regression models to evaluate variants in relation to multiple tumor features (ER, progesterone receptor (PR), human epidermal growth factor receptor 2 (HER2) and grade) adjusting for each other, and to intrinsic-like subtypes. Results Eighty-five of 173 variants were associated with at least one tumor feature (false discovery rate < 5%), most commonly ER and grade, followed by PR and HER2. Models for intrinsic-like subtypes found nearly all of these variants (83 of 85) associated at p < 0.05 with risk for at least one luminal-like subtype, and approximately half (41 of 85) of the variants were associated with risk of at least one non-luminal subtype, including 32 variants associated with triple-negative (TN) disease. Ten variants were associated with risk of all subtypes in different magnitude. Five variants were associated with risk of luminal A-like and TN subtypes in opposite directions. Conclusion This report demonstrates a high level of complexity in the etiology heterogeneity of breast cancer susceptibility variants and can inform investigations of subtype-specific risk prediction. Show less
Background Given the high heterogeneity among breast tumors, associations between common germline genetic variants and survival that may exist within specific subgroups could go undetected in an... Show moreBackground Given the high heterogeneity among breast tumors, associations between common germline genetic variants and survival that may exist within specific subgroups could go undetected in an unstratified set of breast cancer patients. Methods We performed genome-wide association analyses within 15 subgroups of breast cancer patients based on prognostic factors, including hormone receptors, tumor grade, age, and type of systemic treatment. Analyses were based on 91,686 female patients of European ancestry from the Breast Cancer Association Consortium, including 7531 breast cancer-specific deaths over a median follow-up of 8.1 years. Cox regression was used to assess associations of common germline variants with 15-year and 5-year breast cancer-specific survival. We assessed the probability of these associations being true positives via the Bayesian false discovery probability (BFDP < 0.15). Results Evidence of associations with breast cancer-specific survival was observed in three patient subgroups, with variant rs5934618 in patients with grade 3 tumors (15-year-hazard ratio (HR) [95% confidence interval (CI)] 1.32 [1.20, 1.45], P = 1.4E-08, BFDP = 0.01, per G allele); variant rs4679741 in patients with ER-positive tumors treated with endocrine therapy (15-year-HR [95% CI] 1.18 [1.11, 1.26], P = 1.6E-07, BFDP = 0.09, per G allele); variants rs1106333 (15-year-HR [95% CI] 1.68 [1.39,2.03], P = 5.6E-08, BFDP = 0.12, per A allele) and rs78754389 (5-year-HR [95% CI] 1.79 [1.46,2.20], P = 1.7E-08, BFDP = 0.07, per A allele), in patients with ER-negative tumors treated with chemotherapy. Conclusions We found evidence of four loci associated with breast cancer-specific survival within three patient subgroups. There was limited evidence for the existence of associations in other patient subgroups. However, the power for many subgroups is limited due to the low number of events. Even so, our results suggest that the impact of common germline genetic variants on breast cancer-specific survival might be limited. Show less
Genome-wide analysis identifies 32 loci associated with breast cancer susceptibility, accounting for estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 status and... Show moreGenome-wide analysis identifies 32 loci associated with breast cancer susceptibility, accounting for estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 status and tumor grade.Breast cancer susceptibility variants frequently show heterogeneity in associations by tumor subtype(1-3). To identify novel loci, we performed a genome-wide association study including 133,384 breast cancer cases and 113,789 controls, plus 18,908 BRCA1 mutation carriers (9,414 with breast cancer) of European ancestry, using both standard and novel methodologies that account for underlying tumor heterogeneity by estrogen receptor, progesterone receptor and human epidermal growth factor receptor 2 status and tumor grade. We identified 32 novel susceptibility loci (P < 5.0 x 10(-8)), 15 of which showed evidence for associations with at least one tumor feature (false discovery rate < 0.05). Five loci showed associations (P < 0.05) in opposite directions between luminal and non-luminal subtypes. In silico analyses showed that these five loci contained cell-specific enhancers that differed between normal luminal and basal mammary cells. The genetic correlations between five intrinsic-like subtypes ranged from 0.35 to 0.80. The proportion of genome-wide chip heritability explained by all known susceptibility loci was 54.2% for luminal A-like disease and 37.6% for triple-negative disease. The odds ratios of polygenic risk scores, which included 330 variants, for the highest 1% of quantiles compared with middle quantiles were 5.63 and 3.02 for luminal A-like and triple-negative disease, respectively. These findings provide an improved understanding of genetic predisposition to breast cancer subtypes and will inform the development of subtype-specific polygenic risk scores. Show less
Fine-mapping of causal variants and integration of epigenetic and chromatin conformation data identify likely target genes for 150 breast cancer risk regions.Genome-wide association studies have... Show moreFine-mapping of causal variants and integration of epigenetic and chromatin conformation data identify likely target genes for 150 breast cancer risk regions.Genome-wide association studies have identified breast cancer risk variants in over 150 genomic regions, but the mechanisms underlying risk remain largely unknown. These regions were explored by combining association analysis with in silico genomic feature annotations. We defined 205 independent risk-associated signals with the set of credible causal variants in each one. In parallel, we used a Bayesian approach (PAINTOR) that combines genetic association, linkage disequilibrium and enriched genomic features to determine variants with high posterior probabilities of being causal. Potentially causal variants were significantly over-represented in active gene regulatory regions and transcription factor binding sites. We applied our INQUSIT pipeline for prioritizing genes as targets of those potentially causal variants, using gene expression (expression quantitative trait loci), chromatin interaction and functional annotations. Known cancer drivers, transcription factors and genes in the developmental, apoptosis, immune system and DNA integrity checkpoint gene ontology pathways were over-represented among the highest-confidence target genes. Show less
Background: In addition to the established association between general obesity and breast cancer risk, central obesity and circulating fasting insulin and glucose have been linked to the... Show moreBackground: In addition to the established association between general obesity and breast cancer risk, central obesity and circulating fasting insulin and glucose have been linked to the development of this common malignancy. Findings from previous studies, however, have been inconsistent, and the nature of the associations is unclear.Methods: We conducted Mendelian randomization analyses to evaluate the association of breast cancer risk, using genetic instruments, with fasting insulin, fasting glucose, 2-h glucose, body mass index (BMI) and BMI-adjusted waist-hip-ratio (WHRadj BMI). We first confirmed the association of these instruments with type 2 diabetes risk in a large diabetes genome-wide association study consortium. We then investigated their associations with breast cancer risk using individual-level data obtained from 98 842 cases and 83 464 controls of European descent in the Breast Cancer Association Consortium.Results: All sets of instruments were associated with risk of type 2 diabetes. Associations with breast cancer risk were found for genetically predicted fasting insulin [odds ratio (OR) = 1.71 per standard deviation (SD) increase, 95% confidence interval (CI) = 1.26-2.31, p = 5.09 x 10(-4)], 2-h glucose (OR = 1.80 per SD increase, 95% CI = 1.3 0-2.49, p = 4.02 x 10(-4)), BMI (OR = 0.70 per 5-unit increase, 95% CI = 0.65-0.76, p = 5.05 x 10(-19)) and WHRadj BMI (OR = 0.85, 95% CI = 0.79-0.91, p = 9.22 x 10(-6)). Stratified analyses showed that genetically predicted fasting insulin was more closely related to risk of estrogen-receptor [ER]-positive cancer, whereas the associations with instruments of 2h glucose, BMI and WHRadj BMI were consistent regardless of age, menopausal status, estrogen receptor status and family history of breast cancer.Conclusions: We confirmed the previously reported inverse association of genetically predicted BMI with breast cancer risk, and showed a positive association of genetically predicted fasting insulin and 2-h glucose and an inverse association of WHRadj BMI with breast cancer risk. Our study suggests that genetically determined obesity and glucose/insulin-related traits have an important role in the aetiology of breast cancer. Show less
BACKGROUND: We examined the associations between germline variants and breast cancer mortality using a large meta-analysis of women of European ancestry.METHODS: Meta-analyses included summary... Show moreBACKGROUND: We examined the associations between germline variants and breast cancer mortality using a large meta-analysis of women of European ancestry.METHODS: Meta-analyses included summary estimates based on Cox models of twelve datasets using similar to 10.4 million variants for 96,661 women with breast cancer and 7697 events (breast cancer-specific deaths). Oestrogen receptor (ER)-specific analyses were based on 64,171 ER-positive (4116) and 16,172 ER-negative (2125) patients. We evaluated the probability of a signal to be a true positive using the Bayesian false discovery probability (BFDP).RESULTS: We did not find any variant associated with breast cancer-specific mortality at P<5 x 10(-8). For ER-positive disease, the most significantly associated variant was chr7:rs4717568 (BFDP = 7%, P = 1.28 x 10(-7), hazard ratio [HR] = 0.88, 95% confidence interval [ CI] = 0.84-0.92); the closest gene is AUTS2. For ER-negative disease, the most significant variant was chr7: rs67918676 (BFDP = 11%, P = 1.38 x 10(-7), HR = 1.27, 95% CI = 1.16-1.39); located within a long intergenic non-coding RNA gene (AC004009.3), close to the HOXA gene cluster.CONCLUSIONS: We uncovered germline variants on chromosome 7 at BFDP <15% close to genes for which there is biological evidence related to breast cancer outcome. However, the paucity of variants associated with mortality at genome-wide significance underpins the challenge in providing genetic-based individualised prognostic information for breast cancer patients. Show less
Stratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for... Show moreStratification of women according to their risk of breast cancer based on polygenic risk scores (PRSs) could improve screening and prevention strategies. Our aim was to develop PRSs, optimized for prediction of estrogen receptor (ER)-specific disease, from the largest available genome-wide association dataset and to empirically validate the PRSs in prospective studies. The development dataset comprised 94,075 case subjects and 75,017 control subjects of European ancestry from 69 studies, divided into training and validation sets. Samples were genotyped using genome-wide arrays, and single-nucleotide polymorphisms (SNPs) were selected by stepwise regression or lasso penalized regression. The best performing PRSs were validated in an independent test set comprising 11,428 case subjects and 18,323 control subjects from 10 prospective studies and 190,040 women from UK Biobank (3,215 incident breast cancers). For the best PRSs (313 SNPs), the odds ratio for overall disease per 1 standard deviation in ten prospective studies was 1.61 (95%CI: 1.57-1.65) with area under receiver-operator curve (AUC) = 0.630 (95%CI: 0.628-0.651). The lifetime risk of overall breast cancer in the top centile of the PRSs was 32.6%. Compared with women in the middle quintile, those in the highest 1% of risk had 4.37- and 2.78-fold risks, and those in the lowest 1% of risk had 0.16- and 0.27-fold risks, of developing ER-positive and ER-negative disease, respectively. Goodness-of-fit tests indicated that this PRS was well calibrated and predicts disease risk accurately in the tails of the distribution. This PRS is a powerful and reliable predictor of breast cancer risk that may improve breast cancer prevention programs. Show less