Background Previous phylogeographic studies of the lion (Panthera leo) have improved our insight into the distribution of genetic variation, as well as a revised taxonomy which now recognizes a... Show moreBackground Previous phylogeographic studies of the lion (Panthera leo) have improved our insight into the distribution of genetic variation, as well as a revised taxonomy which now recognizes a northern (Panthera leo leo) and a southern (Panthera leo melanochaita) subspecies. However, existing whole range phylogeographic studies on lions either consist of very limited numbers of samples, or are focused on mitochondrial DNA and/or a limited set of microsatellites. The geographic extent of genetic lineages and their phylogenetic relationships remain uncertain, clouded by massive sampling gaps, sex-biased dispersal and incomplete lineage sorting. Results In this study we present results of low depth whole genome sequencing and subsequent variant calling in ten lions sampled throughout the geographic range, resulting in the discovery of >150,000 Single Nucleotide Polymorphisms (SNPs). Phylogenetic analyses revealed the same basal split between northern and southern populations, as well as four population clusters on a more local scale. Further, we designed a SNP panel, including 125 autosomal and 14 mitochondrial SNPs, which was tested on >200 lions from across their range. Results allow us to assign individuals to one of these four major clades (West & Central Africa, India, East Africa, or Southern Africa) and delineate these clades in more detail. Conclusions The results presented here, particularly the validated SNP panel, have important applications, not only for studying populations on a local geographic scale, but also for tracing samples of unknown origin for forensic purposes, and for guiding conservation management of ex situ populations. Thus, these genomic resources not only contribute to our understanding of the evolutionary history of the lion, but may also play a crucial role in conservation efforts aimed at protecting the species in its full diversity. Show less
Bertola, L.D.; Vermaat, M.; Lesilau, F.; Chege, M.; Tumenta, P.N.; Sogbohossou, E.A.; ... ; Vrieling, K. 2022
Background Previous phylogeographic studies of the lion (Panthera leo) have improved our insight into the distribution of genetic variation, as well as a revised taxonomy which now recognizes a... Show moreBackground Previous phylogeographic studies of the lion (Panthera leo) have improved our insight into the distribution of genetic variation, as well as a revised taxonomy which now recognizes a northern (Panthera leo leo) and a southern (Panthera leo melanochaita) subspecies. However, existing whole range phylogeographic studies on lions either consist of very limited numbers of samples, or are focused on mitochondrial DNA and/or a limited set of microsatellites. The geographic extent of genetic lineages and their phylogenetic relationships remain uncertain, clouded by massive sampling gaps, sex-biased dispersal and incomplete lineage sorting. Results In this study we present results of low depth whole genome sequencing and subsequent variant calling in ten lions sampled throughout the geographic range, resulting in the discovery of >150,000 Single Nucleotide Polymorphisms (SNPs). Phylogenetic analyses revealed the same basal split between northern and southern populations, as well as four population clusters on a more local scale. Further, we designed a SNP panel, including 125 autosomal and 14 mitochondrial SNPs, which was tested on >200 lions from across their range. Results allow us to assign individuals to one of these four major clades (West & Central Africa, India, East Africa, or Southern Africa) and delineate these clades in more detail. Conclusions The results presented here, particularly the validated SNP panel, have important applications, not only for studying populations on a local geographic scale, but also for tracing samples of unknown origin for forensic purposes, and for guiding conservation management of ex situ populations. Thus, these genomic resources not only contribute to our understanding of the evolutionary history of the lion, but may also play a crucial role in conservation efforts aimed at protecting the species in its full diversity. Show less
Lefter, M.; Vis, J.K.; Vermaat, M.; Dunnen, J.T. den; Taschner, P.E.M.; Laros, J.F.J. 2021
Motivation: Unambiguous variant descriptions are of utmost importance in clinical genetic diagnostics, scientific literature and genetic databases. The Human Genome Variation Society (HGVS)... Show moreMotivation: Unambiguous variant descriptions are of utmost importance in clinical genetic diagnostics, scientific literature and genetic databases. The Human Genome Variation Society (HGVS) publishes a comprehensive set of guidelines on how variants should be correctly and unambiguously described. We present the implementation of the Mutalyzer 2 tool suite, designed to automatically apply the HGVS guidelines so users do not have to deal with the HGVS intricacies explicitly to check and correct their variant descriptions.Results: Mutalyzer is profusely used by the community, having processed over 133 million descriptions since its launch. Over a five year period, Mutalyzer reported a correct input in similar to 50% of cases. In 41% of the cases either a syntactic or semantic error was identified and for similar to 7% of cases, Mutalyzer was able to automatically correct the description. Show less
Background: Clinicians need to rapidly and reliably diagnose coronavirus disease 2019 (COVID-19) for proper risk stratification, isolation strategies, and treatment decisions.Purpose: To assess the... Show moreBackground: Clinicians need to rapidly and reliably diagnose coronavirus disease 2019 (COVID-19) for proper risk stratification, isolation strategies, and treatment decisions.Purpose: To assess the real-life performance of radiologist emergency department chest CT interpretation for diagnosing COVID-19 during the acute phase of the pandemic, using the COVID-19 Reporting and Data System (CO-RADS).Materials and Methods: This retrospective multicenter study included consecutive patients who presented to emergency departments in six medical centers between March and April 2020 with moderate to severe upper respiratory symptoms suspicious for COVID-19. As part of clinical practice, chest CT scans were obtained for primary work-up and scored using the five-point CO-RADS scheme for suspicion of COVID-19. CT was compared with severe acute respiratory syndrome coronavirus 2 reverse-transcription polymerase chain reaction (RT-PCR) assay and a clinical reference standard established by a multidisciplinary group of clinicians based on RT-PCR, COVID-19 contact history, oxygen therapy, timing of RT-PCR testing, and likely alternative diagnosis. Performance of CT was estimated using area under the receiver operating characteristic curve (AUC) analysis and diagnostic odds ratios against both reference standards. Subgroup analysis was performed on the basis of symptom duration grouped presentations of less than 48 hours, 48 hours through 7 days, and more than 7 days.Results: A total of 1070 patients (median age, 66 years; interquartile range, 54-75 years; 626 men) were included, of whom 536 (50%) had a positive RT-PCR result and 137 (13%) of whom were considered to have a possible or probable COVID-19 diagnosis based on the clinical reference standard. Chest CT yielded an AUC of 0.87 (95% CI: 0.84, 0.89) compared with RT-PCR and 0.87(95% CI: 0.85, 0.89) compared with the clinical reference standard. A CO-RADS score of 4 or greater yielded an odds ratio of 25.9 (95% CI: 18.7, 35.9) for a COVID-19 diagnosis with RT-PCR and an odds ratio of 30.6 (95% CI: 21.1, 44.4) with the clinical reference standard. For symptom duration of less than 48 hours, the AUC fell to 0.71 (95% CI: 0.62, 0.80; P =.001).Conclusion: Chest CT analysis using the coronavirus disease 2019 (COVID-19) Reporting and Data System enables rapid and reliable diagnosis of COVID-19, particularly when symptom duration is greater than 48 hours. (C) RSNA, 2020 Show less
Lessmann, N.; Sanchez, C.I.; Beenen, L.; Boulogne, L.H.; Brink, M.; Calli, E.; ... ; Ginneken, B. van 2021
Background: The coronavirus disease 2019 (COVID-19) pandemic has spread across the globe with alarming speed, morbidity, and mortality. Immediate triage of patients with chest infections suspected... Show moreBackground: The coronavirus disease 2019 (COVID-19) pandemic has spread across the globe with alarming speed, morbidity, and mortality. Immediate triage of patients with chest infections suspected to be caused by COVID-19 using chest CT may be of assistance when results from definitive viral testing are delayed.Purpose: To develop and validate an artificial intelligence (AI) system to score the likelihood and extent of pulmonary COVID-19 on chest CT scans using the COVID-19 Reporting and Data System (CO-RADS) and CT severity scoring systems.Materials and Methods: The CO-RADS AI system consists of three deep-learning algorithms that automatically segment the five pulmonary lobes, assign a CO-RADS score for the suspicion of COVID-19, and assign a CT severity score for the degree of parenchymal involvement per lobe. This study retrospectively included patients who underwent a nonenhanced chest CT examination because of clinical suspicion of COVID-19 at two medical centers. The system was trained, validated, and tested with data from one of the centers. Data from the second center served as an external test set. Diagnostic performance and agreement with scores assigned by eight independent observers were measured using receiver operating characteristic analysis, linearly weighted kappa values, and classification accuracy.Results: A total of 105 patients (mean age, 62 years +/- 16 [standard deviation]; 61 men) and 262 patients (mean age, 64 years +/- 16; 154 men) were evaluated in the internal and external test sets, respectively. The system discriminated between patients with COVID-19 and those without COVID-19, with areas under the receiver operating characteristic curve of 0.95 (95% CI: 0.91, 0.98) and 0.88 (95% CI: 0.84, 0.93), for the internal and external test sets, respectively. Agreement with the eight human observers was moderate to substantial, with mean linearly weighted k values of 0.60 +/- 0.01 for CO-RADS scores and 0.54 +/- 0.01 for CT severity scores.Conclusion: With high diagnostic performance, the CO-RADS AI system correctly identified patients with COVID-19 using chest CT scans and assigned standardized CO-RADS and CT severity scores that demonstrated good agreement with findings from eight independent observers and generalized well to external data. (C) RSNA, 2020 Show less
Insights into individual differences in gene expression and its heritability (h(2)) can help in understanding pathways from DNA to phenotype. We estimated the heritability of gene expression of 52... Show moreInsights into individual differences in gene expression and its heritability (h(2)) can help in understanding pathways from DNA to phenotype. We estimated the heritability of gene expression of 52,844 genes measured in whole blood in the largest twin RNA-Seq sample to date (1497 individuals including 459 monozygotic twin pairs and 150 dizygotic twin pairs) from classical twin modeling and identity-by-state-based approaches. We estimated for each gene h(total)(2), composed of cis-heritability (h(cis)(2), the variance explained by single nucleotide polymorphisms in the cis-window of the gene), and trans-heritability (h(res)(2), the residual variance explained by all other genome-wide variants). Mean h(total)(2) was 0.26, which was significantly higher than heritability estimates earlier found in a microarray-based study using largely overlapping (>60%) RNA samples (mean h(2) = 0.14, p = 6.15 x 10(-258)). Mean h(cis)(2) was 0.06 and strongly correlated with beta of the top cis expression quantitative loci (eQTL, rho = 0.76, p < 10(-308)) and with estimates from earlier RNA-Seq-based studies. Mean h(res)(2) was 0.20 and correlated with the beta of the corresponding trans-eQTL (rho = 0.04, p < 1.89 x 10(-3)) and was significantly higher for genes involved in cytokine-cytokine interactions (p = 4.22 x 10(-15)), many other immune system pathways, and genes identified in genome-wide association studies for various traits including behavioral disorders and cancer. This study provides a thorough characterization of cis- and trans-h(2) estimates of gene expression, which is of value for interpretation of GWAS and gene expression studies. Show less
Telomere length (TL) regulation is an important factor in ageing, reproduction and cancer development. Genetic, hereditary and environmental factors regulating TL are currently widely investigated,... Show moreTelomere length (TL) regulation is an important factor in ageing, reproduction and cancer development. Genetic, hereditary and environmental factors regulating TL are currently widely investigated, however, their relative contribution to TL variability is still understudied. We have used whole genome sequencing data of 250 family trios from the Genome of the Netherlands project to perform computational measurement of TL and a series of regression and genome-wide association analyses to reveal TL inheritance patterns and associated genetic factors. Our results confirm that TL is a largely heritable trait, primarily with mother's, and, to a lesser extent, with father's TL having the strongest influence on the offspring. In this cohort, mother's, but not father's age at conception was positively linked to offspring TL. Age-related TL attrition of 40 bp/year had relatively small influence on TL variability. Finally, we have identified TL-associated variations in ribonuclease reductase catalytic subunit M1 (RRM1 gene), which is known to regulate telomere maintenance in yeast. We also highlight the importance of multivariate approach and the limitations of existing tools for the analysis of TL as a polygenic heritable quantitative trait. Show less
Kim et al. identify novel genes and disease pathways in the forebrain developmental disorder holoprosencephaly, and show that many cases involve oligogenic inheritance. The findings underline the... Show moreKim et al. identify novel genes and disease pathways in the forebrain developmental disorder holoprosencephaly, and show that many cases involve oligogenic inheritance. The findings underline the roles of Sonic Hedgehog and primary cilia in forebrain development, and show that integrating clinical phenotyping into genetic studies can uncover relevant mutations.Holoprosencephaly is a pathology of forebrain development characterized by high phenotypic heterogeneity. The disease presents with various clinical manifestations at the cerebral or facial levels. Several genes have been implicated in holoprosencephaly but its genetic basis remains unclear: different transmission patterns have been described including autosomal dominant, recessive and digenic inheritance. Conventional molecular testing approaches result in a very low diagnostic yield and most cases remain unsolved. In our study, we address the possibility that genetically unsolved cases of holoprosencephaly present an oligogenic origin and result from combined inherited mutations in several genes. Twenty-six unrelated families, for whom no genetic cause of holoprosencephaly could be identified in clinical settings [whole exome sequencing and comparative genomic hybridization (CGH)-array analyses], were reanalysed under the hypothesis of oligogenic inheritance. Standard variant analysis was improved with a gene prioritization strategy based on clinical ontologies and gene co-expression networks. Clinical phenotyping and exploration of cross-species similarities were further performed on a family-by-family basis. Statistical validation was performed on 248 ancestrally similar control trios provided by the Genome of the Netherlands project and on 574 ancestrally matched controls provided by the French Exome Project. Variants of clinical interest were identified in 180 genes significantly associated with key pathways of forebrain development including sonic hedgehog (SHH) and primary cilia. Oligogenic events were observed in 10 families and involved both known and novel holoprosencephaly genes including recurrently mutated FAT1, NDST1, COL2A1 and SCUBE2. The incidence of oligogenic combinations was significantly higher in holoprosencephaly patients compared to two control populations (P < 10(9)). We also show that depending on the affected genes, patients present with particular clinical features. This study reports novel disease genes and supports oligogenicity as clinically relevant model in holoprosencephaly. It also highlights key roles of SHH signalling and primary cilia in forebrain development. We hypothesize that distinction between different clinical manifestations of holoprosencephaly lies in the degree of overall functional impact on SHH signalling. Finally, we underline that integrating clinical phenotyping in genetic studies is a powerful tool to specify the clinical relevance of certain mutations. Show less
X-inactivation is a well-established dosage compensation mechanism ensuring that X-chromosomal genes are expressed at comparable levels in males and females. Skewed X-inactivation is often... Show moreX-inactivation is a well-established dosage compensation mechanism ensuring that X-chromosomal genes are expressed at comparable levels in males and females. Skewed X-inactivation is often explained by negative selection of one of the alleles. We demonstrate that imbalanced expression of the paternal and maternal X-chromosomes is common in the general population and that the random nature of the X-inactivation mechanism can be sufficient to explain the imbalance. To this end, we analyzed blood-derived RNA and whole-genome sequencing data from 79 female children and their parents from the Genome of the Netherlands project. We calculated the median ratio of the paternal over total counts at all X-chromosomal heterozygous single-nucleotide variants with coverage ≥10. We identified two individuals where the same X-chromosome was inactivated in all cells. Imbalanced expression of the two X-chromosomes (ratios ≤0.35 or ≥0.65) was observed in nearly 50% of the population. The empirically observed skewing is explained by a theoretical model where X-inactivation takes place in an embryonic stage in which eight cells give rise to the hematopoietic compartment. Genes escaping X-inactivation are expressed from both alleles and therefore demonstrate less skewing than inactivated genes. Using this characteristic, we identified three novel escapee genes (SSR4, REPS2, and SEPT6), but did not find support for many previously reported escapee genes in blood. Our collective data suggest that skewed X-inactivation is common in the general population. This may contribute to manifestation of symptoms in carriers of recessive X-linked disorders. We recommend that X-inactivation results should not be used lightly in the interpretation of X-linked variants. Show less
Anvar, S.Y.; Allard, G.; Tseng, E.; Sheynkman, G.M.; Klerk, E. de; Vermaat, M.; ... ; Hoen, P.A.C. 't 2018