Gene co-expression measurements are widely used in computational biology to identify coordinated expression patterns across a group of samples. Coordinated expression of genes may indicate that... Show moreGene co-expression measurements are widely used in computational biology to identify coordinated expression patterns across a group of samples. Coordinated expression of genes may indicate that they are controlled by the same transcriptional regulatory program, or involved in common biological processes. Gene co-expression is generally estimated from RNA-Sequencing data, which are commonly normalized to remove technical variability. Here, we demonstrate that certain normalization methods, in particular quantile-based methods, can introduce false-positive associations between genes. These false-positive associations can consequently hamper downstream co-expression network analysis. Quantile-based normalization can, however, be extremely powerful. In particular, when preprocessing large-scale heterogeneous data, quantile-based normalization methods such as smooth quantile normalization can be applied to remove technical variability while maintaining global differences in expression for samples with different biological attributes. Show less
Characterizing inter-tumor heterogeneity is crucial for selecting suitable cancer therapy, as the presence of diverse molecular subgroups of patients can be associated with disease outcome or... Show moreCharacterizing inter-tumor heterogeneity is crucial for selecting suitable cancer therapy, as the presence of diverse molecular subgroups of patients can be associated with disease outcome or response to treatment. While cancer subtypes are often characterized by differences in gene expression, the mechanisms driving these differences are generally unknown. We set out to model the regulatory mechanisms driving sarcoma heterogeneity based on patient-specific, genome-wide gene regulatory networks. We developed a new computational framework, PORCUPINE, which combines knowledge on biological pathways with permutation-based network analysis to identify pathways that exhibit significant regulatory heterogeneity across a patient population. We applied PORCUPINE to patient-specific leiomyosarcoma networks modeled on data from The Cancer Genome Atlas and validated our results in an independent dataset from the German Cancer Research Center. PORCUPINE identified 37 heterogeneously regulated pathways, including pathways representing potential targets for treatment of subgroups of leiomyosarcoma patients, such as FGFR and CTLA4 inhibitory signaling. We validated the detected regulatory heterogeneity through analysis of networks and chromatin states in leiomyosarcoma cell lines. We showed that the heterogeneity identified with PORCUPINE is not associated with methylation profiles or clinical features, thereby suggesting an independent mechanism of patient heterogeneity driven by the complex landscape of gene regulatory interactions. Show less
Guebila, M. ben; Wang, T.; Lopes-Ramos, C.M.; Fanfani, V.; Weighill, D.; Burkholz, R.; ... ; Quackenbush, J. 2023
Inference and analysis of gene regulatory networks (GRNs) require software that integrates multi-omic data from various sources. The Network Zoo (netZoo; netzoo.github.io) is a collection of open... Show moreInference and analysis of gene regulatory networks (GRNs) require software that integrates multi-omic data from various sources. The Network Zoo (netZoo; netzoo.github.io) is a collection of open-source methods to infer GRNs, conduct differential network analyses, estimate community structure, and explore the transitions between biological states. The netZoo builds on our ongoing development of network methods, harmonizing the implementations in various computing languages and between methods to allow better integration of these tools into analytical pipelines. We demonstrate the utility using multi-omic data from the Cancer Cell Line Encyclopedia. We will continue to expand the netZoo to incorporate additional methods. Show less
Birkeälv, S.; Harland, M.; Matsuyama, L.S.A.S.; Rashid, M.; Mehta, I.; Laye, J.P.; ... ; Adams, D.J. 2023
B and T cell receptor (immune) repertoires can represent an individual’s immune history. While current repertoire analysis methods aim to discriminate between health and disease states, they are... Show moreB and T cell receptor (immune) repertoires can represent an individual’s immune history. While current repertoire analysis methods aim to discriminate between health and disease states, they are typically based on only a limited number of parameters. Here, we introduce immuneREF: a quantitative multidimensional measure of adaptive immune repertoire (and transcriptome) similarity that allows interpretation of immune repertoire variation by relying on both repertoire features and cross-referencing of simulated and experimental datasets. To quantify immune repertoire similarity landscapes across health and disease, we applied immuneREF to >2,400 datasets from individuals with varying immune states (healthy, [autoimmune] disease, and infection). We discovered, in contrast to the current paradigm, that blood-derived immune repertoires of healthy and diseased individuals are highly similar for certain immune states, suggesting that repertoire changes to immune perturbations are less pronounced than previously thought. In conclusion, immuneREF enables the population-wide study of adaptive immune response similarity across immune states. Show less
MotivationCharacterizing cells with rare molecular phenotypes is one of the promises of high throughput single-cell RNA sequencing (scRNA-seq) techniques. However, collecting enough cells with the... Show moreMotivationCharacterizing cells with rare molecular phenotypes is one of the promises of high throughput single-cell RNA sequencing (scRNA-seq) techniques. However, collecting enough cells with the desired molecular phenotype in a single experiment is challenging, requiring several samples preprocessing steps to filter and collect the desired cells experimentally before sequencing. Data integration of multiple public single-cell experiments stands as a solution for this problem, allowing the collection of enough cells exhibiting the desired molecular signatures. By increasing the sample size of the desired cell type, this approach enables a robust cell type transcriptome characterization.ResultsHere, we introduce rPanglaoDB, an R package to download and merge the uniformly processed and annotated scRNA-seq data provided by the PanglaoDB database. To show the potential of rPanglaoDB for collecting rare cell types by integrating multiple public datasets, we present a biological application collecting and characterizing a set of 157 fibrocytes. Fibrocytes are a rare monocyte-derived cell type, that exhibits both the inflammatory features of macrophages and the tissue remodeling properties of fibroblasts. This constitutes the first fibrocytes’ unbiased transcriptome profile report. We compared the transcriptomic profile of the fibrocytes against the fibroblasts collected from the same tissue samples and confirm their associated relationship with healing processes in tissue damage and infection through the activation of the prostaglandin biosynthesis and regulation pathway. Show less
Guebila, M. ben; Morgan, D.C.; Glass, K.; Kuijjer, M.L.; DeMeo, D.L.; Quackenbush, J. 2022
Gene regulatory network inference allows for the modeling of genome-scale regulatory processes that are altered during development, in disease, and in response to perturbations. Our group has... Show moreGene regulatory network inference allows for the modeling of genome-scale regulatory processes that are altered during development, in disease, and in response to perturbations. Our group has developed a collection of tools to model various regulatory processes, including transcriptional (PANDA, SPIDER) and post-transcriptional (PUMA) gene regulation, as well as gene regulation in individual samples (LIONESS). These methods work by postulating a network structure and then optimizing that structure to be consistent with multiple lines of biological evidence through repeated operations on data matrices. Although our methods are widely used, the corresponding computational complexity, and the associated costs and run times, do limit some applications. To improve the cost/time performance of these algorithms, we developed gpuZoo which implements GPU-accelerated calculations, dramatically improving the performance of these algorithms. The runtime of the gpuZoo implementation in MATLAB and Python is up to 61 times faster and 28 times less expensive than multi-core CPU implementation of the same methods. gpuZoo is available in MATLAB through the netZooM package https://github.com/netZoo/netZooM and in Python through the netZooPy package https://github.com/netZoo/netZooPy. Show less
Guebila, M. ben; Lopes-Ramos, C.M.; Weighill, D.; Sonawane, A.R.; Burkholz, R.; Shamsaei, B.; ... ; Quackenbush, J. 2022
Gene regulation plays a fundamental role in shaping tissue identity, function, and response to perturbation. Regulatory processes are controlled by complex networks of interacting elements,... Show moreGene regulation plays a fundamental role in shaping tissue identity, function, and response to perturbation. Regulatory processes are controlled by complex networks of interacting elements, including transcription factors, miRNAs and their target genes. The structure of these networks helps to determine phenotypes and can ultimately influence the development of disease or response to therapy. We developed GRAND (https://grand.networkmedicine.org) as a database for computationally-inferred, context-specific gene regulatory network models that can be compared between biological states, or used to predict which drugs produce changes in regulatory network structure. The database includes 12 468 genome-scale networks covering 36 human tissues, 28 cancers, 1378 unperturbed cell lines, as well as 173 013 TF and gene targeting scores for 2858 small molecule-induced cell line perturbation paired with phenotypic information. GRAND allows the networks to be queried using phenotypic information and visualized using a variety of interactive tools. In addition, it includes a web application that matches disease states to potentially therapeutic small molecule drugs using regulatory network properties. Show less
Oost, S. van; Meijer, D.M.; Kuijjer, M.L.; Bovee, J.V.M.G.; Miranda, N.F.C.C. de 2021
Sarcomas comprise a collection of highly heterogeneous malignancies that can be grossly grouped in the categories of sarcomas with simple or complex genomes. Since the outcome for most sarcoma... Show moreSarcomas comprise a collection of highly heterogeneous malignancies that can be grossly grouped in the categories of sarcomas with simple or complex genomes. Since the outcome for most sarcoma patients has barely improved in the last decades, there is an urgent need for improved therapies. Immunotherapy, and especially T cell checkpoint blockade, has recently been a game-changer in cancer therapy as it produced significant and durable treatment responses in several cancer types. Currently, only a small fraction of sarcoma patients benefit from immunotherapy, supposedly due to a general lack of somatically mutated antigens (neoantigens) and spontaneous T cell immunity in most cancers. However, genomic events resulting from chromosomal instability are frequent in sarcomas with complex genomes and could drive immunity in those tumors. Improving our understanding of the mechanisms that shape the immune landscape of sarcomas will be crucial to overcoming the current challenges of sarcoma immunotherapy. This review focuses on what is currently known about the tumor microenvironment in sarcomas and how this relates to their genomic features. Moreover, we discuss novel therapeutic strategies that leverage the tumor microenvironment to increase the clinical efficacy of immunotherapy, and which could provide new avenues for the treatment of sarcomas. Show less
Conventional high-grade osteosarcoma is the most common primary bone cancer with relatively high incidence in young people. Recurrent and metastatic tumors are difficult to treat. We performed a... Show moreConventional high-grade osteosarcoma is the most common primary bone cancer with relatively high incidence in young people. Recurrent and metastatic tumors are difficult to treat. We performed a kinase inhibitor screen in two osteosarcoma cell lines, which identified MEK1/2 inhibitors. These inhibitors were further validated in a panel of six osteosarcoma cell lines. Western blot analysis was performed to assess ERK activity and efficacy of MEK inhibition. A 3D culture system was used to validate results from 2D monolayer cultures. Gene expression analysis was performed to identify differentially expressed gene signatures in sensitive and resistant cell lines. Activation of the AKT signaling network was explored using Western blot and pharmacological inhibition. In the screen, Trametinib, AZD8330 and TAK-733 decreased cell viability by more than 50%. Validation in six osteosarcoma cell lines identified three cell lines as resistant and three as sensitive to the inhibitors. Western blot analysis of ERK activity revealed that sensitive lines had high constitutive ERK activity. Treatment with the three MEK inhibitors in a 3D culture system validated efficacy in inhibition of osteosarcoma viability. MEK1/2 inhibition represents a candidate treatment strategy for osteosarcomas displaying high MEK activity as determined by ERK phosphorylation status. Show less
Background: In vitro expanded mesenchymal stromal cells (MSCs) are increasingly used as experimental cellular therapy. However, there have been concerns regarding the safety of their use,... Show moreBackground: In vitro expanded mesenchymal stromal cells (MSCs) are increasingly used as experimental cellular therapy. However, there have been concerns regarding the safety of their use, particularly with regard to possible oncogenic transformation. MSCs are the hypothesized precursor cells of high-grade osteosarcoma, a tumor with often complex karyotypes occurring mainly in adolescents and young adults.Methods: To determine if MSCs from osteosarcoma patients could be predisposed to malignant transformation we cultured MSCs of nine osteosarcoma patients and five healthy donors for an average of 649 days (range 601679 days). Also, we compared MSCs derived from osteosarcoma patients at diagnosis and from healthy donors using genome wide gene expression profiling.Results: Upon increasing passage, increasing frequencies of binucleate cells were detected, but no increase in proliferation suggestive of malignant transformation occurred in MSCs from either patients or donors. Hematopoietic cell specific Lyn substrate 1 (HLCS1) was differentially expressed (fold change 0.25, P value 0.0005) between MSCs of osteosarcoma patients (n = 14) and healthy donors (n = 9).Conclusions: This study shows that although HCLS1 expression was downregulated in MSCs of osteosarcoma patients and binucleate cells were present in both patient and donor derived MSCs, there was no evidence of neoplastic changes to occur during long-term culture. Show less
Wouters, M.C.A.; Dijkgraaf, E.M.; Kuijjer, M.L.; Jordanova, E.S.; Hollema, H.; Welters, M.J.P.; ... ; Burg, S. van der 2014