CELLO

Use CELLO2 to predict glioma evolution

CELLO2 contains curated clinical and somatic genomic information of longitudinally paired gliomas

CELLO2 is developed and maintained by Wang-Lab@HKUST.

Please enter gene name:

Group by:

Plot longitudinal pairs

Gene expression

Genomic alteration heatmap

Cases Overview

Please enter patient ID:

Clinical information

Mutations

Copy Number Alterations

Follow the three steps to use this tool:

Preparation: download the template and fill the form.
Imputation: upload the filled form and click “Imputed data” to view the data.
Prediction: click “Predicted result” to view the predicted HM status.

Load example

Download the template Template

Download the feature description Description

Upload the file (.csv/.txt)

Browse...

Select the read.table parameters below

Header

Separator

Comma

Semicolon

Tab

Space

Download data

Below we provide the raw data that we published in the STM2023 paper for research use.

Clinical information of the cohort: Click here
Somatic mutations of the glioma samples: Click here
mRNA expression of the initial gliomas: Click here
mRNA expression of the recurrent gliomas: Click here
Updated grading (WHO 2021) of the TCGA LGG cohort: Click here

Citation:

If you use these data in your publication, please consider cite:

Mu, Quanhua, et al. “Identifying predictors of glioma evolution from longitudinal sequencing.” Science Translational Medicine 15.716 (2023): eadh4181.

Introduction

CELLO2 is an interactive web server to visualize curated clinical and somatic genomic information of longitudinally paired gliomas, and to explore the probability of developing treatment-induced hypermutation using models that we trained from the collected data. The app is hosted on shinyapps.io at http://www.wang-lab-hkust.com:3838/cello2.

The server is built in R and Shiny, and is right now under active updating. In this web site, you can:

Click Genes panel to explore genomic alterations and RNA expression of one specific gene;
Click Cases panel to explore clinical and genetic information of patients;
Click CELLO2 panel to explore the probability of developing treatment-induced hypermutation using models that we trained from the collected data;

In this page you can visualize the genomic alteration and RNA expression of one specific gene in the initial and recurrent tumors of glioma patients. It is also possible to link the changes in genomic alteration and/or gene expression with several important clinical features such as molecular subtype, pathological grade, grade progression and development of hypermutation.

Visualize genomic alteration and gene expression

In the left control panel, you can input the gene name, such as IDH1, then click submit. At the right side of the page, you will see two figures displaying the gene expression (upper panel) and genomic alteration (lower panel) in the cohort. By default, the figures are separated into two facets representing hypermutation status at recurrence. Select to group by Molecular_Subtype and we can see the gene expression data are shown in scatter plot, and each point represents the expression level of one sample, in the unit of log2 RPKM. The genomic alteration data are shown in a heatmap. It is clear to see that IDH1 mutation are present in all IDH-mutant-codel and IDH-mutant-non-codel patients but in none of the IDHwt patients:

Regarding genomic alteration, we included data for mutations, copy number alterations and several other important alterations such as FGFR3-TACC3 fusion, PTPRZ1-MET fusion, EGFR*vIII, *MET*ex14, *MGMT fusions and hypermutation (HM). For example, to check hypermutated samples in this cohort, you can simply type HM in the gene name box.

Group by clinical features for comparison

In addition to molecular subtype, we can also explore other clinical features. For example, enter MKI67 in the gene name box, and select to group by Hypermutation_at_recurrence, from the results we can see higher frequency of MKI67 overexpression in patients that develop hypermutation:

An additional useful feature is to show the longitudinal pairs in the gene expression plot. This will add a link between the initial and recurrent tumor of each patient. For example, if we check VEGFA expression and Grade_Progression, we will see significant elevation of VEGFA expression during grade progression:

In the cases panel, there is an overview of the cases, which shows the time point of each surgery as well as the overall survival. In the input box, one can input a patient ID to view the clinical data, somatic mutations, and copy number alterations.

The “CELLO2” panel performs imputation and prediction of hypermutation (HM) using clinical and genomic profiles.

“example” patients prediction

To make it easier to understand the process of imputation and prediction of hypermutation (HM), we create “example” button which enables users to access the predicted results of three patients.

Click the “example” button.
The “example” patients will be displayed in “Upload data”. The “NA” (not available data) will be highlighted by yellow.
NAs will be imputed by ISEM (iteratively sequential ensemble machine) and users need to click the “Imputed data” panel. The imputation data is colored by red.
After imputation, users can click the “Predicted result” panel to acquire the likelihood of harbouring HM (the left digital dashboard), ranging from 0 (non-HM) to 1 (HM).
Users can access the results of one specific patient by selecting the patient ID from “Choose ID of sample”.
The predicted results of all example patients can be download by clicking the “download” button on the bottom.

“upload” patients prediction

Users can explore the HM and Grade status of patients in their own cohort.

Prepare the data:

Download the template (click the “Template” button).
Fill in the corresponding information according to the 46 features (the description can be downloaded by clicking the “Description” button). For unknown or unavailable features, users can leave blank without filling in the numerical value.

Note: Each row means each patient, and CELLO 2 supports submission of multiple samples implemented by multiple rows.
Upload the data: the csv and txt file can be uploaded.
The upload data can be seen in “Upload data” panel. The “NA” (not available data) will be highlighted by yellow.
NAs will be imputed by ISEM (iteratively sequential ensemble machine) and users need to click the “Imputed data” panel. The imputation data is colored by red.
The probability of harbouring HM ranging from 0 (non-HM) to 1 (HM).
CELLO2 supports batch download, which can enable users to freely get all the predicted results by clicking “download” button on the bottom.

CELLO2: Cancer EvoLution from machine learning of LOngitudinal sequencing

CELLO2 is developed by Yingxi Yang, Quanhua Mu, and Zhihan Zhu in Wang-Lab@HKUST. It further expands the functionality of our previous CELLO toolkit^*.

Correspondence: jgwang@ust.hk

Abstract

Clonal evolution drives cancer progression and therapeutic resistance. Recent studies revealed divergent longitudinal trajectories in gliomas but early molecular traits that steer post-treatment cancer evolution remain unclear. We comprehensively analyzed sequencing data of 544 initial-recurrent adult diffuse glioma pairs to identify genomic and transcriptomic early predictors of tumor evolution in each molecular subtype and developed machine-learning methods capable of predicting hypermutation. We validated the association between the top predictor, c-MYC gain / MYC targets activation, and hypermutagenesis by TMZ resistance induction experiments in glioma cell lines and an isogenic model manipulating patient-derived gliomaspheres. We demonstrated that c-Myc, binding to open chromatin and transcriptionally active genomic regions, increases the vulnerability of key mismatch repair genes to TMZ-induced mutagenesis, thus triggering hypermutation. This study reveals early predictors of cancer evolution under therapy and provides rich resources for precision oncology targeting cancer dynamics.

Sources of the published datasets

Reference	PubMed ID	# tumor pairs	Link to Data
Mu et al. Identifying predictors of glioma evolution from longitudinal sequencing. Sci Transl Med. 15.716 (2023): eadh4181.	37792958	107	EGAS00001006894
Varn et al. Glioma progression is shaped by genetic evolution and microenvironment interactions. Cell 185.12 (2022):2184-2199 Barthel et al. Longitudinal molecular trajectories of diffuse glioma in adults. Nature 576.7785 (2019): 112-120.	35649412 31748746	172^{^}	syn17038081
Jonsson et al. Genomic Correlates of Disease Progression and Treatment Response in Prospectively Characterized Gliomas. Clinical Cancer Research 25.18 (2019): 5537-5547.	31263031	67	glioma_mskcc_2019
Zhao et al. Immune and genomic correlates of response to anti-PD-1 immunotherapy in glioblastoma. Nature Medicine 25.3 (2019): 462-469.	30742119	14	PRJNA482620
Ceccarelli et al. Molecular profiling reveals biologically discrete subsets and pathways of progression in diffuse glioma. Cell 164.3 (2016): 550-563	26824661	27	TCGA-GBM, TCGA-LGG
Bai et al. Integrated genomic characterization of IDH1-mutant glioma malignant progression. Nature Genetics 48.1 (2016): 59-66.	26618343	41	EGAS00001001588
Wang et al. Clonal evolution of glioblastoma under therapy. Nature Genetics 48.7 (2016): 768-776.	27270107	40	EGAS00001001800, SRP074425
Suzuki et al. Mutational landscape and clonal architecture in grade II and III gliomas. Nature Genetics 47.5 (2015): 458-468.	25848751	9	EGAS00001001044
Kim et al. Whole-genome and multisector exome sequencing of primary and post-treatment glioblastoma reveals patterns of tumor evolution. Genome Research 25.3 (2015): 316-327.	25650244	10	EGAS00001001033
Kim et al. Spatiotemporal evolution of the primary glioblastoma genome. Cancer Cell 28.3 (2015): 318-328.	26373279	34	EGAS00001001041
Johnson et al. Mutational analysis reveals the origin and therapy-driven evolution of recurrent glioma. Science 343.6167 (2014): 189-193.	24336570	23	EGAS00001000579

^: not including GLASS samples that are overlapping with other sources.

*: Jiang B., Song D., Mu Q., & Wang J. (2020). CELLO: a longitudinal data analysis toolbox untangling cancer evolution. Quantitative Biology, 8(3), 256-266.