Introduction
With the improvement of global average standard of living and the ability to diagnosis and treatment, colorectal cancer has gradually become the third most common tumor in past few decades [
1,
2]. The risk factors may be related to obesity, lifestyle and genetic factor [
1,
2]. Except for the improvement of diagnosis, the reason for the rise of incidence rate has not been fully understood [
1,
2]. The treatment of colorectal cancer revolves around surgery. Depended on the stage of the disease, strategies may be combined with chemotherapy, radiotherapy, biological therapy, and immunotherapy [
2]. As a key component of local treatment for colorectal cancer, radiotherapy, especially in neoadjuvant therapy of rectal cancer, aims to reduce local recurrence and improve survival for patients [
2]. However, it is frequently observed in clinical practice that recurring lesions are more resistant to radiation than primary tumor and tend to proliferate faster. The reason may be that a group of special tumor cells were generated during the treatment process, which may exhibit resistance to radiation therapy and ultimately lead to treatment failure, tumor progression and poor prognosis in patients.
The above-mentioned phenomenon is a manifestation of tumor plasticity, which is probably related to cancer stem cells (CSCs), making tumor adapt to adverse environments including anti-cancer treatments induced stress, and being observed as therapeutic resistance and tumor progression eventually [
3‐
5]. In other words, some tumor cells have undergone phenotypic evolution or natural selection (therapeutic-resistant subtypes that already existed in the early stages of tumorigenesis but were not dominant) under the pressure of treatment [
3,
6]. There were also theories describing the treatment-resistant tumor cells as persistent cells or slow cycling cells [
7‐
10]. These cells could escape treatment stress and enter a dormant phase during treatment to stop proliferation. When treatment was stopped, they re-entered the proliferative cycle and led to tumor recurrence, being a major component of treatment-resistant pool. Recently, there were also theories calling this process adaptive evolution, in which tumor cells increased mutations and responded to stress, and healthy subclones were screened [
11]. This phenomenon has been described in therapy of several kinds of cancer, e.g., targeted therapy of lung cancer [
12], hormone therapy of prostate cancer [
13], immunotherapy and targeted therapy of melanoma [
14]. These theoretical models all point to a group of residual drug-resistant cells, which become the source of therapeutic resistance and mediate tumor cells repopulation after anti-tumor therapy when sensitive cells died. Although radiotherapy is rarely used as an intervention in these theoretical models, the course of some colorectal cancer patients suggest that radiotherapy may also produce a group of radiation resistant cells which participate in tumor recurrence (tumor cell repopulation). Therefore, searching for biomarkers involved in radiotherapy resistance is of great significance for judging the treatment effect and tumor progression.
Given that some colorectal cancer patients have similar phenomena, from tumor regression after radiotherapy to recurrence (radiation tolerance), we focused our attention on radiation-resistance of colorectal cancer in this research. By studying the relationship between radiation-resistance and tumor repopulation, we explored some key genes that played a part in both of them, so as to predict the prognosis of colorectal cancer. In addition, we verified these genes in other types of tumors and in virto experiments to determine the reliability in predicting radiation-resistance and tumor progression.
Materials and methods
Project selection and data collection
In order to acquire the necessary data for this study, we used the public functional genomics data repository, Gene Expression Ominibus. By setting “colorectal cancer”, “radiation” and “resistance” as three keywords for obtaining desired project, we enrolled series GSE97543 as our mainly analyzing dataset eventually [
15]. In this project, gene expression of radiation resistant colorectal cell line and its control group were described, and all GEO data were downloaded through package “GEOquery” of R software [
16].
To further get gene expression and clinical data of patients, part of cases originated in TCGA (The Cancer Genome Atlas) program were involved. We collected gene expression data of colon, rectum and rectosigmoid junction adenocarcinoma (TCGA-COAD, TCGA-READ) cases, including 644 tumor samples and 51 normal samples to search target genes with GSE97543. Meanwhile, radiation treated TCGA-CO/READ patients (n = 35) were set as training set. Patients who received external radiotherapy as only adjuvant therapy in BRCA (breast carcinoma, n = 38), LUAD (lung adenocarcinoma, n = 36), CESC (cervical squamous cell carcinoma and endocervical adenocarcinoma, n = 20), HNSC (head and neck squamous cell carcinoma, n = 95) and ESCA (esophageal carcinoma, n = 17) were introduced to be validating parts. All TCGA data, mRNA expression and clinical details were manually downloaded from the website and organized by R software.
Founding differential expression genes
Two R packages “limma” and “edgeR” were adopted to identify the differential expression genes (DEGs) in both databases [
17,
18]. |Log2 fold change|≥ 1 and P. value ≤ 0.05 were set as the selection criterion, which defined the scope of DEGs in two databases for further analysis. And volcano plots were draw by R packages “limma” and “ggplot2”. A gene annotation and analysis resource website, “Metascape” (
https://www.metascape.org/), was utilized to cluster gene function and to enrich pathways related to DEGs [
19].
Cox Regression analysis and construction of prognostic prediction mode
After acquiring 252 upregulated co-expression genes, which were candidates to construct radiation resistance gene panel, between GSE97543 and TCGA-CO/READ, we involved univariate Cox regression to pick out the most relevant genes with PFS (progression-free survival) of radiation treated colorectal patients with P. value < 0.06, which was set to prevent missing out clinically significant genes that closely approached the conventional significance level [
20,
21].
Then, we developed a proportional hazards regression analysis (multivariate Cox model) by the following formula using the result of univariate Cox regression.
$$Risk\, score= {\sum }_{i=1}^{n}(Coefficienti\times xi)$$
In this equation, risk score is the mRNA expression of each key predictor genes (xi) multiplied by the coefficient (Coefficienti), the latter came from multivariate Cox regression analysis. Using the median value of risk score that calculated by the formula before, the 35 TCGA patients in the training set were divided into two groups, high risk group and low risk group. Based on this grouping routine, we observed the expression levels of key genes in the two groups and established a survival analysis between PFS and risk score. ROC (receiver operating character) curves were implemented to evaluate the predictive ability of radiation resistance gene panel using R package “survivalROC”, and the AUC (area under the curve) values were calculated to visualize inspection capabilities.
In order to test whether risk score was an independent prognostic factor, we incorporated the age, gender, stage, T, N, M status and risk score of 35 TCGA patients into the multivariate Cox regression model, so as to exclude confounding caused by other clinical features. Furthermore, R package “rms” was used to make a nomogram model, showing above clinical information.
Testing the applicability of radiation resistance gene panel
Besides TCGA-CO/READ, we further chose patients who received radiotherapy as only adjuvant therapy from other TCGA projects to check validity of radiation resistance gene panel. Logistic regression models were built by R package “glmnet”, and different gene combination models were listed. Several best models and their AUC values of different cancer types were displayed respectively.
WGCNA analysis
The R package, “WGCNA” (Weighted Gene Co-expression Network Analysis), was utilized to build a co-expression network targeting DEGs. This method aims to find co-expressed gene modules, and to explore the relationship between gene networks and the phenotype of interest, as well as the core genes in the network [
22]. It is divided into two parts: expression cluster analysis and phenotypic correlation.
The specific method was that selecting patients who received radiotherapy in TCGA-COREAD, and incorporating the top ten thousand DEGs with the largest variance of these patients into the cluster analysis of gene expression. After clustering the samples and eliminating outliers, we determined the soft-thresholding power according to the algorithm, which was used to construct a scale-free co-expression network and determine gene modules. Each color represented a module, and each module contained genes with similar expression patterns. Dynamic tree cut analysis was developed to represent the classification of genes, and modules with high similarity were fused to construct merged dynamic clusters. In addition, different modules were established the correlation to two clinical traits, PFS and risk score, and modules that contained key radiation resistant genes were selected to carry out GO analysis.
Cell culture
Immortalized human colorectal tumor cell lines (HCT 116 and HT-29) and 293 T cells used in this research were bought from cells bank of the Chinese Academy of Science (Shanghai, China). HCT 116, HT-29 and 293 T cells were seeded in completed DMEM medium (Wisent, China). 10% Fetal Bovine Serum (Wisent, China) and 1% Penicillin–Streptomycin (Wisent, China) were added to the completed DMEM medium. All cells were cultured in 5% CO2 at 37 ℃.
Cell Irradiation and colony formation assay
An X-Ray generator (Faxitron, USA) for laboratory was used to treat cells. And the dose rate was 3 Gy/min. HCT 116 and HT-29 were seeded in six well culture dishes (100, 200, 1000, 2500 and 10,000 cells/well) and incubated eight hours (overnight) until adherent before irradiation. After adherent, cells were treated with various doses (0, 2, 4, 6, 8 Gy) of X-Ray respectively. All cells were fixed by paraformaldehyde of 4% fortnight later. And then, they were stained by crystal violet. This experiment was repeated three times independently. Any colony that contained fifty cells or more were counted and linear-quadratic model in Graphpad Prism 8 was employed to calculate the surviving fraction.
Detection the expression level of mRNA by RT-qPCR
Total mRNA was extracted by using RNA Easy Fast Cell Kit (Tiangen, China). cDNA was reversely transcribed from mRNA by using the RT Kit. Q-PCR was conducted by using TB Green Kit. Relative mRNA level was calculated by the formula 2 − △△CT. The experiments were repeated three times. Both kits mentioned above were from Takara, Japan. Specific primers information is clarified in supplemental material. GAPDH, an internal reference gene, were used as a control for standardization.
ShRNA transfected and knockdown cell lines constructed
All shRNA sequences were got from MERCK website. When the confluence of 293 T cells in the cell culture dish reached 80–90%, we mixed the target plasmid containing shRNA, tool plasmid psPAX2, pMD2.G and GFP fluorescent plasmid into the 400 μL Opti-MEM medium according to the protocol proportion, and then added the mixed system into 293 T cell culture medium. After 48 h, the virus solution from 293 T cells was obtained and added to HCT116 and HT29 cell culture medium with 10 μg/mL polybrene. After another 48 h, the virus group cells and control group cells were treated with 1.5 μg/mL purinomycin (the tool plasmid contained purinomycin-resistance gene to ensure that these cells successfully infected by the virus would not be killed). When the control cells were all killed by purinomycin, the surviving cells in virus group were thought to be successfully infected by the virus. Western Blot experiment was carried out to detect the expression level of genes.
Western blot experiment
Cell protein was obtained by using cracking liquid to crack the cells. The obtained protein denatured by boiling, and SDS-PAGE gel was used for electrophoresis to separate proteins with different molecular weights. After electrophoresis, SDS-PAGE gel and PVDF (polyvinylidene fluoride) paper were used to constructed gel-matrix sandwich to transfer. After that, the PVDF paper was incubated successively with the first and second antibodies, and exposed in developer to obtain the image of the target band. These experiments were repeated three times independently. The blank controls were wild type HCT116 and HT29 cells without any treatment. The negative control referred to two types of cells transferred into empty plasmids.
R Packages and statistic analysis
We conducted this research mainly based on R and R studio software. R packages employed include: “clusterProfiler”, “GEOquery”, “org.Hs.eg.db”, “HsAgilentDesign026652.db”, “R.utils”, “rjson”, “jsonlite”, “reshape2”, “ggfortify”, “limma”, “edgeR”, “GOplot”, “stringr”, “ggplot2”, “dplyr”, “pheatmap”, “glmnet”, “survival”, “survminer”, “rms”, “cowplot”, “WGCNA”, “ROCR”, “survivalROC”. The difference in progress-free survival in different groups of patients was analyzed using log-rank test in GraphPad Prism 8, and two tailed student’s t-test was used to compare mean values. * for P ≤ 0.05; ** for P ≤ 0.01; *** for P ≤ 0.001. Statistical significance was set at P ≤ 0.05 in most conditions, and specific principles are n.s for not significant.
Discussion
Resistance to therapy, which contributes a lot to poor prognosis of cancer patients, is an unavoidable part of cancer treatment, and this situation is most common in patients with recurrence and metastasis. Targeted therapy and chemotherapy are the hardest hit areas for therapy resistance and tumor relapse [
25‐
27]. It is ordinary for tumor with a strong initial response to eventually develop into a drug resistant one [
25]. Under the stimulation of treatment, drug-resistance tumor cells that successfully escape cytotoxicity become the dominant group in tumor repopulation [
3,
6,
25]. Traditional theories believed that this group of cells acted as cancer stem cells (CSCs) to mediate tumor relapse through genetic processes [
28,
29], while recent research results used persistent state or slow cycling state, a diapause-like slow proliferation state that mediates relapse through non-genetic mechanisms, to describe this group of cells [
7,
8,
10,
30]. Similarly, some primary tumors that were originally sensitive to radiotherapy show resistance behavior after recurrence in clinical practice, indicating that radiation resistant tumor cells are the culprit of tumor repopulation after radiotherapy. Therefore, our research focused on the role of radiation-resistance cells in tumor repopulation after radiotherapy (Additional file
11).
To explore this question, our attention focused on radiation resistant colorectal cancer cells, which were the consequent of repeated induction by radiation. In our previous research, we screened the targeted therapy dataset of lung cancer to predict the efficacy of radiotherapy [
31]. In this study, we used a radiation related dataset and conducted more complete experimental verification. Through the integration and differential analysis of sequencing data from radiation resistant colorectal cancer cells as well as TCGA tumor and normal tissue data of colorectal cancer patients, the co-upregulated genes were spotted. While representing increased resistance to radiotherapy, this group of genes also represented a stronger tumor proliferation capability. Based on the TCGA training set of 35 colorectal cancer patients undergoing radiotherapy, we developed a gene panel containing four genes by condensing these co-upregulated genes. And predictive ability of the gene panel was validated through five different datasets and in vitro experiments, to make the verification process more convincing.
In this work, GO analysis of the obtained 710 upregulated genes in resistant cells clarified that cell cycle, DNA replication and repair pathways were significantly enriched, which indicated that resistant cells were timely and effective in dealing with DNA damage than sensitive cells (Fig.
1b). But whether faster recovery from DNA damage response meant stronger proliferative capacity was largely unknown. Therefore, we included the TCGA-COREAD database (tumor vs. normal tissues) that could reflect tumor proliferation ability into the analysis, and found a total of 252 co-upregulated genes intersecting with GEO dataset, implying the tendency of radiation-resistance cells to participate in tumor repopulation (Fig.
1d). Enrichment analysis of these co-upregulated genes showed that pathways related to cytokinesis were activated (Additional file
1: Figure S1). This is consistent with previous studies on radiation resistant cells, which hold the view that cancer stem cells were more resistant to radiation, especially manifesting in DNA damage repair and ROS scavenging capabilities, and their stemness characteristics were closely related to accelerated regeneration during or after treatment [
24,
32,
33].
In order to further target the radiation-resistance genes closely related to tumor repopulation after radiotherapy, we used multivariate regression analysis to condense 252 co-upregulated genes and established a prognostic prediction system, which consisted of four key genes (LGR5, KCNN4, TNS4, CENPH), based on the TCGA colorectal cancer patients (Figs.
2 and
3). Meanwhile, four high-risk genes related to bad PFS were verified their prognostic ability in BRCA, LUAD, CESC, ESCA and HNSC patients (Fig.
5 and Additional file
4: Figure S4).
LGR5 (Leucine-rich repeat containing G protein-coupled receptor), which is regarded as a marker of adult stem cells, is a gene encoding for a composition of the Wnt receptor complex [
34,
35]. After activation, LGR5 recruits the LRP receptor complex which can bind to Wnt ligand [
36]. And β-catenin is then further accumulated and transported to the nucleus binding with TCF/LEF family of transcription factors, which induce the expression of Wnt target genes, including C-myc and cyclinD1, stimulating tumor cell proliferation, EMT and other processes [
36]. These are confirmed in many studies. A recent study revealed that LGR5 + colon CSCs were responsible for driving tumor re-growth after ablation [
37]. And LGR5 knockdown reduced tumor invasion and migration and blocked EMT by inhibiting the Wnt/β-catenin pathway, in both breast cancer and glioma [
38,
39]. In another study about 5-Fu induced drug-resistant colorectal cancer cells, authors found that LGR5 + tumor cells were significantly enriched in pool of resistant cells by constructing an organoid model, which was similar to our analysis that LGR5 was highly expressed in radio-resistant cells [
40].
KCNN4, a potassium channel protein activated by Ca2 + , is implicated in the promotion of cell invasion and cell proliferation, and has been considered as a poor prognostic factor for thyroid cancer [
41], pancreatic cancer [
42,
43], lung cancer [
44] and glioblastoma [
45]. AP-1, as a transcription factor induced by various stress, promotes the overexpression of KCNN4, which may depend on the Ca2 + /MET/AKT axis to exert its function [
46]. It was reported that KCNN4 regulated calcium ion signals to influence the cell cycle arrest and promote the repair of damaged DNA in glioma, thereby increasing the radio-resistance of tumors [
45,
47]. What’s more, evidence also shown that KCNN4 was upregulated by PRL-3 to promote the proliferation of colorectal cancer cells, and contributed to the invasion and metastasis of colorectal cancer by participating in the PRL-3 mediated EMT process [
48,
49]. Various evidence indicates that KCNN4 is an important factor in radiation resistance and tumor proliferation.
TNS4 is a focal adhesion molecule that belongs to the tensin family, and it is significantly up-regulated in a variety of gastrointestinal tumors and lung cancer [
50‐
52]. It regulated cell survival, proliferation, and migration through increased MET protein stability in colorectal cancer [
53]. Moreover, TNS4 expression was significantly increased in hepatocellular carcinoma and intrahepatic cholangiocarcinoma and was positively feedback-regulated by KRAS and SOX17 to stimulate migration and proliferation [
54,
55]. In addition, TNS4 could inhibit the degradation of EGFR, a molecule related to tumor cell proliferation and apoptosis inhibition, through post-translational modification, and prolonged its function [
56]. These are in agreement with the higher survival and proliferation capacity of radiation-resistant cells in our research.
As one of the essential components of active kinetochore, overexpression of centromere protein H (CENPH) in human colorectal cancer was shown to be a major cause of chromosomal instability (CIN) [
57]. This state is an important boost in driving tumor cells to produce anti-therapeutic mutations and promoting tumor evolution [
58,
59]. And CENPH has been considered to be associated with tumor progression and poor prognosis in NSCLC [
60], tongue cancer [
61], esophageal cancer [
62] and gastric cancer [
63]. And the effect of CENPH may be highly related to Survivin, an inhibitor of apoptosis protein family, which helps tumor cells to survive and restore proliferation under harmful stress [
61,
63]. Meanwhile, knockdown of CENPH retarded the growth of Hep3B, hepatic carcinoma cell, subcutaneous xenograft, and decreased the expression of Ki-67 and BCL-2 [
64]. High expression of this gene in radiation-resistant cells probably indicates enhanced proliferation capability. A brief description of four key genes was shown in Fig.
4d.
There are still some limitations that need to be further improved in this study. Firstly, the gene panel needs to be validated in patient cohorts. The validation part of this study relies on TCGA clinical data and in vitro experiments, which cannot fully represent the complexity and heterogeneity of human tumors. Secondly, the possible mechanisms involved in key genes are needed to be clarified. Research on mechanisms may throw light on these key genes to become therapeutic targets. To make up for these shortcomings in the future study, we hope to further include large sample size and well characterized clinical patient cohorts to test the ability of the gene panel in predicting the prognosis of colorectal cancer patients receiving radiotherapy. Through real-world clinical research, we will gain a more specific understanding of how the obtained gene panel will guide treatment decisions. And through sequencing technology, tumor biopsy from patients can also help us understand mechanisms of radiation resistance and tumor repopulation.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.