. 2021 Mar 8;11(3):188.

doi: 10.3390/jpm11030188.

Telomere Length Dynamics and Chromosomal Instability for Predicting Individual Radiosensitivity and Risk via Machine Learning

Jared J Luxton^{1

2}, Miles J McKenna^{1

2}, Aidan M Lewis¹, Lynn E Taylor¹, Sameer G Jhavar³, Gregory P Swanson³, Susan M Bailey^{1

2}

Affiliations

¹ Department of Environmental and Radiological Health Sciences, Colorado State University, Fort Collins, CO 80523, USA.
² Cell and Molecular Biology Program, Colorado State University, Fort Collins, CO 80523, USA.
³ Baylor Scott & White Medical Center, Temple, TX 76508, USA.

PMID: 33800260
PMCID: PMC8002073
DOI: 10.3390/jpm11030188

Telomere Length Dynamics and Chromosomal Instability for Predicting Individual Radiosensitivity and Risk via Machine Learning

Jared J Luxton et al. J Pers Med. 2021.

. 2021 Mar 8;11(3):188.

doi: 10.3390/jpm11030188.

Authors

Jared J Luxton^{1

2}, Miles J McKenna^{1

2}, Aidan M Lewis¹, Lynn E Taylor¹, Sameer G Jhavar³, Gregory P Swanson³, Susan M Bailey^{1

2}

Affiliations

¹ Department of Environmental and Radiological Health Sciences, Colorado State University, Fort Collins, CO 80523, USA.
² Cell and Molecular Biology Program, Colorado State University, Fort Collins, CO 80523, USA.
³ Baylor Scott & White Medical Center, Temple, TX 76508, USA.

PMID: 33800260
PMCID: PMC8002073
DOI: 10.3390/jpm11030188

Abstract

The ability to predict a cancer patient's response to radiotherapy and risk of developing adverse late health effects would greatly improve personalized treatment regimens and individual outcomes. Telomeres represent a compelling biomarker of individual radiosensitivity and risk, as exposure can result in dysfunctional telomere pathologies that coincidentally overlap with many radiation-induced late effects, ranging from degenerative conditions like fibrosis and cardiovascular disease to proliferative pathologies like cancer. Here, telomere length was longitudinally assessed in a cohort of fifteen prostate cancer patients undergoing Intensity Modulated Radiation Therapy (IMRT) utilizing Telomere Fluorescence in situ Hybridization (Telo-FISH). To evaluate genome instability and enhance predictions for individual patient risk of secondary malignancy, chromosome aberrations were assessed utilizing directional Genomic Hybridization (dGH) for high-resolution inversion detection. We present the first implementation of individual telomere length data in a machine learning model, XGBoost, trained on pre-radiotherapy (baseline) and in vitro exposed (4 Gy γ-rays) telomere length measurements, to predict post radiotherapy telomeric outcomes, which together with chromosomal instability provide insight into individual radiosensitivity and risk for radiation-induced late effects.

Keywords: IMRT; chromosomal instability; individual radiosensitivity; inversions; late effects; machine learning; personalized medicine; prostate cancer; telomeres.

PubMed Disclaimer

Conflict of interest statement

S.M.B. is a cofounder and scientific advisory board member of KromaTiD, Inc.

Figures

**Figure 1**
Telomere length dynamics (Telo-FISH). Mean telomere length expressed as relative fluorescence intensity. (A) Time-course of blood sample collection for all prostate cancer patients (n = 15; 50 cells/patient/time point scored): 1 non irrad = pre-IMRT non-irradiated (0 Gy); 2 irrad @ 4 Gy: pre-IMRT in vitro irradiated; 3B: immediate post-IMRT; and 4C: 3-months post-IMRT. Boxes denote quantiles, horizontal grey lines denote medians. Telomere length values were standardized using BJ1/BJ-hTERT controls. (B) Hierarchical clustering of patients by longitudinal changes in mean telomere length (z-score normalized). (C) Time-course for clustered groups of patients (n = 3, purple; n = 11, blue); center lines denote medians, lighter bands denote confidence intervals. Patient ID 13 not clustered (sample failed to culture). Significance was assessed using a repeated measures ANOVA and post hoc Tukey’s HSD test.

**Figure 2**
Telomere length distributions (Telo-FISH). Individual telomere length distributions of prostate cancer patients (n = 15): 1 non irrad = pre-IMRT non-irradiated (0 Gy); 2 irrad @ 4 Gy = pre-IMRT in vitro irradiated; 3B = immediate post-IMRT; and 4C = 3-months post-IMRT. RFI: Relative Fluorescence Intensity. Individual telomeres from the pre-therapy non-irradiated time point were split into quartiles, designating telomeres in the bottom 25% (yellow), middle 50% (blue), and top 25% (red). Quartile cut-off values, established by the distribution of the pre-therapy non-irradiated time point, were applied to subsequent time points to feature engineer the relative shortest, mid-length, and longest individual telomeres per time point. (A) Individual telomere length distributions for all patients (averaged) per time point. (B) Individual telomere length distributions for patients in mean telomere length clustered group 1 (n = 3) and (C) group 2 (n = 11).

**Figure 3**
Longitudinal shifts in numbers of short and long telomeres (Telo-FISH). Numbers of short and long telomeres from individual telomere length distributions: 1 non irrad = pre-IMRT non-irradiated (0 Gy); 2 irrad @ 4 Gy = pre-IMRT in vitro irradiated; 3B = immediate post-IMRT; and 4C = 3-months post-IMRT. Shortest (yellow), mid-length (blue), and longest (red) telomeres were feature engineered per patient (n = 15). (A) Counts of short, medium, and long telomeres; 4600 individual telomeres per patient per time point. Significance was assessed using a square-root transformation and a repeated measures ANOVA with post hoc Tukey’s HSD test. Hierarchical clustering of patients by longitudinal changes in numbers of short (B) and long telomeres (D) (z-score normalized). Time-courses of patient groups (n = 3, purple; n = 11, blue) clustered by numbers of short (C) and long (E) telomeres; center lines denote medians and lighter bands denote confidence intervals. Patient ID 13 not clustered (sample failed to culture).

**Figure 4**
Linear regression models failed to predict post-IMRT telomeric outcomes. Ordinary least squares linear regression models were employed using pre-IMRT telomeric data (Telo-FISH) from the pre-IMRT non-irradiated (0 Gy) or the pre-IMRT in vitro irradiated (4 Gy) samples to predict 3-month post-IMRT telomeric outcomes. Models were made using (A) mean telomere length (R² = 0.161, 0.165), (B) numbers of short (R² = 0.433, 0.554), and (C) numbers of long (R² = 0.046, 0.208) telomeres.

**Figure 5**
Processing of Telo-FISH data for training and testing XGBoost models. Schematic for machine learning pipeline used for individual telomere length data (Telo-FISH). Preprocessed data: Feature 1: pre-IMRT individual telomere length measurements (n = 128,800); Feature 2: pre-IMRT sample labels (non-irradiated, in vitro irradiated, encoded as 0/1); Target: 3 months post-IMRT telomeric outcomes (mean telomere length or numbers of short and long telomeres). Data is randomly shuffled and stratified (by patient ID and pre-therapy sample origin) and split into training (80%) and test (20%) datasets; patient IDs are stripped after splitting. Five-fold cross validation was used, and models were evaluated with Mean Absolute Error (MAE) and R² scores between predicted and true values in the test set.

**Figure 6**
High performance of XGBoost models for predicting post-IMRT telomeric outcomes. Three separate XGBoost models were trained on pre-IMRT individual telomere length measurements (n = 103,040, Telo-FISH) to predict 3-month post-IMRT telomeric outcomes. Trained XGBoost models were challenged with the test set (new data, n = 25,760 individual telomeres) to predict 3-month post-IMRT telomeric outcomes for (A) mean telomere length, (B) numbers of short, and (C) numbers of long telomeres. XGBoost predictions were averaged on a per patient basis for (D) mean telomere length, (E) numbers of short, and (F) numbers of long telomeres; blue line represents a simple regression line (X/Y), lighter bands the 95% confidence interval, R² values (coefficient of determination) are noted in bold.

**Figure 7**
Strong generalizability of XGBoost models to new patient data (leave one out approach). (A–N) Fourteen separate XGBoost models were iteratively trained on pre-IMRT individual telomere length measurements (n = 93,840, Telo-FISH) excluding one patient, and tested to predict 3-month post-IMRT mean telomere length, with inclusion of the patient excluded during training. Each panel is one model; patients excluded during training for that model are noted in the panel headers and plotted in black. Lines represent a simple regression line (X/Y), lighter bands the 95% confidence interval, R² values (coefficient of determination) are noted in bold.

**Figure 8**
Longitudinal analyses of chromosomal instability. Whole blood was collected from prostate cancer patients undergoing IMRT (n = 15) and chromosome aberrations assessed using directional Genomic Hybridization (dGH) on metaphase spreads (n = 30/patient/timepoint scored): 1 non irrad = pre-IMRT non-irradiated (0 Gy); 2 irrad @ 4 Gy = pre-IMRT in vitro irradiated; 3B = immediate post-IMRT; and 4C = 3-month post-IMRT. Frequencies of (A) inversions, (B) translocations, (C) dicentrics, (D) excess chromosome fragments (deletions), and (E) sister chromatid exchanges (SCE). Significance was assessed for average aberration frequencies using a repeated measures ANOVA and post hoc Tukey’s HSD test. p < 0.05 *, p < 0.01 **, p < 0.001 ***.

**Figure 9**
Clustering of patients by chromosome aberration frequencies. Time-courses for groups of patients hierarchically clustered into discrete groups (blue, purple) per aberration type: 1 non irrad = pre-IMRT non-irradiated (0 Gy); 2 irrad @ 4 Gy = pre-IMRT in vitro irradiated; 3B = immediate post-IMRT; and 4C = 3-month post-IMRT. Clustered groups of patients for frequencies of (A) inversions, (B) translocations, (C) dicentrics, (D) excess chromosome fragments (deletions), and (E) aberration index, which was created by summing all aberration types. Center lines denote medians and lighter bands denote confidence intervals.

**Figure 10**
Neither linear regression nor XGBoost models successfully predicted post-IMRT chromosome aberration (CA) frequencies. Ordinary least squares linear regression models were made using pre-IMRT average CA frequencies from the non-irradiated (0 Gy) or in vitro irradiated (4 Gy) samples to predict 3-month post-IMRT average CA frequencies. Models were made for (A) inversions, (B) translocations, (C) dicentrics, (D) excess chromosome fragments (deletions), and (E) aberration index, which was created by summing all CA per cell. The model for dicentrics performed best, with an R² = 0.514. XGBoost models were trained on pre-IMRT counts of different CA types per cell (n = 672) to predict 3-month post-IMRT average CA frequencies. Trained XGBoost models were challenged with the test set (new data, n = 168 cells) to predict 3-month post-IMRT average CA frequencies. XGBoost predictions were averaged on a per patient basis for (F) inversions, (G) translocations, (H) dicentrics, (I) excess chromosome fragments (deletions), and (J) aberration index. For all models, R² values between averaged predictions and actual values did not exceed 0.100.

See this image and copyright information in PMC

Cited by

A correlation graph attention network for classifying chromosomal instabilities from histopathology whole-slide images.
Liu L, Wang Y, Chang J, Zhang P, Xiong S, Liu H. Liu L, et al. iScience. 2023 May 18;26(6):106874. doi: 10.1016/j.isci.2023.106874. eCollection 2023 Jun 16. iScience. 2023. PMID: 37260749 Free PMC article.
Machine Learning & Molecular Radiation Tumor Biomarkers.
Rydzewski NR, Helzer KT, Bootsma M, Shi Y, Bakhtiar H, Sjöström M, Zhao SG. Rydzewski NR, et al. Semin Radiat Oncol. 2023 Jul;33(3):243-251. doi: 10.1016/j.semradonc.2023.03.002. Semin Radiat Oncol. 2023. PMID: 37331779 Free PMC article. Review.
Telomeric RNA (TERRA) increases in response to spaceflight and high-altitude climbing.
Al-Turki TM, Maranon DG, Nelson CB, Lewis AM, Luxton JJ, Taylor LE, Altina N, Wu F, Du H, Kim J, Damle N, Overbey E, Meydan C, Grigorev K, Winer DA, Furman D, Mason CE, Bailey SM. Al-Turki TM, et al. Commun Biol. 2024 Jun 11;7(1):698. doi: 10.1038/s42003-024-06014-x. Commun Biol. 2024. PMID: 38862827 Free PMC article.
Role of telomere length in human carcinogenesis (Review).
Tsatsakis A, Oikonomopoulou T, Nikolouzakis TK, Vakonaki E, Tzatzarakis M, Flamourakis M, Renieri E, Fragkiadaki P, Iliaki E, Bachlitzanaki M, Karzi V, Katsikantami I, Kakridonis F, Hatzidaki E, Tolia M, Svistunov AA, Spandidos DA, Nikitovic D, Tsiaoussis J, Berdiaki A. Tsatsakis A, et al. Int J Oncol. 2023 Jul;63(1):78. doi: 10.3892/ijo.2023.5526. Epub 2023 May 26. Int J Oncol. 2023. PMID: 37232367 Free PMC article. Review.
Radiation Biomarkers: Silver Bullet, or Wild Goose Chase?
Rutten EA, Badie C. Rutten EA, et al. J Pers Med. 2021 Jun 25;11(7):603. doi: 10.3390/jpm11070603. J Pers Med. 2021. PMID: 34202274 Free PMC article.

See all "Cited by" articles

References

1. Barnett G.C., West C.M.L., Dunning A.M., Elliott R.M., Coles C.E., Pharoah P.D.P., Burnet N.G. Normal Tissue Reactions to Radiotherapy. Nat. Rev. Cancer. 2009;9:134–142. doi: 10.1038/nrc2587. - DOI - PMC - PubMed
1. Bentzen S.M. Preventing or Reducing Late Side Effects of Radiation Therapy: Radiobiology Meets Molecular Pathology. Nat. Rev. Cancer. 2006;6:702–713. doi: 10.1038/nrc1950. - DOI - PubMed
1. Yusuf S.W., Venkatesulu B.P., Mahadevan L.S., Krishnan S. Radiation-Induced Cardiovascular Disease: A Clinical Perspective. Front. Cardiovasc. Med. 2017;4 doi: 10.3389/fcvm.2017.00066. - DOI - PMC - PubMed
1. Carver J.R., Shapiro C.L., Ng A., Jacobs L., Schwartz C., Virgo K.S., Hagerty K.L., Somerfield M.R., Vaughn D.J., ASCO Cancer Survivorship Expert Panel American Society of Clinical Oncology Clinical Evidence Review on the Ongoing Care of Adult Cancer Survivors: Cardiac and Pulmonary Late Effects. J. Clin. Oncol. 2007;25:3991–4008. doi: 10.1200/JCO.2007.10.9777. - DOI - PubMed
1. Greene-Schloesser D., Robbins M.E. Radiation-Induced Cognitive Impairment-from Bench to Bedside. Neuro Oncol. 2012;14:iv37–iv44. doi: 10.1093/neuonc/nos196. - DOI - PMC - PubMed

Grants and funding

Advanced Industry (AI) Bioscience Proof of Concept (POC) award/Colorado Office of Economic Development and International Trade (OEDIT)

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Telomere Length Dynamics and Chromosomal Instability for Predicting Individual Radiosensitivity and Risk via Machine Learning

Affiliations

Telomere Length Dynamics and Chromosomal Instability for Predicting Individual Radiosensitivity and Risk via Machine Learning

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources