Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Jan 30;24(1):32.
doi: 10.1186/s12859-023-05155-w.

GENTLE: a novel bioinformatics tool for generating features and building classifiers from T cell repertoire cancer data

Affiliations

GENTLE: a novel bioinformatics tool for generating features and building classifiers from T cell repertoire cancer data

Dhiego Souto Andrade et al. BMC Bioinformatics. .

Abstract

Background: In the global effort to discover biomarkers for cancer prognosis, prediction tools have become essential resources. TCR (T cell receptor) repertoires contain important features that differentiate healthy controls from cancer patients or differentiate outcomes for patients being treated with different drugs. Considering, tools that can easily and quickly generate and identify important features out of TCR repertoire data and build accurate classifiers to predict future outcomes are essential.

Results: This paper introduces GENTLE (GENerator of T cell receptor repertoire features for machine LEarning): an open-source, user-friendly web-application tool that allows TCR repertoire researchers to discover important features; to create classifier models and evaluate them with metrics; and to quickly generate visualizations for data interpretations. We performed a case study with repertoires of TRegs (regulatory T cells) and TConvs (conventional T cells) from healthy controls versus patients with breast cancer. We showed that diversity features were able to distinguish between the groups. Moreover, the classifiers built with these features could correctly classify samples ('Healthy' or 'Breast Cancer')from the TRegs repertoire when trained with the TConvs repertoire, and from the TConvs repertoire when trained with the TRegs repertoire.

Conclusion: The paper walks through installing and using GENTLE and presents a case study and results to demonstrate the application's utility. GENTLE is geared towards any researcher working with TCR repertoire data and aims to discover predictive features from these data and build accurate classifiers. GENTLE is available on https://github.com/dhiego22/gentle and https://share.streamlit.io/dhiego22/gentle/main/gentle.py .

Keywords: Feature selection; Machine learning tools; T cell receptor repertoire.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
GENTLE Workflow
Fig. 2
Fig. 2
AD Scatter Plot of each dimension using the TConvs dataset. EH Scatter Plot of each fdimension using the TRegs dataset. It is important to emphasize that we used the most predictive features on each scatter plot, according to the feature selection methods
Fig. 3
Fig. 3
AC Stratified validation using threefold and 100 repeats of the classifiers trained with the TConvs dataset where A is the Gaussian Naive Bayes, B is the Linear Discriminant Analysis and C are Linear Regression and Decision Tree classifiers. D Confusion matrix of the model trained with the TConvs dataset (train) and validated with the TRegs dataset (test). E Stratified validation using threefold and 100 repeats of the Decision Tree classifier trained with the TRegs dataset. F Confusion matrix of the model trained with the TRegs dataset (train) and validated with the TConvs dataset (test)

References

    1. Kumagai S, Togashi Y, Kamada T, Sugiyama E, Nishinakamura H, Takeuchi Y, et al. The PD-1 expression balance between effector and regulatory T cells predicts the clinical efficacy of PD-1 blockade therapies. Nat Immunol. 2020;21:1346–1358. doi: 10.1038/s41590-020-0769-3. - DOI - PubMed
    1. Zhao J, Chen AX, Gartrell RD, Silverman AM, Aparicio L, Chu T, et al. Immune and genomic correlates of response to anti-PD-1 immunotherapy in glioblastoma. Nat Med. 2019;25:462–469. doi: 10.1038/s41591-019-0349-y. - DOI - PMC - PubMed
    1. Pai JA, Satpathy AT. High-throughput and single-cell T cell receptor sequencing technologies. Nat Methods. 2021;18:881–892. doi: 10.1038/s41592-021-01201-8. - DOI - PMC - PubMed
    1. Girardi M. Immunosurveillance and immunoregulation by gammadelta T cells. J Invest Dermatol. 2006;126:25–31. doi: 10.1038/sj.jid.5700003. - DOI - PubMed
    1. Arnaout RA, Prak ETL, Schwab N, Rubelt F. Adaptive immune receptor repertoire community. The future of blood testing is the immunome. Front Immunol. 2021;12:626793. doi: 10.3389/fimmu.2021.626793. - DOI - PMC - PubMed

Substances