Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Jan 8:10:594030.
doi: 10.3389/fcimb.2020.594030. eCollection 2020.

Machine Learning Algorithms Evaluate Immune Response to Novel Mycobacterium tuberculosis Antigens for Diagnosis of Tuberculosis

Affiliations

Machine Learning Algorithms Evaluate Immune Response to Novel Mycobacterium tuberculosis Antigens for Diagnosis of Tuberculosis

Noëmi Rebecca Meier et al. Front Cell Infect Microbiol. .

Abstract

Rationale: Tuberculosis diagnosis in children remains challenging. Microbiological confirmation of tuberculosis disease is often lacking, and standard immunodiagnostic including the tuberculin skin test and interferon-γ release assay for tuberculosis infection has limited sensitivity. Recent research suggests that inclusion of novel Mycobacterium tuberculosis antigens has the potential to improve standard immunodiagnostic tests for tuberculosis.

Objective: To identify optimal antigen-cytokine combinations using novel Mycobacterium tuberculosis antigens and cytokine read-outs by machine learning algorithms to improve immunodiagnostic assays for tuberculosis.

Methods: A total of 80 children undergoing investigation of tuberculosis were included (15 confirmed tuberculosis disease, five unconfirmed tuberculosis disease, 28 tuberculosis infection and 32 unlikely tuberculosis). Whole blood was stimulated with 10 novel Mycobacterium tuberculosis antigens and a fusion protein of early secretory antigenic target (ESAT)-6 and culture filtrate protein (CFP) 10. Cytokines were measured using xMAP multiplex assays. Machine learning algorithms defined a discriminative classifier with performance measured using area under the receiver operating characteristics.

Measurements and main results: We found the following four antigen-cytokine pairs had a higher weight in the discriminative classifier compared to the standard ESAT-6/CFP-10-induced interferon-γ: Rv2346/47c- and Rv3614/15c-induced interferon-gamma inducible protein-10; Rv2031c-induced granulocyte-macrophage colony-stimulating factor and ESAT-6/CFP-10-induced tumor necrosis factor-α. A combination of the 10 best antigen-cytokine pairs resulted in area under the curve of 0.92 ± 0.04.

Conclusion: We exploited the use of machine learning algorithms as a key tool to evaluate large immunological datasets. This identified several antigen-cytokine pairs with the potential to improve immunodiagnostic tests for tuberculosis in children.

Keywords: cytokines; immune response; interferon-gamma release assay; novel antigens; pediatric tuberculosis.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

Figure 1
Figure 1
Whole blood was stimulated with novel antigens and data was analyzed with different machine learning algorithms. (A) Whole blood was stimulated with 11 mycobacterial antigens, left unstimulated and with a positive control, overnight and supernatant was analyzed using Luminex technology to measure 11 different cytokines (B) data was normalized within antigen–cytokine pairs using min-max or mean–std normalization or within patient distribution using the latter only. Data (n = 59) was divided into five equal parts and a classifier discriminating healthy vs. sick children was trained using four subsets and tested on one subset (cross-validation). The algorithm’s parameters were adjusted until performance was optimal. ROC curves were used to measure performance. (C) K-means clustering approach was used to allocate individual data points to three cluster centers randomly. This approach was repeated until optimal data point allocation was reached meaning the sum of the distances from data point to cluster centers is minimized.
Figure 2
Figure 2
Normalization of data contributes to performance of discriminative classifier (A) Cytokine concentrations for individual patients. Results are sorted by patient group and clusters (2, 1 or 0), and antigen–cytokine pairs. Clustering was performed using K-means algorithm. Min–max normalization was applied to cytokine–antigen concentrations, mean–std normalization was applied to between-individual measurements (color change from dark blue to light green represents an increase in relative cytokine concentration). (B) AUROC curve showing the performance of the binary classifier (confirmed/unconfirmed TB and TB infection versus TB exposed) in 59 patients using different normalization methods: min–max and mean–std; normalization of antigen–cytokine pairs; min–max/mean-std combining an antigen–cytokine pair normalization with individual patient normalization.
Figure 3
Figure 3
Effect of normalization of antigen–cytokine pairs and normalization for individual patients (A) Performance of binary classifier using the 10 most important features and applying an antigen–cytokine pair normalization (min–max) and a normalization for individual patients (mean–std) (B) Combination of 10 antigen–cytokine pairs contributing the most to performance of trained discriminative classifier with min–max normalization of antigen–cytokine pairs and mean–std individual patient normalization.

References

    1. Bahk Y. Y., Kim S. A., Kim J. S., Euh H. J., Bai G. H., Cho S. N., et al. (2004). Antigens secreted from Mycobacterium tuberculosis: identification by proteomics approach and test for diagnostic marker. Proteomics 4 (11), 3299–3307. 10.1002/pmic.200400980 - DOI - PubMed
    1. Belay M., Legesse M., Mihret A., Bekele Y., Ottenhoff T. H., Franken K. L., et al. (2015). Pro- and anti-inflammatory cytokines against Rv2031 are elevated during latent tuberculosis: a study in cohorts of tuberculosis patients, household contacts and community controls in an endemic setting. PloS One 10 (4), e0124134. 10.1371/journal.pone.0124134 - DOI - PMC - PubMed
    1. Bryson B. D., Rosebrock T. R., Tafesse F. G., Itoh C. Y., Nibasumba A., Babunovic G. H., et al. (2019). Heterogeneous GM-CSF signaling in macrophages is associated with control of Mycobacterium tuberculosis. Nat. Commun. 10 (1), 2329. 10.1038/s41467-019-10065-8 - DOI - PMC - PubMed
    1. Chegou N. N., Black G. F., Kidd M., van Helden P. D., Walzl G. (2009). Host markers in QuantiFERON supernatants differentiate active TB from latent TB infection: preliminary report. BMC Pulm. Med. 9, 21. 10.1186/1471-2466-9-21 - DOI - PMC - PubMed
    1. Chegou N. N., Essone P. N., Loxton A. G., Stanley K., Black G. F., van der Spuy G. D., et al. (2012). Potential of host markers produced by infection phase-dependent antigen-stimulated cells for the diagnosis of tuberculosis in a highly endemic area. PloS One 7 (6), e38501. 10.1371/annotation/bc36a9c6-d5c0-4d55-bc92-9ce4a07b4f70 - DOI - PMC - PubMed

Publication types

Substances