Bioinformatics. 2019 Sep 15;35(18):3240-3249.

doi: 10.1093/bioinformatics/btz067.

DeepAMR for predicting co-occurrent resistance of Mycobacterium tuberculosis

Yang Yang^{1,2}, Timothy M Walker³, A Sarah Walker^{3,4}, Daniel J Wilson⁵, Timothy E A Peto^{3,4}, Derrick W Crook^{3,4,6}, Farah Shamout¹; CRyPTIC Consortium; Tingting Zhu¹, David A Clifton^{1,2}

Collaborators

CRyPTIC Consortium:
Irena Arandjelovic, Iñaki Comas, Maha R Farhat, Qian Gao, Vitali Sintchenko, Dick van Soolingen, Sarah Hoosdally, Ana L Gibertoni Cruz, Joshua Carter, Clara Grazian, Sarah G Earle, Samaneh Kouchaki, Yang Yang, Timothy M Walker, Philip W Fowler, David A Clifton, Zamin Iqbal, Martin Hunt, E Grace Smith, Priti Rathod, Lisa Jarrett, Daniela Matias, Daniela M Cirillo, Emanuele Borroni, Simone Battaglia, Arash Ghodousi, Andrea Spitaleri, Andrea Cabibbe, Sabira Tahseen, Kayzad Nilgiriwala, Sanchi Shah, Camilla Rodrigues, Priti Kambli, Utkarsha Surve, Rukhsar Khot, Stefan Niemann, Thomas Kohl, Matthias Merker, Harald Hoffmann, Nikolay Molodtsov, Sara Plesnik, Nazir Ismail, Guy Thwaites, Thuong Nguyen Thuy Thuong, Nhung Hoang Ngoc, Vijay Srinivasan, David Moore, David Jorge Coronel, Walter Solano, George F Gao, Guangxue He, Yanlin Zhao, Aijing Ma, Chunfa Liu, Baoli Zhu, Ian Laurenson, Pauline Claxton, Anastasia Koch, Robert Wilkinson, Ajit Lalvani, James Posey, James Jennifer Gardy, Jim Werngren, Nicholas Paton, Ruwen Jou, Mei-Hua Wu, Wan-Hsuan Lin, Lucilaine Ferrazoli, Rosangela Siqueira de Oliveira, São Paulo

Affiliations

¹ Institute of Biomedical Engineering, Department of Engineering Science, University of Oxford, Oxford, UK.
² Oxford-Suzhou Centre for Advanced Research, Suzhou, China.
³ Nuffield Department of Medicine, University of Oxford, John Radcliffe Hospital Headley Way, Oxford, UK.
⁴ NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Headley Way Headington, Oxford, UK.
⁵ Big Data Institute, Nuffield Department of Population Health, Li Ka Shing Centre for Health Information and Discovery, University of Oxford, Old Road Campus, Oxford, UK.
⁶ National Infection Service, Public Health England, Wellington House 133-155 Waterloo Road, London, UK.

PMID: 30689732
PMCID: PMC6748723
DOI: 10.1093/bioinformatics/btz067

Yang Yang et al. Bioinformatics. 2019.

Abstract

Motivation: Resistance co-occurrence among first-line anti-tuberculosis (TB) drugs is common. Existing methods that analyse genetic data from Mycobacterium tuberculosis (MTB) can predict resistance to individual drugs, but they do not account for resistance co-occurrence and cannot capture the latent structure of genomic data that corresponds to lineages.

Results: We used a large cohort of TB patients from 16 countries across six continents. For each isolate, the whole-genome sequence was available together with its phenotype for anti-TB drugs, obtained using drug susceptibility testing recommended by the World Health Organization. We propose DeepAMR, an end-to-end multi-task model with a deep denoising auto-encoder, for multiple drug classification, and DeepAMR_cluster, a clustering variant of DeepAMR, for learning clusters in the latent space of the data. DeepAMR outperformed the baseline model and four machine learning models, with mean AUROC from 94.4% to 98.7%, in predicting resistance to the four first-line drugs [isoniazid (INH), ethambutol (EMB), rifampicin (RIF) and pyrazinamide (PZA)], multi-drug resistant TB (MDR-TB) and pan-susceptible TB (PANS-TB: MTB that is susceptible to all four first-line anti-TB drugs). For INH, EMB, PZA and MDR-TB, DeepAMR achieved its best mean sensitivity of 94.3%, 91.5%, 87.3% and 96.3%, respectively. For RIF and PANS-TB, it achieved sensitivities of 94.2% and 92.2%, which were lower than those of the baseline model by 0.7% and 1.9%, respectively. t-SNE visualization shows that DeepAMR_cluster captures lineage-related clusters in the latent space.
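
As a rough illustration of what a multi-task model built on a denoising auto-encoder computes, the following NumPy sketch runs one forward pass and evaluates a joint loss. It is not the authors' implementation: the single hidden layer, the masking noise, the binary cross-entropy losses, the `recon_weight` parameter and all function names are assumptions made for this example. Only the six task names come from the abstract.

```python
import numpy as np

# The six prediction targets named in the abstract.
TASKS = ["INH", "EMB", "RIF", "PZA", "MDR-TB", "PANS-TB"]


def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))


def init_params(n_features, n_hidden, n_tasks, rng):
    """Small random weights for an encoder, a decoder and one output unit per task."""
    scale = 0.1
    return {
        "W_enc": rng.normal(0.0, scale, (n_features, n_hidden)),
        "b_enc": np.zeros(n_hidden),
        "W_dec": rng.normal(0.0, scale, (n_hidden, n_features)),
        "b_dec": np.zeros(n_features),
        "W_out": rng.normal(0.0, scale, (n_hidden, n_tasks)),
        "b_out": np.zeros(n_tasks),
    }


def corrupt(x, drop_prob, rng):
    """Masking noise (an assumption): zero each input entry with probability drop_prob."""
    keep = rng.random(x.shape) >= drop_prob
    return x * keep


def forward(params, x_noisy):
    """Encode the corrupted input, then reconstruct it and predict every task."""
    z = sigmoid(x_noisy @ params["W_enc"] + params["b_enc"])  # shared latent code
    x_hat = sigmoid(z @ params["W_dec"] + params["b_dec"])    # reconstruction of the input
    y_hat = sigmoid(z @ params["W_out"] + params["b_out"])    # one probability per task
    return z, x_hat, y_hat


def binary_cross_entropy(target, prob, eps=1e-7):
    prob = np.clip(prob, eps, 1.0 - eps)
    return -np.mean(target * np.log(prob) + (1.0 - target) * np.log(1.0 - prob))


def joint_loss(params, x_clean, y, drop_prob, rng, recon_weight=1.0):
    """Reconstruction loss against the clean input plus the multi-task classification loss."""
    x_noisy = corrupt(x_clean, drop_prob, rng)
    _, x_hat, y_hat = forward(params, x_noisy)
    recon = binary_cross_entropy(x_clean, x_hat)
    clf = binary_cross_entropy(y, y_hat)
    return recon_weight * recon + clf, recon, clf


# Toy data: 8 isolates, 20 binary variant features, random (not biologically consistent) labels.
rng = np.random.default_rng(0)
x = rng.integers(0, 2, size=(8, 20)).astype(float)
y = rng.integers(0, 2, size=(8, len(TASKS))).astype(float)
params = init_params(n_features=20, n_hidden=5, n_tasks=len(TASKS), rng=rng)
total, recon, clf = joint_loss(params, x, y, drop_prob=0.2, rng=rng)
```

Training would minimise `total` with respect to `params`. Sharing the latent code `z` across all task outputs is what lets one model learn the tasks jointly, and the same code is the kind of representation that a t-SNE plot of the latent space would visualize.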

Availability and implementation: Details of the source code are provided at http://www.robots.ox.ac.uk/~davidc/code.php.

Supplementary information: Supplementary data are available at Bioinformatics online.

Figures

**Fig. 1.**
Illustration of latent structure using t-SNE: (a) lineage distribution resulting from DeepAMR; (b) phenotype distribution resulting from DeepAMR; (c) lineage distribution resulting from DeepAMR_cluster; and (d) predicted clusters resulting from DeepAMR_cluster.

**Fig. 2.**
Overview of the phenotypes of the 13 403 MTB isolates examined. (a) Histogram showing the phenotype of the MTB isolates for each individual anti-TB drug, obtained by the drug susceptibility test (up to 11 anti-TB drugs were tested for all isolates). For each drug, isolates with a missing phenotype were excluded. (b) Heatmap visualizing the proportion of pair-wise resistance co-occurrence (non-diagonal) and mono-resistance (diagonal) across anti-TB drugs. The non-diagonal elements correspond to poly-resistant isolates, which were resistant to at least two anti-TB drugs. The co-occurrence matrix is symmetric, so the upper right half of the graph shows all pair-wise co-occurrence cases.
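
The matrix described in panel (b) can be sketched as follows, with pair-wise co-occurrence off the diagonal and mono-resistance on it. This is an illustration under assumptions: the function name, the toy data and the choice to divide by the total number of isolates are not taken from the paper, and the toy data contain no missing phenotypes.

```python
import numpy as np


def cooccurrence_matrix(resistant):
    """Build a drug-by-drug matrix from a (n_isolates, n_drugs) 0/1 array (1 = resistant).

    Off-diagonal [i, j]: proportion of isolates resistant to both drug i and drug j.
    Diagonal [i, i]: proportion of isolates resistant to drug i and to no other drug.
    The denominator (all isolates) is an assumption made for this sketch.
    """
    r = np.asarray(resistant, dtype=int)
    n_isolates = r.shape[0]
    counts = r.T @ r                          # pair-wise joint resistance counts
    n_resistant = r.sum(axis=1)               # number of drugs each isolate resists
    mono = r[n_resistant == 1].sum(axis=0)    # mono-resistant isolates per drug
    np.fill_diagonal(counts, mono)
    return counts / n_isolates


drugs = ["INH", "EMB", "RIF", "PZA"]
toy = [
    [1, 0, 1, 0],  # resistant to INH and RIF
    [1, 0, 0, 0],  # INH mono-resistant
    [0, 0, 0, 0],  # susceptible to all four
    [1, 1, 1, 1],  # resistant to all four
    [0, 0, 1, 0],  # RIF mono-resistant
]
matrix = cooccurrence_matrix(toy)
```

Because `r.T @ r` is symmetric, the result is too, which is why a plot of only the upper right half loses no information.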

**Fig. 3.**
SNPs with a positive permutation feature importance, ranked for INH, EMB, RIF and PZA, respectively.
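
Permutation feature importance, the ranking criterion named in this caption, can be sketched generically: shuffle one feature column, re-score the model, and record how much the metric drops. The toy classifier, the accuracy metric and every name below are assumptions for illustration. The abstract does not state which metric or settings the authors used.

```python
import numpy as np


def accuracy(y_true, y_pred):
    return float(np.mean(y_true == y_pred))


def permutation_importance(predict, x, y, metric, n_repeats, rng):
    """Mean drop in the metric when each feature column is shuffled in turn."""
    baseline = metric(y, predict(x))
    importance = np.zeros(x.shape[1])
    for j in range(x.shape[1]):
        drops = []
        for _ in range(n_repeats):
            x_perm = x.copy()
            x_perm[:, j] = rng.permutation(x_perm[:, j])  # break the feature-label link
            drops.append(baseline - metric(y, predict(x_perm)))
        importance[j] = np.mean(drops)
    return importance


# Toy data: 200 isolates, 3 binary SNP features; the label depends on SNP 0 only.
rng = np.random.default_rng(0)
x = rng.integers(0, 2, size=(200, 3))
y = x[:, 0].copy()


def predict(features):
    """Hypothetical stand-in for a trained classifier: it reads SNP 0 and ignores the rest."""
    return features[:, 0]


scores = permutation_importance(predict, x, y, accuracy, n_repeats=10, rng=rng)
ranking = np.argsort(scores)[::-1]   # feature indices, most important first
positive = ranking[scores[ranking] > 0]
```

Only features whose shuffling hurts the metric receive a positive score, so `positive` keeps the kind of subset that the figure ranks.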
