Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019;8(1):1043-1053.
doi: 10.1080/22221751.2019.1637702.

Comprehensive subspecies identification of 175 nontuberculous mycobacteria species based on 7547 genomic profiles

Affiliations

Comprehensive subspecies identification of 175 nontuberculous mycobacteria species based on 7547 genomic profiles

Yuki Matsumoto et al. Emerg Microbes Infect. 2019.

Abstract

The prevalence of nontuberculous mycobacteria (NTM) pulmonary diseases has been increasing worldwide. NTM consist of approximately 200 species and distinguishing between them at the subspecies level is critical to treatment. In this study, we sequenced 63 NTM genomes, 27 of which were newly determined, by hybrid assembly using sequencers from Illumina and Oxford Nanopore Technologies (ONT). This analysis expanded the available genomic data to 175 NTM species and redefined their subgenus classification. We also developed a novel multi-locus sequence typing (MLST) database based on 184 genes from 7547 assemblies and an identification software, mlstverse, which can also be used for detecting other bacteria given a suitable MLST database. This method showed the highest sensitivity and specificity amongst conventional methods and demonstrated the capacity for rapid detection of NTM, 10 min of sequencing of the ONT MinION being sufficient. Application of this methodology could improve disease epidemiology and increase the cure rates of NTM diseases.

Keywords: Nontuberculous mycobacteria; comparative genomics; multi-locus sequence typing; next-generation sequencing; pulmonary diseases.

PubMed Disclaimer

Conflict of interest statement

No potential conflict of interest was reported by the authors.

Figures

Figure 1.
Figure 1.
Taxonomic analysis using all Mycobacterium species. A) Assembly quality. The bar plot indicates the N50 length in each species. The grey bar was available with the NCBI assemblies. The x-axis is sorted in ascending order with y-axis value. The blue dots show assemblies obtained in this work. The black and red dashed lines show the median N50s calculated using assemblies available from NCBI and our assemblies (also see Table S1). Growth type was defined according to the time required to grow bacterial colonies [i.e. rapid (3–7 days) and slow (>7 days)]. B) Core and pan genome analysis. Gene clusters were discriminated based on differences in the percent identity and length. The numbers of core (bottom) and pan (top) genes are shown, based on the number of NTM species. The error bars were calculated from 1000 replicates of randomly selected species. C) Comparative analysis. A phylogenetic tree was constructed using the 80% core genome (also see Figure S1). Filled colours correspond to the subgenus, growth type, pathogenicity, and assembly availability.
Figure 2.
Figure 2.
Pipeline workflow. Mlstverse uses an fasta or fastq file as input and outputs an identification result as a table, using a given database.
Figure 3.
Figure 3.
Distribution of mapped reads in 29 clinical samples. The coverage of each gene in each sample is shown as a heat map for 53 ribosomal proteins genes and 131 species-specific genes. The coverage distribution is also shown on the right, with a binomial distribution fitting.
Figure 4.
Figure 4.
Sensitivity and specificity. A) Species-level evaluation of sequence homologies of 16S rRNA genes obtained from SILVA (left) and profile similarity in the mlstverse database (right). Species were ordered based on hierarchical-clustering results with complete-linkage. The profile shown on the right was sorted in the same order. B) Subspecies-level evaluation of M. abscessus profile similarities in the mlstverse database. The colour bar under the dendrogram indicates subspecies information stored in assembly metadata corresponding to subspecies that are not yet classified (grey), subsp. abscessus (red), subsp. bolletii (blue), or subsp. massiliense (yellow) (also see Figure S3).
Figure 5.
Figure 5.
Rapid detection of M. abscessus at the subspecies level. A and B) MLST scores vs. coverage with Illumina short reads (A) and ONT long reads (B). Views based on a wide range of coverage (top of A and B) or focused on low coverage range (bottom of A and B) are shown. C) Amount of output data vs. time using an ONT MinION sequencer. Data obtained during the first 48 h (2,880 min; top) or initial 30 min (bottom) are shown.

References

    1. Falkinham JO. Environmental sources of nontuberculous mycobacteria. Clin Chest Med. 2015;36:35–41. doi: 10.1016/j.ccm.2014.10.003 - DOI - PubMed
    1. Halstrom S, Price P, Thomson R.. Review: environmental mycobacteria as a cause of human infection. Int J Mycobacteriology. 2015;4:81–91. doi: 10.1016/j.ijmyco.2015.03.002 - DOI - PubMed
    1. Namkoong H, Kurashima A, Morimoto K, et al. . Epidemiology of pulmonary nontuberculous Mycobacterial disease, Japan1. Emerg Infect Dis. 2016;22:1116–1117. doi: 10.3201/eid2206.151086 - DOI - PMC - PubMed
    1. Smith GS, Ghio AJ, Stout JE, et al. . Epidemiology of nontuberculous mycobacteria isolations among central North Carolina residents, 2006–2010. J Infect. 2016;72:678–686. doi: 10.1016/j.jinf.2016.03.008 - DOI - PubMed
    1. Agizew T, Basotli J, Alexander H, et al. . Higher-than-expected prevalence of non-tuberculous mycobacteria in HIV setting in Botswana: Implications for diagnostic algorithms using Xpert MTB/RIF assay. PLoS One. 2017;12:e0189981. doi: 10.1371/journal.pone.0189981 - DOI - PMC - PubMed

MeSH terms