Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jun 29;10(3):e0169221.
doi: 10.1128/spectrum.01692-21. Epub 2022 Apr 25.

Construction and Analysis of the Complete Genome Sequence of Leprosy Agent Mycobacterium lepromatosis

Affiliations

Construction and Analysis of the Complete Genome Sequence of Leprosy Agent Mycobacterium lepromatosis

Francisco J Silva et al. Microbiol Spectr. .

Abstract

Leprosy is caused by Mycobacterium leprae and Mycobacterium lepromatosis. We report construction and analyses of the complete genome sequence of M. lepromatosis FJ924. The genome contained 3,271,694 nucleotides to encode 1,789 functional genes and 1,564 pseudogenes. It shared 1,420 genes and 885 pseudogenes (71.4%) with M. leprae but differed in 1,281 genes and pseudogenes (28.6%). In phylogeny, the leprosy bacilli started from a most recent common ancestor (MRCA) that diverged ~30 million years ago (Mya) from environmental organism Mycobacterium haemophilum. The MRCA then underwent reductive evolution with pseudogenization, gene loss, and chromosomal rearrangements. Analysis of the shared pseudogenes estimated the pseudogenization event ~14 Mya, shortly before species bifurcation. Afterwards, genomic changes occurred to lesser extent in each species. Like M. leprae, four major types of highly repetitive sequences were detected in M. lepromatosis, contributing to chromosomal rearrangements within and after MRCA. Variations in genes and copy numbers were noted, such as three copies of the gene encoding bifunctional diguanylate cyclase/phosphodiesterase in M. lepromatosis, but single copy in M. leprae; 6 genes encoding the TetR family transcriptional regulators in M. lepromatosis, but 11 such genes in M. leprae; presence of hemW gene in M. lepromatosis, but absence in M. leprae; and others. These variations likely aid unique pathogenesis, such as diffuse lepromatous leprosy associated with M. lepromatosis, while the shared genomic features should explain the common pathogenesis of dermatitis and neuritis in leprosy. Together, these findings and the genomic data of M. lepromatosis may facilitate future research and care for leprosy. IMPORTANCE Leprosy is a dreaded infection that still affects millions of people worldwide. Mycobacterium lepromatosis is a recently recognized cause in addition to the well-known Mycobacterium leprae. M. lepromatosis is likely specific for diffuse lepromatous leprosy, a severe form of the infection and endemic in Mexico. This study constructed and annotated the complete genome sequence of M. lepromatosis FJ924 and performed comparative genomic analyses with related mycobacteria. The results afford new and refined insights into the genome size, gene repertoire, pseudogenes, phylogenomic relationship, genome organization and plasticity, process and timing of reductive evolution, and genetic and proteomic basis for pathogenesis. The availability of the complete M. lepromatosis genome may prove to be useful for future research and care for the infection.

Keywords: Mycobacterium leprae; Mycobacterium lepromatosis; genomics; leprosy; reductive evolution.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

FIG 1
FIG 1
Phylogenomic tree of selected mycobacterial species. ML tree was inferred using a concatenated conserved protein alignment (316,716 amino acid positions) under a JTT+F+R3 substitution model. Support values were obtained with 1000 ultrafast bootstraps (right node labels) and 1000 SH-aLRT (left node labels).
FIG 2
FIG 2
Synteny breaks and repeat sequences identified in M. lepromatosis FJ924. The outside circle displays the 24 synteny breaks (black) detected in the genome comparing with M. leprae. The inner circle displays the positions of repeat sequences: RLPM (purple), LPMREP (red), REPLPM (blue), LPMRPT (yellow) repeat families, and other unidentified repeats (orange). Arrowheads mark the coincidences between the positions of breaks and repeat sequences.
FIG 3
FIG 3
Synteny among mycobacterial genomes. (Left) Genome rearrangement phylogeny obtained with the Neighbor joining method and a distance matrix showing the minimal number of inversion events required to explain the differences between any pair of genomes obtained with GRIMM. Branch lengths are the numbers of inversion events estimated with the phylogenetic method. (Right) Graphic linear representation of the genome rearrangements observed comparing the four mycobacterial genomes. The graph was obtained with Mauve and displayed with genoPlotR.
FIG 4
FIG 4
Comparative analyses of mycobacterial proteomes. UpSet plot showing the computed coding gene clusters (intersections) derived from the proteomes encoded in the complete genomes of M. lepromatosis FJ924, M. leprae Br4923, M. haemophilum DSM44634, and M. tuberculosis H37Rv. Shared coding gene clusters (core) and M. lepromatosis FJ924 specific coding gene clusters are highlighted in yellow and blue, respectively.
FIG 5
FIG 5
Relative ages of pseudogenes. Histogram with the relative ages estimated for four types of pseudogenes. A value of 1 corresponds to the time of divergence between M. haemophilum and the leprosy bacilli. A value of 0 is present time. From left to right: M. leprae shared pseudogenes (with an orthologous pseudogene in M. lepromatosis, green color), M. lepromatosis shared pseudogenes (with an orthologous pseudogene in M. leprae, orange color), M. leprae unique pseudogenes (with a CDS annotated in M. lepromatosis, blue color) and M. lepromatosis unique pseudogenes (with a CDS annotated in M. leprae, pink color).
FIG 6
FIG 6
Divergence times of mycobacterial species. A timetree analysis using the RelTime method with a concatenated alignment (316,716 sites amino acid sites) was used. Evolutionary model was JTT+G+I+F. The calibration node is marked (red). Divergence time in millions of years ago.
FIG 7
FIG 7
Phylogenetic reconstruction of diguanylate cyclase/phosphodiesterase genes in selected mycobacterial species. Maximum likelihood phylogeny obtained with the amino acid alignment (601 sites), using the evolutionary model JTT+G. Numbers at the nodes indicate bootstrap values obtained with 500 replicates.

References

    1. Gelber RH. 2005. Leprosy (Hansen’s Disease). P 966–972. In Kasper DL, Braunwald E, Fauci AS, Hauser SL, Longo DL, Jameson JL (ed), Harrison’s principles of internal medicine, 16th ed McGraw-Hill, New York.
    1. Latapí F, Chevez Zamora A. 1948. The “spotted” leprosy of Lucio: an introduction to its clinical and histological study. Int J Lepr 16:421–430.
    1. Hansen GHA. 1874. Undersøgelser angående spedalskhedens årsager. Nor Mag Laegevidenskaben 9:1–88.
    1. Han XY, Seo Y-H, Sizer KC, Schoberle T, May GS, Spencer JS, Li W, Nair RG. 2008. A new Mycobacterium species causing diffuse lepromatous leprosy. Am J Clin Pathol 130:856–864. doi:10.1309/AJCPP72FJZZRRVMM. - DOI - PubMed
    1. Cole ST, Eiglmeier K, Parkhill J, James KD, Thomson NR, Wheeler PR, Honoré N, Garnier T, Churcher C, Harris D, Mungall K, Basham D, Brown D, Chillingworth T, Connor R, Davies RM, Devlin K, Duthoy S, Feltwell T, Fraser A, Hamlin N, Holroyd S, Hornsby T, Jagels K, Lacroix C, Maclean J, Moule S, Murphy L, Oliver K, Quail MA, Rajandream MA, Rutherford KM, Rutter S, Seeger K, Simon S, Simmonds M, Skelton J, Squares R, Squares S, Stevens K, Taylor K, Whitehead S, Woodward JR, Barrell BG. 2001. Massive gene decay in the leprosy bacillus. Nature 409:1007–1011. doi:10.1038/35059006. - DOI - PubMed

Publication types

Supplementary concepts