Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Dec 1;8(12):3751-3764.
doi: 10.1093/gbe/evw279.

Mycobacterium tuberculosis Complex Exhibits Lineage-Specific Variations Affecting Protein Ductility and Epitope Recognition

Affiliations

Mycobacterium tuberculosis Complex Exhibits Lineage-Specific Variations Affecting Protein Ductility and Epitope Recognition

Inmaculada Yruela et al. Genome Biol Evol. .

Abstract

The advent of whole-genome sequencing has provided an unprecedented detail about the evolution and genetic significance of species-specific variations across the whole Mycobacterium tuberculosis Complex. However, little attention has been focused on understanding the functional roles of these variations in the protein coding sequences. In this work, we compare the coding sequences from 74 sequenced mycobacterial species including M. africanum, M. bovis, M. canettii, M. caprae, M. orygis, and M. tuberculosis. Results show that albeit protein variations affect all functional classes, those proteins involved in lipid and intermediary metabolism and respiration have accumulated mutations during evolution. To understand the impact of these mutations on protein functionality, we explored their implications on protein ductility/disorder, a yet unexplored feature of mycobacterial proteomes. In agreement with previous studies, we found that a Gly71Ile substitution in the PhoPR virulence system severely affects the ductility of its nearby region in M. africanum and animal-adapted species. In the same line of evidence, the SmtB transcriptional regulator shows amino acid variations specific to the Beijing lineage, which affects the flexibility of the N-terminal trans-activation domain. Furthermore, despite the fact that MTBC epitopes are evolutionary hyperconserved, we identify strain- and lineage-specific amino acid mutations affecting previously known T-cell epitopes such as EsxH and FbpA (Ag85A). Interestingly, in silico studies reveal that these variations result in differential interaction of epitopes with the main HLA haplogroups.

Keywords: Mycobacterium; coding sequences; epitope polymorphisms; epitope-HLA binding; lineages; protein ductility.

PubMed Disclaimer

Figures

<sc>Fig</sc>. 1.—
Fig. 1.—
Schematic phylogenetic relationships of the MTBC showing the number of genome sequences analyzed in this work. The figure also shows the phylogenetic distribution of M. canettii, the L1–L7 lineages and the animal-adapted species, as well as the geographic distribution of each lineage and the preferred host.
<sc>Fig</sc>. 2.—
Fig. 2.—
(A) Overall distribution of proteins encoded by the core-genome of M. tuberculosis H37Rv (right panel) and the average mutated proteins found in 74 species of the MTBC analyzed in this work (left panel). Both diagrams are divided according to functional classes. (B) Distribution according to functional classes of mutated proteins with experimental evidence annotated in BioCyc of M. canettii (dark blue), M. africanum (red), M. bovis AF2122 (green), M. tuberculosis Beijiing (violet), M. orygis (ligth blue), and M. caprae (orange). Cluster 0, virulence, detoxification and adaptation; cluster 1, lipid metabolism; cluster 2, information pathways; cluster 3, cell wall and cell processes; cluster 5, insertion sequences and phages; cluster 6, proline–glutamate (PE) and proline–proline–glutamate (PPE) proteins; cluster 7, intermediary metabolism and respiration; cluster 8, unknown proteins; cluster 9, regulatory proteins; cluster 10, conserved hypothetical proteins; cluster 16, conserved hypothetical proteins with an orthologue in M. bovis. The percentages are normalized taking into account the total proteins associated to each functional group (A) and the sequence identity averages (B).
<sc>Fig</sc>. 3.—
Fig. 3.—
(A) Schematic representation of the topology of PhoR. Note the presence of two transmembrane helices spanning the periplasmic loop which contains the Gly71Ile mutation present in M. africanum L6 and animal-adapted species. (B) Disorder and molecular recognition motif predictions in the 61–121 amino acid segments of the periplasmic sensor loop of PhoR from M. tuberculosis and M. bovis/M.africanum L6. Y-axis values ≥ 0.5 means high probability of disorder. Note that the Gly71Ile mutation severely affects the ductility of the sensor domain. Predictions were carried out with IUPred and ANCHOR (left panel, red and blue lines, respectively), MoRFPred (middle panel, red line), and RONN (right panel, red line) (for details see “Materials and Methods” section).
<sc>Fig</sc>. 4.—
Fig. 4.—
(A) Average distribution of disordered residues in the proteins encoded by the core-genome of M. tuberculosis, essential proteins, nonessential proteins, antigens, and epitopes. (B) Percentage of intrinsically disordered/ductile proteins (IDPs) encoded by the core-genome of M. tuberculosis versus the length of disordered segments. (C) Frequency of amino acids within the disordered/ductile segments (L > 30).
<sc>Fig</sc>. 5.—
Fig. 5.—
Structural models of SmtB transcription factor of M. tuberculosis H37Rv (magenta) and M. tuberculosis Beijing (light blue) using the NMR structure of NmtR transcription factor of M. tuberculosis (pdb 2lkp) as template. The residues of the non-conserved motif Phe30, Ala31, Glu32, Cys33, Thr35, Phe36, Pro37 in M. tuberculosis H37Rv and Phe30, Ser31, Thr32, Ala33, Gly35, Gly36, Pro37 in M. tuberculosis Beijing are shown in sticks. The three-dimensional cartoons were drawn using PyMol 1.4.1 (Schrodinger LLC).
<sc>Fig</sc>. 6.—
Fig. 6.—
(A) dN/dS in MTBC species calculated for nonessential genes, essential genes and antigens. Note that antigens present dN/dS ratios comparable to essential and nonessential genes, indicative of antigen hyperconservation in the whole MTBC. (B) Localization in the M. tuberculosis H37Rv chromosome of T-cell antigens showing one (black dots) or more than one (red dots) polymorphism in the MTBC species analyzed in this work. The inner circle indicates the protein name and gene number according to H37Rv nomenclature. The number of epitopes contained in every antigen is provided in parentheses. The outer circumference contains information about those MTBC species carrying epitope mutations. For detailed information about every polymorphism see supplementary table S3, Supplementary Material online.
<sc>Fig</sc>. 7.—
Fig. 7.—
(A) Native and (B) mutated structures of the EsxG/EsxH (Rv0287/Rv0288) antigens (pdb 2kg7; Ilghari et al. 2011). The EsxG (yellow) and EsxH (blue) proteins are shown in ribbon. The amino acids Arg 57 of EsxG and Ala71/Ser71 and Met72 of EsxH are shown as spheres. The three-dimensional cartoons were drawn using PyMol 1.4.1 (Schrodinger LLC).
<sc>Fig</sc>. 8.—
Fig. 8.—
In silico prediction of the interaction of epitopes with the main HLA class II haplotypes. Each graph represents the population coverage of HLA class II alleles predicted to bind specific epitope variants (Y-axis) across 15 geographic regions (X-axis). Population coverages for the M. tuberculosis H37Rv epitope are indicated by orange lines and population coverages for the mutant variants in other MTBC species are shown by blue lines.

References

    1. Andreatta M, et al.2015. Accurate pan-specific prediction of peptide-MHC class II binding affinity with improved binding core identification. Immunogenetics 67:641–650. - PMC - PubMed
    1. Andreatta M, Nielsen M. 2015. Gapped sequence alignment using artificial neural networks: application to the MHC class I system. Bioinformatics 32:511–517. - PMC - PubMed
    1. Aranaz A, Cousins D, Mateos A, Dominguez L. 2003. Elevation of Mycobacterium tuberculosis subsp. caprae Aranaz et al. 1999 to species rank as Mycobacterium caprae comb. nov., sp. nov. Int J Syst Evol Microbiol. 53:1785–1789. - PubMed
    1. Babu MM, van der Lee R, de Groot NS, Gsponer J. 2011. Intrinsically disordered proteins: regulation and disease. Curr Opin Struct Biol. 21:432–440. - PubMed
    1. Blouin Y, et al.2014. Progenitor “Mycobacterium canettii” clone responsible for lymph node tuberculosis epidemic, Djibouti. Emerg Infect Dis. 20:21–28. - PMC - PubMed

Publication types