High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using Mycobacterium tuberculosis as an example
- PMID: 18597682
- PMCID: PMC2483986
- DOI: 10.1186/1471-2164-9-316
High accuracy mass spectrometry analysis as a tool to verify and improve gene annotation using Mycobacterium tuberculosis as an example
Abstract
Background: While the genomic annotations of diverse lineages of the Mycobacterium tuberculosis complex are available, divergences between gene prediction methods are still a challenge for unbiased protein dataset generation. M. tuberculosis gene annotation is an example, where the most used datasets from two independent institutions (Sanger Institute and Institute of Genomic Research-TIGR) differ up to 12% in the number of annotated open reading frames, and 46% of the genes contained in both annotations have different start codons. Such differences emphasize the importance of the identification of the sequence of protein products to validate each gene annotation including its sequence coding area.
Results: With this objective, we submitted a culture filtrate sample from M. tuberculosis to a high-accuracy LTQ-Orbitrap mass spectrometer analysis and applied refined N-terminal prediction to perform comparison of two gene annotations. From a total of 449 proteins identified from the MS data, we validated 35 tryptic peptides that were specific to one of the two datasets, representing 24 different proteins. From those, 5 proteins were only annotated in the Sanger database. In the remaining proteins, the observed differences were due to differences in annotation of transcriptional start sites.
Conclusion: Our results indicate that, even in a less complex sample likely to represent only 10% of the bacterial proteome, we were still able to detect major differences between different gene annotation approaches. This gives hope that high-throughput proteomics techniques can be used to improve and validate gene annotations, and in particular for verification of high-throughput, automatic gene annotations.
Figures






Similar articles
-
Proteogenomic analysis of polymorphisms and gene annotation divergences in prokaryotes using a clustered mass spectrometry-friendly database.Mol Cell Proteomics. 2011 Jan;10(1):M110.002527. doi: 10.1074/mcp.M110.002527. Epub 2010 Oct 28. Mol Cell Proteomics. 2011. PMID: 21030493 Free PMC article.
-
Proteogenomic analysis of Mycobacterium tuberculosis by high resolution mass spectrometry.Mol Cell Proteomics. 2011 Dec;10(12):M111.011627. doi: 10.1074/mcp.M111.011445. Epub 2011 Oct 3. Mol Cell Proteomics. 2011. PMID: 21969609 Free PMC article.
-
Validating divergent ORF annotation of the Mycobacterium leprae genome through a full translation data set and peptide identification by tandem mass spectrometry.Proteomics. 2009 Jun;9(12):3233-43. doi: 10.1002/pmic.200800955. Proteomics. 2009. PMID: 19562797
-
Proteogenomics: needs and roles to be filled by proteomics in genome annotation.Brief Funct Genomic Proteomic. 2008 Jan;7(1):50-62. doi: 10.1093/bfgp/eln010. Epub 2008 Mar 10. Brief Funct Genomic Proteomic. 2008. PMID: 18334489 Review.
-
Mycobacterium tuberculosis in the Proteomics Era.Microbiol Spectr. 2014 Apr;2(2). doi: 10.1128/microbiolspec.MGM2-0020-2013. Microbiol Spectr. 2014. PMID: 26105825 Review.
Cited by
-
Influence of allosteric regulators on individual steps in the reaction catalyzed by Mycobacterium tuberculosis 2-hydroxy-3-oxoadipate synthase.J Biol Chem. 2013 Jul 26;288(30):21688-702. doi: 10.1074/jbc.M113.465419. Epub 2013 Jun 11. J Biol Chem. 2013. PMID: 23760263 Free PMC article.
-
Deep coverage of the Escherichia coli proteome enables the assessment of false discovery rates in simple proteogenomic experiments.Mol Cell Proteomics. 2013 Nov;12(11):3420-30. doi: 10.1074/mcp.M113.029165. Epub 2013 Aug 1. Mol Cell Proteomics. 2013. PMID: 23908556 Free PMC article.
-
Proteomics for the Investigation of Mycobacteria.Acta Naturae. 2017 Jan-Mar;9(1):15-25. Acta Naturae. 2017. PMID: 28461970 Free PMC article.
-
Comparative omics-driven genome annotation refinement: application across Yersiniae.PLoS One. 2012;7(3):e33903. doi: 10.1371/journal.pone.0033903. Epub 2012 Mar 27. PLoS One. 2012. PMID: 22479471 Free PMC article.
-
Reannotation of translational start sites in the genome of Mycobacterium tuberculosis.Tuberculosis (Edinb). 2013 Jan;93(1):18-25. doi: 10.1016/j.tube.2012.11.012. Epub 2012 Dec 26. Tuberculosis (Edinb). 2013. PMID: 23273318 Free PMC article.
References
-
- World Health Organization. WHO Report 2007: Global tuberculosis control, surveillance, planning, financing. 2007.
-
- Cole ST, Brosch R, Parkhill J, Garnier T, Churcher C, Harris D, Gordon SV, Eiglmeier K, Gas S, Barry CE, 3rd, Tekaia F, Badcock K, Basham D, Brown D, Chillingworth T, Connor R, Davies R, Devlin K, Feltwell T, Gentles S, Hamlin N, Holroyd S, Hornsby T, Jagels K, Krogh A, McLean J, Moule S, Murphy L, Oliver K, Osborne J, Quail MA, Rajandream MA, Rogers J, Rutter S, Seeger K, Skelton J, Squares R, Squares S, Sulston JE, Taylor K, Whitehead S, Barrell BG. Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature. 1998;393:537–544. doi: 10.1038/31159. - DOI - PubMed
-
- Eiglmeier K, Simon S, Garnier T, Cole ST. The integrated genome map of Mycobacterium leprae. Leprosy review. 2001;72:462–469. - PubMed
-
- Fleischmann RD, Alland D, Eisen JA, Carpenter L, White O, Peterson J, DeBoy R, Dodson R, Gwinn M, Haft D, Hickey E, Kolonay JF, Nelson WC, Umayam LA, Ermolaeva M, Salzberg SL, Delcher A, Utterback T, Weidman J, Khouri H, Gill J, Mikula A, Bishai W, Jacobs Jr WR, Jr., Venter JC, Fraser CM. Whole-genome comparison of Mycobacterium tuberculosis clinical and laboratory strains. Journal of bacteriology. 2002;184:5479–5490. doi: 10.1128/JB.184.19.5479-5490.2002. - DOI - PMC - PubMed
-
- Garnier T, Eiglmeier K, Camus JC, Medina N, Mansoor H, Pryor M, Duthoy S, Grondin S, Lacroix C, Monsempe C, Simon S, Harris B, Atkin R, Doggett J, Mayes R, Keating L, Wheeler PR, Parkhill J, Barrell BG, Cole ST, Gordon SV, Hewinson RG. The complete genome sequence of Mycobacterium bovis. Proceedings of the National Academy of Sciences of the United States of America. 2003;100:7877–7882. doi: 10.1073/pnas.1130426100. - DOI - PMC - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources