Proteogenomic Methods to Improve Genome Annotation
- PMID: 26867739
- DOI: 10.1007/978-1-4939-3524-6_5
Abstract
Annotation of protein-coding genes in sequenced genomes has routinely been carried out using gene prediction programs guided by available transcript data. The advent of mass spectrometry has enabled high-throughput identification of proteins. In addition to being searched against proteins annotated in public databases, mass spectrometry data can be searched against a conceptually translated genome or transcriptome to identify novel protein-coding regions. This proteogenomics approach has identified novel protein-coding regions in both prokaryotic and eukaryotic genomes. These studies have also revealed that some annotated noncoding RNAs and pseudogenes code for proteins. This approach is likely to become part of most genome annotation workflows in the future. Here we describe a general methodology and approach that can be used for proteogenomics.
Keywords: Mass spectrometry; Noncoding RNAs; Novel proteins; Proteogenomics; Pseudogenes.
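The "conceptually translated genome" mentioned in the abstract is commonly built by translating a DNA sequence in all six reading frames. The following is a minimal sketch of that idea, not the authors' pipeline: the function names (`six_frame_translate`, `find_peptide`) and the toy sequence are hypothetical, the standard genetic code is assumed, and a plain substring match stands in for a real spectrum-to-peptide database search.

```python
# Minimal six-frame translation sketch (illustrative; standard genetic code assumed).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons only; unknown codons become 'X', stops '*'."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translate(dna):
    """Map frame labels (+1..+3, -1..-3) to their conceptual translations."""
    dna = dna.upper()
    frames = {}
    for strand, seq in (("+", dna), ("-", reverse_complement(dna))):
        for offset in range(3):
            frames[f"{strand}{offset + 1}"] = translate(seq[offset:])
    return frames


def find_peptide(peptide, frames):
    """Hypothetical helper: list the frames whose translation contains the peptide."""
    return [name for name, protein in frames.items() if peptide in protein]


frames = six_frame_translate("ATGGCCAAGTAA")  # toy sequence, not real data
print(frames["+1"])                  # MAK*
print(find_peptide("MAK", frames))   # ['+1']
```

In practice, translated frames are usually split at stop codons and digested in silico before being handed to a search engine; the sketch omits those steps to keep the frame logic visible.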