Why are they missing? : Bioinformatics characterization of missing human proteins
- PMID: 27535355
- DOI: 10.1016/j.jprot.2016.08.005
Why are they missing? : Bioinformatics characterization of missing human proteins
Abstract
NeXtProt is a web-based protein knowledge platform that supports research on human proteins. NeXtProt (release 2015-04-28) lists 20,060 proteins, among them, 3373 canonical proteins (16.8%) lack credible experimental evidence at protein level (PE2:PE5). Therefore, they are considered as "missing proteins". A comprehensive bioinformatic workflow has been proposed to analyze these "missing" proteins. The aims of current study were to analyze physicochemical properties, existence and distribution of the tryptic cleavage sites, and to pinpoint the signature peptides of the missing proteins. Our findings showed that 23.7% of missing proteins were hydrophobic proteins possessing transmembrane domains (TMD). Also, forty missing entries generate tryptic peptides were either out of mass detection range (>30aa) or mapped to different proteins (<9aa). Additionally, 21% of missing entries didn't generate any unique tryptic peptides. In silico endopeptidase combination strategy increased the possibility of missing proteins identification. Coherently, using both mature protein database and signal peptidome database could be a promising option to identify some missing proteins by targeting their unique N-terminal tryptic peptide from mature protein database and or C-terminus tryptic peptide from signal peptidome database. In conclusion, Identification of missing protein requires additional consideration during sample preparation, extraction, digestion and data analysis to increase its incidence of identification.
Keywords: Bioinformatics; Missing protein; Signal peptidome; Transmembrane domain.
Copyright © 2016. Published by Elsevier B.V.
Similar articles
-
In Silico Peptide Repertoire of Human Olfactory Receptor Proteomes on High-Stringency Mass Spectrometry.J Proteome Res. 2019 Dec 6;18(12):4117-4123. doi: 10.1021/acs.jproteome.8b00494. Epub 2019 May 22. J Proteome Res. 2019. PMID: 31046287
-
Informatics View on the Challenges of Identifying Missing Proteins from Shotgun Proteomics.J Proteome Res. 2015 Dec 4;14(12):5396-407. doi: 10.1021/acs.jproteome.5b00482. Epub 2015 Nov 19. J Proteome Res. 2015. PMID: 26549055
-
Probing the Missing Human Proteome: A Computational Perspective.J Proteome Res. 2015 Dec 4;14(12):4949-58. doi: 10.1021/acs.jproteome.5b00728. Epub 2015 Oct 5. J Proteome Res. 2015. PMID: 26407240
-
Combination of Multiple Spectral Libraries Improves the Current Search Methods Used to Identify Missing Proteins in the Chromosome-Centric Human Proteome Project.J Proteome Res. 2015 Dec 4;14(12):4959-66. doi: 10.1021/acs.jproteome.5b00578. Epub 2015 Sep 14. J Proteome Res. 2015. PMID: 26330117
-
Bioinformatics and Computer Simulation Approaches to the Discovery and Analysis of Bioactive Peptides.Curr Pharm Biotechnol. 2022;23(13):1541-1555. doi: 10.2174/1389201023666220106161016. Curr Pharm Biotechnol. 2022. PMID: 34994325 Review.
Cited by
-
Bioinformatic Analysis of WNT Family Proteins.Bioinform Biol Insights. 2025 Jul 15;19:11779322251353347. doi: 10.1177/11779322251353347. eCollection 2025. Bioinform Biol Insights. 2025. PMID: 40673210 Free PMC article.
-
Are Antisense Proteins in Prokaryotes Functional?Front Mol Biosci. 2020 Aug 14;7:187. doi: 10.3389/fmolb.2020.00187. eCollection 2020. Front Mol Biosci. 2020. PMID: 32923454 Free PMC article.
-
Protocol for Increasing the Sensitivity of MS-Based Protein Detection in Human Chorionic Villi.Curr Issues Mol Biol. 2022 May 9;44(5):2069-2088. doi: 10.3390/cimb44050140. Curr Issues Mol Biol. 2022. PMID: 35678669 Free PMC article.
-
Accelerating the search for the missing proteins in the human proteome.Nat Commun. 2017 Jan 24;8:14271. doi: 10.1038/ncomms14271. Nat Commun. 2017. PMID: 28117396 Free PMC article. Review.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources