FSPP: A Tool for Genome-Wide Prediction of smORF-Encoded Peptides and Their Functions
- PMID: 29675032
- PMCID: PMC5896265
- DOI: 10.3389/fgene.2018.00096
Abstract
smORFs are small open reading frames of fewer than 100 codons. Recent low-throughput experiments have shown that many smORF-encoded peptides (SEPs) play crucial roles in processes such as regulation of transcription or translation, transport across membranes, and antimicrobial activity. To identify more functional SEPs, genome-wide prediction tools are needed to guide low-throughput experiments. In this study, we present a functional smORF-encoded peptide predictor (FSPP) that aims to predict authentic SEPs and their functions in a high-throughput manner. FSPP uses the overlap of SEPs detected by Ribo-seq and mass spectrometry as target objects. From expression data at the transcription and translation levels, FSPP builds two co-expression networks. Combining these with co-location relations, FSPP constructs a compound network and then annotates SEPs with the functions of adjacent nodes. Tested on 38 sequenced samples from 5 human cell lines, FSPP successfully predicted 856 out of 960 annotated proteins. Interestingly, FSPP also highlighted 568 functional SEPs from these samples. On comparison, the roles predicted by FSPP were consistent with known functions. These results suggest that FSPP is a reliable tool for the identification of functional small peptides. FSPP source code can be acquired at https://www.bioinfo.org/FSPP.
Keywords: MS; Ribo-seq; SEP; function; smORF.
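The workflow outlined in the abstract (intersect Ribo-seq and MS detections, build two co-expression networks, add co-location relations, then annotate SEPs from adjacent nodes) can be illustrated with a small sketch. This is not FSPP's actual code: the abstract does not state the correlation measure, the thresholds, or the voting rule, so Pearson correlation, the 0.9 cutoff, the 10,000-base distance, majority voting, all function names, and the toy data below are assumptions made purely for illustration.

```python
from collections import Counter, defaultdict
from itertools import combinations
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def coexpression_edges(expr, threshold):
    """Edges between nodes whose profiles correlate at or above threshold."""
    return {
        frozenset((g1, g2))
        for g1, g2 in combinations(sorted(expr), 2)
        if pearson(expr[g1], expr[g2]) >= threshold
    }


def colocation_edges(positions, max_distance):
    """Edges between nodes on the same chromosome within max_distance bases."""
    return {
        frozenset((g1, g2))
        for g1, g2 in combinations(sorted(positions), 2)
        if positions[g1][0] == positions[g2][0]
        and abs(positions[g1][1] - positions[g2][1]) <= max_distance
    }


def annotate(candidates, edges, known_functions):
    """Give each candidate the most common function among annotated neighbours."""
    neighbours = defaultdict(set)
    for edge in edges:
        a, b = tuple(edge)
        neighbours[a].add(b)
        neighbours[b].add(a)
    predictions = {}
    for sep in candidates:
        votes = Counter(
            known_functions[n] for n in neighbours[sep] if n in known_functions
        )
        if votes:
            predictions[sep] = votes.most_common(1)[0][0]
    return predictions


# Hypothetical toy data: identifiers, values, and coordinates are invented.
riboseq_hits = {"sep1", "sep2", "sep3"}
ms_hits = {"sep1", "sep2", "sep4"}
targets = riboseq_hits & ms_hits  # SEPs supported by both Ribo-seq and MS

rna = {"sep1": [1, 2, 3, 4], "sep2": [4, 1, 3, 2],
       "geneA": [2, 4, 6, 8], "geneB": [5, 5, 1, 1]}
ribo = {"sep1": [1, 3, 2, 4], "sep2": [2, 2, 5, 1],
        "geneA": [1, 2, 3, 5], "geneB": [9, 7, 2, 1]}
positions = {"sep1": ("chr1", 1000), "sep2": ("chr2", 5000),
             "geneA": ("chr1", 90000), "geneB": ("chr2", 5500)}
known = {"geneA": "transcription regulation", "geneB": "membrane transport"}

# Compound network: union of both co-expression networks and co-location edges.
edges = (coexpression_edges(rna, 0.9)
         | coexpression_edges(ribo, 0.9)
         | colocation_edges(positions, 10_000))
predicted = annotate(targets, edges, known)
print(predicted)
```

In this toy example each target SEP ends up with exactly one annotated neighbour, so the vote is trivial; with real data, ties and weak edges would need an explicit policy that the abstract does not describe.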
Similar articles
- Improved Identification and Analysis of Small Open Reading Frame Encoded Polypeptides. Anal Chem. 2016 Apr 5;88(7):3967-75. doi: 10.1021/acs.analchem.6b00191. Epub 2016 Mar 24. PMID: 27010111 Free PMC article.
- In Search of Lost Small Peptides. Annu Rev Cell Dev Biol. 2017 Oct 6;33:391-416. doi: 10.1146/annurev-cellbio-100616-060516. Epub 2017 Jul 31. PMID: 28759257 Review.
- Identification and analysis of small proteins and short open reading frame encoded peptides in Hep3B cell. J Proteomics. 2021 Jan 6;230:103965. doi: 10.1016/j.jprot.2020.103965. Epub 2020 Sep 3. PMID: 32891891
- BONCAT-based Profiling of Nascent Small and Alternative Open Reading Frame-encoded Proteins. Bio Protoc. 2023 Jan 5;13(1):e4585. doi: 10.21769/BioProtoc.4585. eCollection 2023 Jan 5. PMID: 36789088 Free PMC article.
- Chemical labeling and proteomics for characterization of unannotated small and alternative open reading frame-encoded polypeptides. Biochem Soc Trans. 2023 Jun 28;51(3):1071-1082. doi: 10.1042/BST20221074. PMID: 37171061 Free PMC article. Review.
Cited by
- CPPred: coding potential prediction based on the global description of RNA sequence. Nucleic Acids Res. 2019 May 7;47(8):e43. doi: 10.1093/nar/gkz087. PMID: 30753596 Free PMC article.
- IRSOM2: a web server for predicting bifunctional RNAs. Nucleic Acids Res. 2023 Jul 5;51(W1):W281-W288. doi: 10.1093/nar/gkad381. PMID: 37158254 Free PMC article.
- Massively integrated coexpression analysis reveals transcriptional regulation, evolution and cellular implications of the yeast noncanonical translatome. Genome Biol. 2024 Jul 8;25(1):183. doi: 10.1186/s13059-024-03287-7. PMID: 38978079 Free PMC article.
- Tutorial: guidelines for the use of machine learning methods to mine genomes and proteomes for antibiotic discovery. Nat Protoc. 2025 May 14. doi: 10.1038/s41596-025-01144-w. Online ahead of print. PMID: 40369233 Review.
- SynMyco transposon: engineering transposon vectors for efficient transformation of minimal genomes. DNA Res. 2019 Aug 1;26(4):327-339. doi: 10.1093/dnares/dsz012. PMID: 31257417 Free PMC article.