repRNA: a web server for generating various feature vectors of RNA sequences
- PMID: 26085220
- DOI: 10.1007/s00438-015-1078-7
repRNA: a web server for generating various feature vectors of RNA sequences
Abstract
With the rapid growth of RNA sequences generated in the postgenomic age, it is highly desired to develop a flexible method that can generate various kinds of vectors to represent these sequences by focusing on their different features. This is because nearly all the existing machine-learning methods, such as SVM (support vector machine) and KNN (k-nearest neighbor), can only handle vectors but not sequences. To meet the increasing demands and speed up the genome analyses, we have developed a new web server, called "representations of RNA sequences" (repRNA). Compared with the existing methods, repRNA is much more comprehensive, flexible and powerful, as reflected by the following facts: (1) it can generate 11 different modes of feature vectors for users to choose according to their investigation purposes; (2) it allows users to select the features from 22 built-in physicochemical properties and even those defined by users' own; (3) the resultant feature vectors and the secondary structures of the corresponding RNA sequences can be visualized. The repRNA web server is freely accessible to the public at http://bioinformatics.hitsz.edu.cn/repRNA/ .
Keywords: Physicochemical properties; PseAAC; PseKNC; Secondary structure of RNA; User-defined properties; repDNA; repRNA.
Similar articles
-
Pse-in-One: a web server for generating various modes of pseudo components of DNA, RNA, and protein sequences.Nucleic Acids Res. 2015 Jul 1;43(W1):W65-71. doi: 10.1093/nar/gkv458. Epub 2015 May 9. Nucleic Acids Res. 2015. PMID: 25958395 Free PMC article.
-
repDNA: a Python package to generate various modes of feature vectors for DNA sequences by incorporating user-defined physicochemical properties and sequence-order effects.Bioinformatics. 2015 Apr 15;31(8):1307-9. doi: 10.1093/bioinformatics/btu820. Epub 2014 Dec 10. Bioinformatics. 2015. PMID: 25504848
-
PseKNC: a flexible web server for generating pseudo K-tuple nucleotide composition.Anal Biochem. 2014 Jul 1;456:53-60. doi: 10.1016/j.ab.2014.04.001. Epub 2014 Apr 13. Anal Biochem. 2014. PMID: 24732113
-
Pseudo nucleotide composition or PseKNC: an effective formulation for analyzing genomic sequences.Mol Biosyst. 2015 Oct;11(10):2620-34. doi: 10.1039/c5mb00155b. Mol Biosyst. 2015. PMID: 26099739 Review.
-
Comprehensive review and assessment of computational methods for predicting RNA post-transcriptional modification sites from RNA sequences.Brief Bioinform. 2020 Sep 25;21(5):1676-1696. doi: 10.1093/bib/bbz112. Brief Bioinform. 2020. PMID: 31714956 Review.
Cited by
-
iROS-gPseKNC: Predicting replication origin sites in DNA by incorporating dinucleotide position-specific propensity into general pseudo nucleotide composition.Oncotarget. 2016 Jun 7;7(23):34180-9. doi: 10.18632/oncotarget.9057. Oncotarget. 2016. PMID: 27147572 Free PMC article.
-
Comparison of genomic data via statistical distribution.J Theor Biol. 2016 Oct 21;407:318-327. doi: 10.1016/j.jtbi.2016.07.032. Epub 2016 Jul 25. J Theor Biol. 2016. PMID: 27460589 Free PMC article.
-
Adaboost-SVM-based probability algorithm for the prediction of all mature miRNA sites based on structured-sequence features.Sci Rep. 2019 Feb 6;9(1):1521. doi: 10.1038/s41598-018-38048-7. Sci Rep. 2019. PMID: 30728425 Free PMC article.
-
Molecular classification of prostate adenocarcinoma by the integrated somatic mutation profiles and molecular network.Sci Rep. 2017 Apr 7;7(1):738. doi: 10.1038/s41598-017-00872-8. Sci Rep. 2017. PMID: 28389666 Free PMC article.
-
iSS-PC: Identifying Splicing Sites via Physical-Chemical Properties Using Deep Sparse Auto-Encoder.Sci Rep. 2017 Aug 15;7(1):8222. doi: 10.1038/s41598-017-08523-8. Sci Rep. 2017. PMID: 28811565 Free PMC article.
References
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources