Biological applications of support vector machines
- PMID: 15606969
- DOI: 10.1093/bib/5.4.328
Abstract
One of the major tasks in bioinformatics is the classification and prediction of biological data. With the rapid growth of biological databanks, it is essential to use computer programs to automate the classification process. At present, the computer programs that give the best prediction performance are support vector machines (SVMs). This is because SVMs are designed to maximise the margin separating two classes, so that the trained model generalises well to unseen data. Most other computer programs build a classifier by minimising the training error alone, which leads to poorer generalisation. For this reason, SVMs have been widely applied in many areas of bioinformatics, including protein function prediction, protease functional site recognition, transcription initiation site prediction and gene expression data classification. This paper discusses the principles of SVMs and their application to the analysis of biological data, mainly protein and DNA sequences.
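The margin-maximisation idea described in the abstract can be illustrated with a minimal sketch. It is not the method of the paper: it assumes a linear soft-margin SVM trained by plain subgradient descent on the L2-regularised hinge loss, a one-hot encoding of DNA bases, and a made-up toy data set of six-base sequences. The function names (`one_hot`, `train_linear_svm`, `predict`), the hyperparameters and the sequences are all hypothetical and chosen only for illustration.

```python
def one_hot(seq):
    """Encode a DNA string as a flat 0/1 vector, four entries per base."""
    alphabet = "ACGT"
    vec = []
    for base in seq:
        vec.extend(1.0 if base == a else 0.0 for a in alphabet)
    return vec


def train_linear_svm(X, y, lam=0.01, lr=0.1, epochs=50):
    """Subgradient descent on lam/2 * ||w||^2 + hinge loss (assumed settings)."""
    w = [0.0] * len(X[0])
    b = 0.0
    for _ in range(epochs):
        for xi, yi in zip(X, y):
            score = sum(wj * xj for wj, xj in zip(w, xi)) + b
            # The regularisation term shrinks w; a small ||w|| means a wide margin.
            w = [(1.0 - lr * lam) * wj for wj in w]
            # Hinge loss: only points inside the margin (y * score < 1) push back.
            if yi * score < 1.0:
                w = [wj + lr * yi * xj for wj, xj in zip(w, xi)]
                b += lr * yi
    return w, b


def predict(w, b, seq):
    """Return +1 or -1 for a DNA string of the same length as the training data."""
    score = sum(wj * xj for wj, xj in zip(w, one_hot(seq))) + b
    return 1 if score >= 0 else -1


# Hypothetical toy data: positives start with TATA, negatives with GCGC.
sequences = ["TATAAC", "TATAGG", "TATACT", "TATATG",
             "GCGCAC", "GCGCGG", "GCGCCT", "GCGCTG"]
labels = [1, 1, 1, 1, -1, -1, -1, -1]

w, b = train_linear_svm([one_hot(s) for s in sequences], labels)
held_out = {s: predict(w, b, s) for s in ("TATACC", "GCGCAA")}
```

Here `held_out` maps two sequences that were not used in training to their predicted class. In practice a kernel SVM from an established library would normally be used; the linear form is shown only because it makes the margin condition `y * score < 1` explicit.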