A systematic comparison of supervised classifiers
- PMID: 24763312
- PMCID: PMC3998948
- DOI: 10.1371/journal.pone.0094137
Abstract
Pattern recognition has been employed in a myriad of industrial, commercial and academic applications, and many techniques have been devised to tackle this diversity. Despite the long tradition of pattern recognition research, no technique yields the best classification in all scenarios. Therefore, as many techniques as possible should be considered in high-accuracy applications. Related work typically either focuses on the performance of a given algorithm or compares various classification methods. On many occasions, however, researchers who are not experts in machine learning have to deal with practical classification tasks without in-depth knowledge of the underlying parameters. The adequate choice of classifiers and parameters in such practical circumstances is a long-standing problem and one of the subjects of the current paper. We carried out a performance study of nine well-known classifiers implemented in the Weka framework and compared the influence of the parameter configurations on the accuracy. The default configuration of parameters in Weka was found to provide near-optimal performance in most cases, with the exception of methods such as the support vector machine (SVM). In addition, the k-nearest neighbor method frequently achieved the best accuracy. Under certain conditions, it was possible to improve the quality of the SVM by more than 20% with respect to its default parameter configuration.
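The kind of comparison the abstract describes, a classifier run with a default parameter value versus a small sweep over alternatives, can be sketched as follows. This is a minimal illustration only and not the authors' protocol: it uses a hand-written k-nearest neighbor classifier in plain Python rather than Weka, the two-class Gaussian data set is synthetic, and the names `make_data`, `knn_predict`, `cross_val_accuracy`, `sweep` and the choice of `default_k=1` are all assumptions made for the example.

```python
import random
from collections import Counter


def make_data(n_per_class=60, seed=0):
    """Two synthetic 2-D Gaussian classes (illustrative data, not from the paper)."""
    rng = random.Random(seed)
    data = []
    for label, center in ((0, (0.0, 0.0)), (1, (2.0, 2.0))):
        for _ in range(n_per_class):
            point = (rng.gauss(center[0], 1.0), rng.gauss(center[1], 1.0))
            data.append((point, label))
    rng.shuffle(data)
    return data


def knn_predict(train, point, k):
    """Majority vote among the k training points closest to `point`."""
    def sq_dist(item):
        (x, y), _ = item
        return (x - point[0]) ** 2 + (y - point[1]) ** 2

    nearest = sorted(train, key=sq_dist)[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]


def cross_val_accuracy(data, k, folds=10):
    """Accuracy of kNN estimated with `folds`-fold cross-validation."""
    correct = 0
    for f in range(folds):
        test = data[f::folds]  # every `folds`-th item, starting at f
        train = [item for i, item in enumerate(data) if i % folds != f]
        correct += sum(knn_predict(train, p, k) == y for p, y in test)
    return correct / len(data)


def sweep(data, k_values=(1, 3, 5, 7, 9), default_k=1):
    """Return per-k accuracies, the best k, and the gain over the assumed default."""
    scores = {k: cross_val_accuracy(data, k) for k in k_values}
    best_k = max(scores, key=scores.get)
    return scores, best_k, scores[best_k] - scores[default_k]


if __name__ == "__main__":
    scores, best_k, gain = sweep(make_data())
    for k, acc in scores.items():
        print(f"k={k}: accuracy={acc:.3f}")
    print(f"best k={best_k}, gain over default={gain:.3f}")
```

A small gain over the default in such a sweep would be in line with the abstract's observation that defaults are often near-optimal, while a large gain would resemble the SVM case; the synthetic data here says nothing about which outcome holds on real data sets.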
Similar articles
- Computer-assisted lip diagnosis on Traditional Chinese Medicine using multi-class support vector machines. BMC Complement Altern Med. 2012 Aug 16;12:127. doi: 10.1186/1472-6882-12-127. PMID: 22898352. Free PMC article.
- Classification of THz pulse signals using two-dimensional cross-correlation feature extraction and non-linear classifiers. Comput Methods Programs Biomed. 2016 Apr;127:64-82. doi: 10.1016/j.cmpb.2016.01.017. Epub 2016 Feb 1. PMID: 27000290.
- Reviewing ensemble classification methods in breast cancer. Comput Methods Programs Biomed. 2019 Aug;177:89-112. doi: 10.1016/j.cmpb.2019.05.019. Epub 2019 May 20. PMID: 31319964. Review.
- Prediction of heart disease and classifiers' sensitivity analysis. BMC Bioinformatics. 2020 Jul 2;21(1):278. doi: 10.1186/s12859-020-03626-y. PMID: 32615980. Free PMC article.
- A Review on Machine Learning for EEG Signal Processing in Bioengineering. IEEE Rev Biomed Eng. 2021;14:204-218. doi: 10.1109/RBME.2020.2969915. Epub 2021 Jan 22. PMID: 32011262. Review.
Cited by
- Choosing the Most Effective Pattern Classification Model under Learning-Time Constraint. PLoS One. 2015 Jun 26;10(6):e0129947. doi: 10.1371/journal.pone.0129947. eCollection 2015. PMID: 26114552. Free PMC article.
- Probing the topological properties of complex networks modeling short written texts. PLoS One. 2015 Feb 26;10(2):e0118394. doi: 10.1371/journal.pone.0118394. eCollection 2015. PMID: 25719799. Free PMC article.
- Clustering algorithms: A comparative approach. PLoS One. 2019 Jan 15;14(1):e0210236. doi: 10.1371/journal.pone.0210236. eCollection 2019. PMID: 30645617. Free PMC article.
- Using full-text content to characterize and identify best seller books: A study of early 20th-century literature. PLoS One. 2024 Apr 26;19(4):e0302070. doi: 10.1371/journal.pone.0302070. eCollection 2024. PMID: 38669247. Free PMC article.
- Authorship attribution based on Life-Like Network Automata. PLoS One. 2018 Mar 22;13(3):e0193703. doi: 10.1371/journal.pone.0193703. eCollection 2018. PMID: 29566100. Free PMC article.