On Supervised Classification of Feature Vectors with Independent and Non-Identically Distributed Elements
- PMID: 34441185
- PMCID: PMC8391840
- DOI: 10.3390/e23081045
On Supervised Classification of Feature Vectors with Independent and Non-Identically Distributed Elements
Abstract
In this paper, we investigate the problem of classifying feature vectors with mutually independent but non-identically distributed elements that take values from a finite alphabet set. First, we show the importance of this problem. Next, we propose a classifier and derive an analytical upper bound on its error probability. We show that the error probability moves to zero as the length of the feature vectors grows, even when there is only one training feature vector per label available. Thereby, we show that for this important problem at least one asymptotically optimal classifier exists. Finally, we provide numerical examples where we show that the performance of the proposed classifier outperforms conventional classification algorithms when the number of training data is small and the length of the feature vectors is sufficiently high.
Keywords: analytical error probability; independent and non-identically distributed features; supervised classification.
Conflict of interest statement
The authors declare no conflict of interest.
Figures











References
-
- Shalev-Shwartz S., Ben-David S. Understanding Machine Learning: From Theory to Algorithms. Cambridge University Press; Cambridge, MA, USA: 2014.
-
- Quinlan J.R. Learning Efficient Classification Procedures and Their Application to Chess End Games. Springer; Berlin/Heidelberg, Germany: 1983. pp. 453–482.
-
- Breiman L., Friedman J.H., Olshen R.A., Stone C.J. Classification and Regression Trees. Wadsworth and Brooks; Monterey, CA, USA: 1984.
-
- Boser B.E., Guyon I.M., Vapnik V.N. A training algorithm for optimal margin classifiers; Proceedings of the Fifth Annual Workshop on Computational Learning Theory; San Jose, CA, USA. 27–29 July 1992; pp. 144–152.
-
- Cortes C., Vapnik V. Support-vector networks. Mach. Learn. 1995;20:273–297. doi: 10.1007/BF00994018. - DOI