Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct 1;25(4):1178-1194.
doi: 10.1093/biostatistics/kxae007.

Functional support vector machine

Affiliations

Functional support vector machine

Shanghong Xie et al. Biostatistics. .

Erratum in

  • Correction.
    [No authors listed] [No authors listed] Biostatistics. 2024 Dec 31;26(1):kxae029. doi: 10.1093/biostatistics/kxae029. Biostatistics. 2024. PMID: 39186534 Free PMC article. No abstract available.

Abstract

Linear and generalized linear scalar-on-function modeling have been commonly used to understand the relationship between a scalar response variable (e.g. continuous, binary outcomes) and functional predictors. Such techniques are sensitive to model misspecification when the relationship between the response variable and the functional predictors is complex. On the other hand, support vector machines (SVMs) are among the most robust prediction models but do not take account of the high correlations between repeated measurements and cannot be used for irregular data. In this work, we propose a novel method to integrate functional principal component analysis with SVM techniques for classification and regression to account for the continuous nature of functional data and the nonlinear relationship between the scalar response variable and the functional predictors. We demonstrate the performance of our method through extensive simulation experiments and two real data applications: the classification of alcoholics using electroencephalography signals and the prediction of glucobrassicin concentration using near-infrared reflectance spectroscopy. Our methods especially have more advantages when the measurement errors in functional predictors are relatively large.

Keywords: EEG; functional data analysis; functional principal component analysis; scalar-on-function modeling; support vector machine.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1:
Figure 1:
Simulation performance for binary outcomes of Scenarios 1–2. Left column: ηijN(0,1); right column: ηijN(0,10).
Figure 2:
Figure 2:
Simulation performance for binary outcomes of Scenarios 3–5. Left column: ηijN(0,1); right column: ηijN(0,10).
Figure 3:
Figure 3:
Simulation performance for continuous outcomes of Scenarios 1–4. Left column: ηijN(0,1); Right column: ηijN(0,10). : the RMSEs of SVR linear were large and exceeded the figure when N=50,J=40.
Figure 4:
Figure 4:
Waveform, classification accuracy, and AUC of EEG study. a) and b) Amplitude of the difference between matching condition and nonmatching condition of all subjects at P1 and P5 electrodes at 1 to 256 Hz, respectively. Solid lines: alcoholics; Dashed lines: controls. c) and d) Accuracy of classifying alcoholics and controls by P1 and P5 waveforms across 100 bootstrap samples. e) and f) AUC of classifying alcoholics and controls by P1 and P5 waveforms across 100 bootstrap samples.
Figure 5:
Figure 5:
NIR spectra and root mean square error (RMSE) of NIR study. a) NIR spectra of all samples. Solid line: NIR spectra for each sample; Dashed line: average NIR spectra over samples. b) and c) RMSE of predicting dry and fresh weight GBS concentrations across 100 bootstrap samples.

References

    1. Berlinet A, Thomas-Agnan C. Reproducing kernel Hilbert spaces in probability and statistics. New York, NY: Springer Science & Business Media; 2011.
    1. Besse P, Ramsay JO. Principal components analysis of sampled functions. Psychometrika. 1986:51:285–311.
    1. Bishop CM, Nasrabadi NM. Pattern recognition and machine learning. vol. 4. New York, NY: Springer; 2006.
    1. Caputo B, Sim K, Furesjo F, Smola A. Appearance-based object recognition using SVMs: which kernel should I use? In Proceedings of NIPS workshop on Statistical methods for computational experiments in visual processing and computer vision, Whistler. 2002.
    1. Font R, Del Rio-Celestino M, Rosa E, Aires A, De Haro-Bailon A. Glucosinolate assessment in brassica oleracea leaves by near-infrared spectroscopy. J Agric Sci. 2005:143(1):65–73.