Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Feb 19:9:106.
doi: 10.1186/1471-2105-9-106.

K-OPLS package: kernel-based orthogonal projections to latent structures for prediction and interpretation in feature space

Affiliations

K-OPLS package: kernel-based orthogonal projections to latent structures for prediction and interpretation in feature space

Max Bylesjö et al. BMC Bioinformatics. .

Abstract

Background: Kernel-based classification and regression methods have been successfully applied to modelling a wide variety of biological data. The Kernel-based Orthogonal Projections to Latent Structures (K-OPLS) method offers unique properties facilitating separate modelling of predictive variation and structured noise in the feature space. While providing prediction results similar to other kernel-based methods, K-OPLS features enhanced interpretational capabilities; allowing detection of unanticipated systematic variation in the data such as instrumental drift, batch variability or unexpected biological variation.

Results: We demonstrate an implementation of the K-OPLS algorithm for MATLAB and R, licensed under the GNU GPL and available at http://www.sourceforge.net/projects/kopls/. The package includes essential functionality and documentation for model evaluation (using cross-validation), training and prediction of future samples. Incorporated is also a set of diagnostic tools and plot functions to simplify the visualisation of data, e.g. for detecting trends or for identification of outlying samples. The utility of the software package is demonstrated by means of a metabolic profiling data set from a biological study of hybrid aspen.

Conclusion: The properties of the K-OPLS method are well suited for analysis of biological data, which in conjunction with the availability of the outlined open-source package provides a comprehensive solution for kernel-based analysis in bioinformatics applications.

PubMed Disclaimer

Figures

Figure 1
Figure 1
K-OPLS model properties of the NMR-based metabolic profiling data set. Each point represents a measured observation (biological sample).The size of each glyph in the figure is proportional to the internode number 1–8, denoting a growth gradient. In (A), the K-OPLS predictive score vector t1p is plotted against the first Y-orthogonal score vector t1o. In (B), the first K-OPLS Y-orthogonal score vector t1o is plotted against the second Y-orthogonal score vector t2o. An approximate joint internode gradient, formed by a linear combination of both vectors, is shown using the dashed arrow. In (C), the first K-OPLS Y-orthogonal score vector t1o is plotted against the second Y-orthogonal score vector t2o only for the mutant samples, colour-coded by biological replicate. Biological replicate A displays a deviating behaviour compared to biological replicates B and C; trajectory shown by the dashed line. In (D), the first KPLS latent variable t1 is plotted against the second latent variable t2. The discriminatory direction is now a linear combination of both of the latent variables.

References

    1. Trygg J, Wold S. Orthogonal projections to latent structures (O-PLS) J Chemometrics. 2002;16:119–128. doi: 10.1002/cem.695. - DOI
    1. Trygg J, Wold S. O2-PLS, a two-block (X-Y) latent variable regression (LVR) method with an integral OSC filter. J Chemometrics. 2003;17:53–64. doi: 10.1002/cem.775. - DOI
    1. Bylesjö M, Eriksson D, Sjödin A, Jansson S, Moritz T, Trygg J. Orthogonal Projections to Latent Structures as a Strategy for Microarray Data Normalization. BMC Bioinformatics. 2007;8:207. doi: 10.1186/1471-2105-8-207. - DOI - PMC - PubMed
    1. Bylesjö M, Rantalainen M, Cloarec O, Nicholson JK, Holmes E, Trygg J. OPLS discriminant analysis: combining the strengths of PLS-DA and SIMCA classification. J Chemometrics. 2006;20:341–351. doi: 10.1002/cem.1006. - DOI
    1. Cloarec O, Dumas ME, Trygg J, Craig A, Barton RH, Lindon JC, Nicholson JK, Holmes E. Evaluation of the orthogonal projection on latent structure model limitations caused by chemical shift variability and improved visualization of biomarker changes in 1H NMR spectroscopic metabonomic studies. Anal Chem. 2005;77:517–526. doi: 10.1021/ac048803i. - DOI - PubMed

Publication types

LinkOut - more resources