Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2013 Feb 15;433(2):102-4.
doi: 10.1016/j.ab.2012.10.011. Epub 2012 Oct 15.

Utilities for quantifying separation in PCA/PLS-DA scores plots

Affiliations

Utilities for quantifying separation in PCA/PLS-DA scores plots

Bradley Worley et al. Anal Biochem. .

Abstract

Metabolic fingerprinting studies rely on interpretations drawn from low-dimensional representations of spectral data generated by methods of multivariate analysis such as principal components analysis and projection to latent structures discriminant analysis. The growth of metabolic fingerprinting and chemometric analyses involving these low-dimensional scores plots necessitates the use of quantitative statistical measures to describe significant differences between experimental groups. Our updated version of the PCAtoTree software provides methods to reliably visualize and quantify separations in scores plots through dendrograms employing both nonparametric and parametric hypothesis testing to assess node significance, as well as scores plots identifying 95% confidence ellipsoids for all experimental groups.

PubMed Disclaimer

Figures

Figure 1
Figure 1
(a) 2D OPLS-DA scores plot illustrating 95% confidence ellipses for data having one predictive and one orthogonal PLS component. The symbol shape and color of each point correspond to the groups in Figure 2. Discrimination in the first component is between wild-type and antibiotic-treated Mycobacterium smegmatis, and separations along the second component indicate metabolic differences between various antibiotic treatments. The antibiotics cluster together based on a shared biological target (cell wall synthesis, mycolic acid biosynthesis, or transcription, translation and DNA supercoiling). Three compounds of unknown in vivo activity were shown to cluster together with inhibitors of cell wall synthesis inferring a potential biological target. Interestingly, the M. smegmatis strain is resistant to ampicillin resulting in the ampicillin-treated cells clustering closer to untreated cells. (b) 3D PCA scores plot with superimposed 95% confidence ellipsoids drawn as meshes containing group points. The ellipses and ellipsoids define the statistical significance of class separation and provide an illustration where two groups actually belong to the same biological classification. Group ‘SN’ refers to mock-transfected pancreatic cancer cells grown as a control group, while ‘SM’ refers to MUC1-overexpressing pancreatic cancer cells. Separations in scores space relate to metabolic differences in pancreatic cancer due to MUC1 overexpression.
Figure 2
Figure 2
(a) Dendrogram generated using Euclidean distances between group means from the OPLS-DA scores in Figure 1(a). Bootstrap statistics reported at each branch are for 5,000 bootstrap iterations. (b) Dendrogram generated from identical scores using Mahalanobis distances, with p -values for the null hypothesis reported at each branch.

References

    1. Gebregiworgis T, Powers R. Application of NMR Metabolomics to Search for Human Disease Biomarkers. Comb. Chem. High Throughput Screening. 2012;15:595–610. - PMC - PubMed
    1. Zhang B, Powers R. Using NMR-based metabolomics to study the regulation of biofilm formation. Future Med. Chem. 2012;4:1273–1306. - PMC - PubMed
    1. Werth MT, Halouska S, Shortridge MD, Zhang B, Powers R. Analysis of metabolomic PCA data using tree diagrams. Anal. Biochem. 2010;399:58–63. - PMC - PubMed
    1. Retief JD. Phylogenetic analysis using. Methods Mol. Biol. (Totowa, N. J.) 2000;132:243–258. - PubMed
    1. Jolliffe IT. Principal Component Analysis. New York: Springer; 2002.

Publication types