Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Mar 4;38(6):1593-1599.
doi: 10.1093/bioinformatics/btab854.

BATL: Bayesian annotations for targeted lipidomics

Affiliations

BATL: Bayesian annotations for targeted lipidomics

Justin G Chitpin et al. Bioinformatics. .

Abstract

Motivation: Bioinformatic tools capable of annotating, rapidly and reproducibly, large, targeted lipidomic datasets are limited. Specifically, few programs enable high-throughput peak assessment of liquid chromatography-electrospray ionization tandem mass spectrometry data acquired in either selected or multiple reaction monitoring modes.

Results: We present here Bayesian Annotations for Targeted Lipidomics, a Gaussian naïve Bayes classifier for targeted lipidomics that annotates peak identities according to eight features related to retention time, intensity, and peak shape. Lipid identification is achieved by modeling distributions of these eight input features across biological conditions and maximizing the joint posterior probabilities of all peak identities at a given transition. When applied to sphingolipid and glycerophosphocholine selected reaction monitoring datasets, we demonstrate over 95% of all peaks are rapidly and correctly identified.

Availability and implementation: BATL software is freely accessible online at https://complimet.ca/batl/ and is compatible with Safari, Firefox, Chrome and Edge.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Common challenges associated with SRM, MRM and PRM peak identification. (a) Ambiguity occurs when multiple lipid isomers, isobars, and isotopes are detected within the same matrix at a given transition, yet technical variations in flow rate, composition of the mobile phase, temperature, pH, etc., cause their retention times to vary across samples. Data represent XICs of the same matrix (murine plasma) in animals fed different diets. Note six peaks are observed in one sample at a given transition. Seven peaks are observed in a different sample shifted by 1 min. Matching retention time would not align these shifted species. (b) Assigning lipid identities based on peak elution order (picking the nth eluting peak) will also lead to misidentifications when comparing lipid species across matrices. Data represent XICs of plasma and brain (temporal cortex) lipidomes from the same animal. Note both the retention time shift and the fundamentally different number of species within each matrix. Matching by either retention time or peak elution order would confound identification. (c) Matching lipids based on peak intensity features is complicated by pathological changes detected in lipid metabolism. Data represent XICs of the human plasma lipidome of patients with different neurodegenerative diseases. Note the marked change in abundances between conditions that impacts on lipid identification. While algorithms exist to address each of these challenges, few are applicable to datasets wherein all differences manifest simultaneously. BATL addresses these challenges
Fig. 2.
Fig. 2.
Schematic of the BATL lipid identification workflow. BATL follows three steps: (i) users are asked to identify training datasets for which they have unambiguous knowledge of peak identities. (ii) These datasets are used to train BATL, constructing a naïve Bayes statistical model based on the peak features users select. (iii) The model and associated metadata are used by the BATL algorithm to annotate peaks in subsequent query SRM, MRM or PRM datasets
Fig. 3.
Fig. 3.
Classifier performance on 10-fold cross validation sphingolipid and glycerophosphocholine datasets. The 95% confidence intervals are shown in panels (a, b, d and e). In a and b),data represent mean accuracies of BATL models trained on retention time with each decision rule and retention time mean/window matching algorithms for (a) sphingolipids or (b) glycerophosphocholines (***Q <0.001, t-test adjusted with the Benjamini–Hochberg method of all models against the MWBM decision rule). (c) Lipid assignment differences between MAP, constrained MAP, and MWBM decision rules during cross validation and trained using retention time. In the top panel, data represent the Gaussian likelihoods of five glycerophosphocholine isomers based on the retention time feature. The rows of gray dots indicate the retention times of four peaks from the same sample in the validation set. Each row indicates the outcome of the three decision rules. Arrows indicate the lipid assignments; checkmarks indicate correct assignments; and Xs indicate incorrect assignments. The numbers for constrained MAP indicate the order of peak assignments. In d and e,data represent mean accuracies of the BATL models using MWBM decision rule trained on several features and feature combinations for (d) sphingolipids or (e) glycerophosphocholines. The feature name codes are described in Table  1

References

    1. Alecu I., Bennett S.A.L. (2019) Dysregulated lipid metabolism and Its role in alpha-synucleinopathy in Parkinson's disease. Front. Neurosci., 13, 328. - PMC - PubMed
    1. Alshehry Z.H. et al. (2016) Plasma lipidomic profiles improve on traditional risk factors for the prediction of cardiovascular events in type 2 diabetes mellitus. Circulation, 134, 1637–1650. - PubMed
    1. Blasco H. et al. (2017) Lipidomics reveals cerebrospinal-fluid signatures of ALS. Sci. Rep., 7, 17652. - PMC - PubMed
    1. Bowden J.A. et al. (2017) Harmonizing lipidomics: NIST interlaboratory comparison exercise for lipidomics using SRM 1950-metabolites in frozen human plasma. J. Lipid Res., 58, 2275–2288. - PMC - PubMed
    1. Bowden J.A. et al. (2018) NIST lipidomics workflow questionnaire: an assessment of community-wide methodologies and perspectives. Metabolomics, 14, 53. - PMC - PubMed

Publication types