Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2017 Jun 22:6:967.
doi: 10.12688/f1000research.11823.1. eCollection 2017.

Analytical challenges of untargeted GC-MS-based metabolomics and the critical issues in selecting the data processing strategy

Affiliations

Analytical challenges of untargeted GC-MS-based metabolomics and the critical issues in selecting the data processing strategy

Ting-Li Han et al. F1000Res. .

Abstract

Background: A challenge of metabolomics is data processing the enormous amount of information generated by sophisticated analytical techniques. The raw data of an untargeted metabolomic experiment are composited with unwanted biological and technical variations that confound the biological variations of interest. The art of data normalisation to offset these variations and/or eliminate experimental or biological biases has made significant progress recently. However, published comparative studies are often biased or have omissions. Methods: We investigated the issues with our own data set, using five different representative methods of internal standard-based, model-based, and pooled quality control-based approaches, and examined the performance of these methods against each other in an epidemiological study of gestational diabetes using plasma. Results: Our results demonstrated that the quality control-based approaches gave the highest data precision in all methods tested, and would be the method of choice for controlled experimental conditions. But for our epidemiological study, the model-based approaches were able to classify the clinical groups more effectively than the quality control-based approaches because of their ability to minimise not only technical variations, but also biological biases from the raw data. Conclusions: We suggest that metabolomic researchers should optimise and justify the method they have chosen for their experimental condition in order to obtain an optimal biological outcome.

Keywords: Biomarker discovery; Gas chromatography-mass spectrometry; Gestational diabetes; Metabolomics; Normalisation method.

PubMed Disclaimer

Conflict of interest statement

Competing interests: No competing interests were disclosed.

Figures

Figure 1.
Figure 1.
Agilent MassHunter ( a) Qualitative Workflows and ( b) Profinder interface. 385 components were extracted from a typical QC sample from 14.5 to 56 min, of which 62 were confidently annotated with match factor ≥ 80. Data was then exported to a CEF file. The file was then used by Profinder for batch data extraction. The Profinder tool was designed with the use of reference spectra and retention time windows to assist data extraction.
Figure 2.
Figure 2.
Multilevel principal component analysis score plots produced by the data processed with the ( a) Eigen, ( b) PQN, and ( c) LOWESS normalisation.
Figure 3.
Figure 3.. Multilevel principal component analysis PC1 loading plots (top 10 variables) corresponding to Figure 2.
( a) Eigen, ( b) PQN, and ( c) LOWESS normalisation.
Figure 4.
Figure 4.. Heat map of are under the curve (ROC) values of 62 putative metabolites.

References

    1. Mizuno H, Ueda K, Kobayashi Y, et al. : The great importance of normalization of LC-MS data for highly-accurate non-targeted metabolomics. Biomed Chromatogr. 2017;31(1): e3864. 10.1002/bmc.3864 - DOI - PubMed
    1. Lind MV, Savolainen OI, Ross AB: The use of mass spectrometry for analysing metabolite biomarkers in epidemiology: methodological and statistical considerations for application to large numbers of biological samples. Eur J Epidemiol. 2016;31(8):717–33. 10.1007/s10654-016-0166-2 - DOI - PubMed
    1. Filzmoser P, Walczak B: What can go wrong at the data normalization step for identification of biomarkers? J Chromatogr A. 2014;1362:194–205. 10.1016/j.chroma.2014.08.050 - DOI - PubMed
    1. Wu Y, Li L: Sample normalization methods in quantitative metabolomics. J Chromatogr A. 2016;1430:80–95. 10.1016/j.chroma.2015.12.007 - DOI - PubMed
    1. De Livera AM, Sysi-Aho M, Jacob L, et al. : Statistical methods for handling unwanted variation in metabolomics data. Anal Chem. 2015;87(7):3606–15. 10.1021/ac502439y - DOI - PMC - PubMed

LinkOut - more resources