metaboprep: an R package for preanalysis data description and processing
- PMID: 35134881
- PMCID: PMC8963298
- DOI: 10.1093/bioinformatics/btac059
Abstract
Motivation: Metabolomics is an increasingly common part of health research, and there is a need for preanalytical data processing. Researchers typically need to characterize the data and to exclude errors within the context of the intended analysis. Whilst some preprocessing steps are common, there is currently a lack of standardization and reporting transparency for these procedures.
Results: Here, we introduce metaboprep, a standardized data processing workflow to extract and characterize high-quality metabolomics datasets. The package extracts data from preformed worksheets, provides summary statistics and enables the user to select samples and metabolites for their analysis based on a set of quality metrics. A report summarizing the quality metrics and the influence of available batch variables on the data is generated for the purpose of open disclosure. Where possible, we give users flexibility in defining their own selection thresholds.
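The kind of threshold-based selection described above can be illustrated with a minimal sketch in base R. It does not use the metaboprep API, and everything in it is an assumption made for illustration: the simulated abundance matrix, the choice of missingness as the only quality metric, and the two threshold values, which are not metaboprep defaults.

```r
# Illustrative sketch only: base R, NOT the metaboprep API.
# Rows are samples, columns are metabolites; NA marks a missing measurement.
set.seed(1)
abund <- matrix(rlnorm(60), nrow = 10, ncol = 6,
                dimnames = list(paste0("s", 1:10), paste0("m", 1:6)))
abund[1:5, "m2"] <- NA                          # m2 missing in half of the samples
abund["s10", c("m1", "m3", "m4", "m5")] <- NA   # s10 missing most metabolites

# Hypothetical user-defined thresholds (assumed values for this example)
max_feature_missing <- 0.2
max_sample_missing  <- 0.2

# Step 1: flag metabolites whose missingness is within the threshold
feature_missing <- colMeans(is.na(abund))
keep_features   <- feature_missing <= max_feature_missing

# Step 2: recompute sample missingness using only the retained metabolites
sample_missing <- rowMeans(is.na(abund[, keep_features, drop = FALSE]))
keep_samples   <- sample_missing <= max_sample_missing

# Step 3: the analysis-ready subset
filtered <- abund[keep_samples, keep_features, drop = FALSE]

# A small record of what was excluded, in the spirit of open reporting
excluded <- list(
  metabolites = colnames(abund)[!keep_features],
  samples     = rownames(abund)[!keep_samples]
)
```

Filtering metabolites first and then recomputing sample missingness is one possible ordering; it avoids excluding samples solely because of metabolites that would be dropped anyway. Keeping the `excluded` record alongside the thresholds makes the selection reproducible and reportable.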
Availability and implementation: metaboprep is an open-source R package available at https://github.com/MRCIEU/metaboprep.
Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author(s) 2022. Published by Oxford University Press.
Grants and funding
- 669545/ERC_/European Research Council/International
- NF-0616-10102/DH_/Department of Health/United Kingdom
- MC_PC_21038/MRC_/Medical Research Council/United Kingdom
- MC_PC_15018/MRC_/Medical Research Council/United Kingdom
- WT 217065/Z/19/Z/WT_/Wellcome Trust/United Kingdom
- MC_UU_00011/6/MRC_/Medical Research Council/United Kingdom
- FS/17/60/33474/BHF_/British Heart Foundation/United Kingdom
- WT101597MA/WT_/Wellcome Trust/United Kingdom
- 202802/Z/16/Z/WT_/Wellcome Trust/United Kingdom
- CH/F/20/90003/BHF_/British Heart Foundation/United Kingdom
- AA/18/7/34219/BHF_/British Heart Foundation/United Kingdom
- MR/N024397/1/MRC_/Medical Research Council/United Kingdom
- CS/16/4/32482/BHF_/British Heart Foundation/United Kingdom
- 29019/CRUK_/Cancer Research UK/United Kingdom
- MC_UU_00011/1/MRC_/Medical Research Council/United Kingdom
- C18281/A29019/CRUK_/Cancer Research UK/United Kingdom