Transforming Big Data into AI-ready data for nutrition and obesity research
- PMID: 38426232
- PMCID: PMC11180473
- DOI: 10.1002/oby.23989
Transforming Big Data into AI-ready data for nutrition and obesity research
Abstract
Objective: Big Data are increasingly used in obesity and nutrition research to gain new insights and derive personalized guidance; however, this data in raw form are often not usable. Substantial preprocessing, which requires machine learning (ML), human judgment, and specialized software, is required to transform Big Data into artificial intelligence (AI)- and ML-ready data. These preprocessing steps are the most complex part of the entire modeling pipeline. Understanding the complexity of these steps by the end user is critical for reducing misunderstanding, faulty interpretation, and erroneous downstream conclusions.
Methods: We reviewed three popular obesity/nutrition Big Data sources: microbiome, metabolomics, and accelerometry. The preprocessing pipelines, specialized software, challenges, and how decisions impact final AI- and ML-ready products were detailed.
Results: Opportunities for advances to improve quality control, speed of preprocessing, and intelligent end user consumption were presented.
Conclusions: Big Data have the exciting potential for identifying new modifiable factors that impact obesity research. However, to ensure accurate interpretation of conclusions arising from Big Data, the choices involved in preparing AI- and ML-ready data need to be transparent to investigators and clinicians relying on the conclusions.
© 2024 The Obesity Society. This article has been contributed to by U.S. Government employees and their work is in the public domain in the USA.
Conflict of interest statement
Figures
References
-
- Tilly C. The Old New Social History and the New Old Social History. Review (Fernand Braudel Center). 1984;7(3):363–406.
Publication types
MeSH terms
Grants and funding
- 1U24DK131617-01/NH/NIH HHS/United States
- T32 HD113301/HD/NICHD NIH HHS/United States
- U54TR004279/NH/NIH HHS/United States
- U24 CA268228/CA/NCI NIH HHS/United States
- UG1HD107688/NH/NIH HHS/United States
- P30 DK056350/DK/NIDDK NIH HHS/United States
- EW22-7278/US Department of Defense
- UG1 HD107688/HD/NICHD NIH HHS/United States
- U54 TR004279/TR/NCATS NIH HHS/United States
- U24 CA268153/CA/NCI NIH HHS/United States
- U24 DK131617/DK/NIDDK NIH HHS/United States
- U24 HD107676/HD/NICHD NIH HHS/United States
- U24CA268153/NH/NIH HHS/United States
LinkOut - more resources
Full Text Sources
Medical
