Review
Obesity (Silver Spring). 2024 May;32(5):857-870.
doi: 10.1002/oby.23989. Epub 2024 Mar 1.

Transforming Big Data into AI-ready data for nutrition and obesity research

Diana M Thomas et al. Obesity (Silver Spring). 2024 May.

Abstract

Objective: Big Data are increasingly used in obesity and nutrition research to gain new insights and derive personalized guidance; however, these data are often not usable in raw form. Substantial preprocessing, which requires machine learning (ML), human judgment, and specialized software, is needed to transform Big Data into artificial intelligence (AI)- and ML-ready data. These preprocessing steps are the most complex part of the entire modeling pipeline. End users must understand this complexity to avoid misunderstanding, faulty interpretation, and erroneous downstream conclusions.

Methods: We reviewed three popular obesity/nutrition Big Data sources: microbiome, metabolomics, and accelerometry. For each source, we detailed the preprocessing pipelines, the specialized software, the challenges, and how preprocessing decisions affect the final AI- and ML-ready products.

Results: We presented opportunities to improve quality control, preprocessing speed, and intelligent end-user consumption.

Conclusions: Big Data have the exciting potential for identifying new modifiable factors that impact obesity research. However, to ensure accurate interpretation of conclusions arising from Big Data, the choices involved in preparing AI- and ML-ready data need to be transparent to investigators and clinicians relying on the conclusions.
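To make the point about preprocessing choices concrete, the following is a minimal sketch, not the authors' pipeline, of how one decision (epoch length) can change an accelerometry-derived summary. The signal, the threshold, and the function names `epoch_means` and `active_seconds` are all hypothetical and chosen only for illustration.

```python
from statistics import mean


def epoch_means(samples, epoch_len):
    """Average consecutive samples into non-overlapping epochs.

    A trailing partial epoch is dropped (itself a preprocessing choice).
    """
    n_full = len(samples) // epoch_len
    return [
        mean(samples[i * epoch_len:(i + 1) * epoch_len])
        for i in range(n_full)
    ]


def active_seconds(samples, epoch_len, threshold):
    """Total seconds in epochs whose mean is at or above the threshold.

    Assumes one sample per second, so an epoch of N samples lasts N seconds.
    """
    return sum(
        epoch_len for value in epoch_means(samples, epoch_len)
        if value >= threshold
    )


# Hypothetical signal: 10 s of movement followed by 50 s of rest, five times.
# Values stand in for a movement-intensity metric; they are not real data.
signal = ([0.30] * 10 + [0.0] * 50) * 5

THRESHOLD = 0.10  # hypothetical cut point for "active"

fine = active_seconds(signal, epoch_len=10, threshold=THRESHOLD)
coarse = active_seconds(signal, epoch_len=60, threshold=THRESHOLD)

print(f"10 s epochs: {fine} active seconds")    # 50
print(f"60 s epochs: {coarse} active seconds")  # 0
```

In this toy case the same raw signal yields 50 active seconds with 10-second epochs and none with 60-second epochs, because short bursts are averaged away in the longer window. A downstream model would see two different "AI-ready" inputs depending on a setting that is rarely reported alongside the results.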


Conflict of interest statement

Conflicts of Interest: The authors declare no conflicts of interest.

Figures

Figure 1: Pipeline flow chart depicting the transformation from raw microbiome data to data that is AI ready.
Figure 2a: Pipeline flow chart depicting the transformation from raw mass spectrometry metabolomics data to data that is AI ready (EIC = extracted ion chromatogram).
Figure 2b: Pipeline flow chart depicting the transformation from NMR spectroscopy metabolomics data to data that is AI ready. The data matrix can be prepared using binned data or annotated/library-matched data. For the binned data approach, library matching/database search is performed on select important NMR bins.
Figure 3: Pipeline flow chart depicting the transformation from raw accelerometry data to data that is AI ready for physical activity analysis.

