Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2024 Mar 27;25(3):bbae213.
doi: 10.1093/bib/bbae213.

Towards multi-omics synthetic data integration

Affiliations
Review

Towards multi-omics synthetic data integration

Kumar Selvarajoo et al. Brief Bioinform. .

Abstract

Across many scientific disciplines, the development of computational models and algorithms for generating artificial or synthetic data is gaining momentum. In biology, there is a great opportunity to explore this further as more and more big data at multi-omics level are generated recently. In this opinion, we discuss the latest trends in biological applications based on process-driven and data-driven aspects. Moving ahead, we believe these methodologies can help shape novel multi-omics-scale cellular inferences.

Keywords: data-driven; machine learning; multi-omics; process-driven; synthetic data.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Synthetic data generation. (A) Process-driven. Left: for bulk cells an initial signaling model is developed using known mechanistic biochemical reactions and the corresponding experimental data for TRAIL signaling [6]. Right: the bulk validated TRAIL model is used to generate 1000 single cells data synthetically [8]. (B) Data-driven. Left: statistical models are used to general transcriptome-wide expression data scatter plots for bulk and single cells [10]. Right: a VAE-based model used to generate UMAP plots of 68 K single cells data, top (real) and bottom (synthetic using ACTIVA [11]).

References

    1. Giuffrè M, Shung DL. Harnessing the power of synthetic data in healthcare: innovation, application, and privacy. NPJ Digit Med 2023;6(1):186. - PMC - PubMed
    1. Servia-Rodriguez S, Wang L, Zhao J, et al. Privacy-Preserving Personal Model Training. In: 2018 IEEE/ACM Third International Conference on Internet-of-Things Design and Implementation (IoTDI), Orlando, FL, IEEE, NJ, USA, 2018, 153–64.
    1. Toussaint PA, Leiser F, Thiebes S, et al. Explainable artificial intelligence for omics data: a systematic mapping study. Brief Bioinform 2023;25(1):bbad453. - PMC - PubMed
    1. Goncalves A, Ray P, Soper B, et al. Generation and evaluation of synthetic patient data. BMC Med Res Methodol 2020;20(1):108. - PMC - PubMed
    1. Helmy M, Smith D, Selvarajoo K. Systems biology approaches integrated with artificial intelligence for optimized metabolic engineering. Metab Eng Commun 2020;11:e00149. - PMC - PubMed

Publication types