Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2021 Jan 2:1141:144-162.
doi: 10.1016/j.aca.2020.10.038. Epub 2020 Oct 22.

Multi-omics integration in biomedical research - A metabolomics-centric review

Affiliations
Review

Multi-omics integration in biomedical research - A metabolomics-centric review

Maria A Wörheide et al. Anal Chim Acta. .

Abstract

Recent advances in high-throughput technologies have enabled the profiling of multiple layers of a biological system, including DNA sequence data (genomics), RNA expression levels (transcriptomics), and metabolite levels (metabolomics). This has led to the generation of vast amounts of biological data that can be integrated in so-called multi-omics studies to examine the complex molecular underpinnings of health and disease. Integrative analysis of such datasets is not straightforward and is particularly complicated by the high dimensionality and heterogeneity of the data and by the lack of universal analysis protocols. Previous reviews have discussed various strategies to address the challenges of data integration, elaborating on specific aspects, such as network inference or feature selection techniques. Thereby, the main focus has been on the integration of two omics layers in their relation to a phenotype of interest. In this review we provide an overview over a typical multi-omics workflow, focusing on integration methods that have the potential to combine metabolomics data with two or more omics. We discuss multiple integration concepts including data-driven, knowledge-based, simultaneous and step-wise approaches. We highlight the application of these methods in recent multi-omics studies, including large-scale integration efforts aiming at a global depiction of the complex relationships within and between different biological layers without focusing on a particular phenotype.

Keywords: Data integration; Lipidomics; Metabolomics; Multi-omics; Systems biology.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

Figure 1.
Figure 1.. Multi-omics workflow.
A typical multi-omics analysis can generally be broken down into 4 steps. (i) Data generation. Study design, sample preparation and subsequent data acquisition through high-throughput analytical platforms lead to different data scenarios. (ii) Data preprocessing and dimensionality reduction. Raw data collected on different omics layers is preprocessed appropriately and dimensionality reduction can be applied to reduce the number of variables (measured biological entities). (iii) Data integration. Data from different omics layers are analyzed and integrated using data-driven, knowledge-based or hybrid integration approaches. The choice of method depends on the input data and research question of interest. (iv) Data interpretation. Post-integration visualization and analysis of the integration results (e.g., statistical model or network) can identify novel biomarker candidates, generate testable hypothesis or reveal meaningful biological relationships.
Figure 2.
Figure 2.. Multi-omics integration through composite networks.
A. Different layers of a biological system that can be profiled using high-throughput technologies and are frequently integrated in multi-omics studies. B. Simultaneous integration. Correlation structures within and across omics datasets are analyzed using statistical methods. C. QTL-based integration. Using the genome as an anchor, quantitative trait loci (QTLs) identified in genome wide association studies (GWASs) are overlaid to establish links between different omics layers. D Knowledge integration. External information from metabolic databases or scientific literature is used to establish relationships between biological entities. E. Composite networks. By merging the networks inferred in (B-D) on common entities, comprehensive multi-omics catalogues can be constructed. These heterogenous networks can be mined in post-integration analysis using established graph algorithms.

References

    1. Lotta LA, Scott RA, Sharp SJ, Burgess S, Luan J, Tillin T, Schmidt AF, Imamura F, Stewart ID, Perry JRB, Marney L, Koulman A, Karoly ED, Forouhi NG, Sjögren RJO, Näslund E, Zierath JR, Krook A, Savage DB, Griffin JL, Chaturvedi N, Hingorani AD, Khaw KT, Barroso I, McCarthy MI, O’Rahilly S, Wareham NJ, Langenberg C, Genetic Predisposition to an Impaired Metabolism of the Branched-Chain Amino Acids and Risk of Type 2 Diabetes: A Mendelian Randomisation Analysis, PLoS Med. 13 (2016) 1–22. 10.1371/journal.pmed.1002179. - DOI - PMC - PubMed
    1. Tynkkynen J, Chouraki V, van der Lee SJ, Hernesniemi J, Yang Q, Li S, Beiser A, Larson MG, Sääksjärvi K, Shipley MJ, Singh-Manoux A, Gerszten RE, Wang TJ, Havulinna AS, Würtz P, Fischer K, Demirkan A, Ikram MA, Amin N, Lehtimäki T, Kähönen M, Perola M, Metspalu A, Kangas AJ, Soininen P, Ala-Korpela M, Vasan RS, Kivimäki M, van Duijn CM, Seshadri S, Salomaa V, Association of branched-chain amino acids and other circulating metabolites with risk of incident dementia and Alzheimer’s disease: A prospective study in eight cohorts, Alzheimer’s Dement. 14 (2018) 723–733. 10.1016/j.jalz.2018.01.003. - DOI - PMC - PubMed
    1. Civelek M, Lusis AJ, Systems genetics approaches to understand complex traits, Nat. Rev. Genet 15 (2014) 34–48. 10.1038/nrg3575. - DOI - PMC - PubMed
    1. Hasin Y, Seldin M, Lusis A, Multi-omics approaches to disease, Genome Biol. 18 (2017) 83 10.1186/s13059-017-1215-1. - DOI - PMC - PubMed
    1. Smith GD, Ebrahim S, “Mendelian randomization”: Can genetic epidemiology contribute to understanding environmental determinants of disease?, Int. J. Epidemiol 32 (2003) 1–22. 10.1093/ije/dyg070. - DOI - PubMed

LinkOut - more resources