Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review

Multivariate Statistical Methods for High-Dimensional Multiset Omics Data Analysis

In: Computational Biology [Internet]. Brisbane (AU): Codon Publications; 2019 Nov 21. Chapter 5.
Affiliations
Free Books & Documents
Review

Multivariate Statistical Methods for High-Dimensional Multiset Omics Data Analysis

Attila Csala et al.
Free Books & Documents

Excerpt

This chapter covers the state-of-the-art multivariate statistical methods designed for high-dimensional multiset omics data analysis. Recent biotechnological developments have enabled large-scale measurement of various biomolecular data, such as genotypic and phenotypic data, dispersed over various omics domains. An emergent research direction is to analyze these data sources using an integrated approach to better model and understand the underlying biology of complex disease conditions. However, comprehensive analysis techniques that can handle both the size and complexity, and at the same time can account for the hierarchical structure of such data, are lacking. An overview of some of the developments in multivariate techniques for high-dimensional omics data analysis, highlighting two well-known multivariate methods, canonical correlation analysis (CCA) and redundancy analysis (RDA), is provided in this chapter. Penalized versions of CCA are widespread in the omics data analysis field, and there is recent work on multiset penalized RDA that is applicable to multiset omics data. How these methods meet the statistical challenges that come with high-dimensional multiset omics data analysis and help to further our understanding of the human condition in terms of health and disease are presented. Additionally, the current challenges to be resolved in the field of omics data analysis are discussed.

PubMed Disclaimer

References

    1. Manzoni C, Kia DA, Vandrovcova J, Hardy J, Wood NW, Lewis PA, et al. Genome, transcriptome and proteome: The rise of omics data and their integration in biomedical sciences. Brief Bioinform. 2018 Mar 1;19(2):286–302. doi: 10.1093/bib/bbw114. - DOI - PMC - PubMed
    1. Berger B, Peng J, Singh M. Computational solutions for omics data. Nat Rev Genet. 2013 Apr 18;14(5):333–46. doi: 10.1038/nrg3433. - DOI - PMC - PubMed
    1. Langmead B, Nellore A. Cloud computing for genomic data analysis and collaboration. Nat Rev Genet. 2018 Jan 30;19(4):208–19. doi: 10.1038/nrg.2017.113. - DOI - PMC - PubMed
    1. Hasin Y, Seldin M, Lusis A. Multi-omics approaches to disease. Genome Biol. 2017 Dec 5;18(1):83. doi: 10.1186/s13059-017-1215-1. - DOI - PMC - PubMed
    1. Gallagher MD, Chen-Plotkin AS. The post-GWAS era: From association to function. Am J Hum Genet. 2018 May;102(5):717–30. doi: 10.1016/j.ajhg.2018.04.002. - DOI - PMC - PubMed

LinkOut - more resources