Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Apr 15;36(8):2365-2374.
doi: 10.1093/bioinformatics/btaa004.

A U-statistics for integrative analysis of multilayer omics data

Affiliations

A U-statistics for integrative analysis of multilayer omics data

Xiaqiong Wang et al. Bioinformatics. .

Abstract

Motivation: The emerging multilayer omics data provide unprecedented opportunities for detecting biomarkers that are associated with complex diseases at various molecular levels. However, the high-dimensionality of multiomics data and the complex disease etiologies have brought tremendous analytical challenges.

Results: We developed a U-statistics-based non-parametric framework for the association analysis of multilayer omics data, where consensus and permutation-based weighting schemes are developed to account for various types of disease models. Our proposed method is flexible for analyzing different types of outcomes as it makes no assumptions about their distributions. Moreover, it explicitly accounts for various types of underlying disease models through weighting schemes and thus provides robust performance against them. Through extensive simulations and the application to dataset obtained from the Alzheimer's Disease Neuroimaging Initiatives, we demonstrated that our method outperformed the commonly used kernel regression-based methods.

Availability and implementation: The R-package is available at https://github.com/YaluWen/Uomic.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
Power under different correlations between genetic and methylation data (N = 500)
Fig. 2.
Fig. 2.
Power under different distributions of the outcomes (N =500 and ρ=0.1). Effect size scenarios: S1: G+M=0.04 for normal and G+M=0.08 for the other distributions. S2: G+M=0.05 for normal and G+M=0.1 for the other distributions. S3: G+M=0.1 for normal and G+M=0.2 for the other distributions. G denotes the genetic effects (i.e. βs) and M denotes the methylation effects (i.e. βm). G =0 (M =0) when there is no genetic (methylation) effects. G = M when both genetic and methylation have the same effects
Fig. 3.
Fig. 3.
QQ-plots for association analyses of four cognitive test scores
Fig. 4.
Fig. 4.
Manhattan plots for association analyses of four cognitive test scores

References

    1. Ashley E.A. (2015) The precision medicine initiative: a new national effort. JAMA, 313, 2119–2120. - PubMed
    1. Bersanelli M. et al. (2016) Methods for the integration of multi-omics data: mathematical aspects. BMC Bioinformatics, 17 (Suppl. 2), 15. - PMC - PubMed
    1. Burggren A.C. et al. (2017) Hippocampal thinning linked to longer TOMM40 poly-t variant lengths in the absence of the APOE epsilon4 variant. Alzheimers Dement., 13, 739–748. - PMC - PubMed
    1. Chalise P. et al. (2016) InterSIM: simulation tool for multiple integrative ‘omic datasets’. Comput. Methods Programs Biomed., 128, 69–74. - PMC - PubMed
    1. Chiba-Falek O. et al. (2018) The effects of the TOMM40 poly-t alleles on Alzheimer’s disease phenotypes. Alzheimers Dement., 14, 692–698. - PMC - PubMed

Publication types