Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2015 Sep 7:6:276.
doi: 10.3389/fgene.2015.00276. eCollection 2015.

An introductory review of parallel independent component analysis (p-ICA) and a guide to applying p-ICA to genetic data and imaging phenotypes to identify disease-associated biological pathways and systems in common complex disorders

Affiliations
Review

An introductory review of parallel independent component analysis (p-ICA) and a guide to applying p-ICA to genetic data and imaging phenotypes to identify disease-associated biological pathways and systems in common complex disorders

Godfrey D Pearlson et al. Front Genet. .

Abstract

Complex inherited phenotypes, including those for many common medical and psychiatric diseases, are most likely underpinned by multiple genes contributing to interlocking molecular biological processes, along with environmental factors (Owen et al., 2010). Despite this, genotyping strategies for complex, inherited, disease-related phenotypes mostly employ univariate analyses, e.g., genome wide association. Such procedures most often identify isolated risk-related SNPs or loci, not the underlying biological pathways necessary to help guide the development of novel treatment approaches. This article focuses on the multivariate analysis strategy of parallel (i.e., simultaneous combination of SNP and neuroimage information) independent component analysis (p-ICA), which typically yields large clusters of functionally related SNPs statistically correlated with phenotype components, whose overall molecular biologic relevance is inferred subsequently using annotation software suites. Because this is a novel approach, whose details are relatively new to the field we summarize its underlying principles and address conceptual questions regarding interpretation of resulting data and provide practical illustrations of the method.

Keywords: common disease; common variant; genetic risk; imaging genetics; multivariate; parallel independent component analysis.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
Univariate approaches are focused on single points of relation whereas multivariate approaches like parallel ICA (p-ICA) focus on links between patterns (e.g., weighted combinations of brain regions and weighed combinations of genetic variables.
FIGURE 2
FIGURE 2
The benefit of a joint analysis is we can capitalize on the joint distribution of (in this case) the imaging and genetic data, something that can provide a better ability to discriminate health and disease. When we have two data sets, each with numerous variables, we could compute huge numbers of cross-correlations (adjusting for requisite multiple comparisons). Here, p-ICA displays a definite advantage, providing both a means to identify relationships among two very large data sets, while simultaneously identifying the most relevant variables representing this information, (i.e., simultaneously performing data reduction).
FIGURE 3
FIGURE 3
Weighted combinations of brain regions are linked to weighted combinations of genetic variables which can then be tested for associations with variables of interest (e.g., disease status, symptoms). Components extracted by p-ICA are a linear weighted combination of all variables. Each variable’s weight indicates its contribution to the component, and helps to interpret it. For instance, the genetic component, perhaps formed from thousands of SNP markers, is mainly contributed to by top-weighted markers. The remainder, with much lower weights do not markedly affect the component loading.
FIGURE 4
FIGURE 4
Weighted combinations of brain regions are linked to weighted combinations of genetic variables which can then be tested for associations with variables of interest (e.g., disease, symptoms).

Similar articles

Cited by

References

    1. Akaike H. (1974). A new look at the statistical model identification. IEEE Trans. Automat. Contr. 19 716–723. 10.1109/TAC.1974.1100705 - DOI
    1. Allen E., Damaraju E., Plis S. M., Erhardt E., Eichele T., Calhoun V. D. (2014). Tracking whole-brain connectivity dynamics in the resting state. Cereb. Cortex 24 663–676. 10.1093/cercor/bhs352 - DOI - PMC - PubMed
    1. American Psychiatric A., Force D. S. M. T. (2013). Diagnostic and Statistical Manual of Mental Disorders: DSM-5. Washington, DC: American Psychiatric Association.
    1. Anderson C. A., Pettersson F. H., Clarke G. M., Cardon L. R., Morris A. P., Zondervan K. T. (2010). Data quality control in genetic case-control association studies. Nat. Protoc. 5 1564–1573. 10.1038/nprot.2010.116 - DOI - PMC - PubMed
    1. Biswal B. B. (2012). Resting state fMRI: a personal history. Neuroimage 62 938–944. 10.1016/j.neuroimage.2012.01.090 - DOI - PubMed