Multi-Task Sparse Canonical Correlation Analysis with Application to Multi-Modal Brain Imaging Genetics

Lei Du, Kefei Liu, Xiaohui Yao, Shannon L Risacher, Junwei Han, Andrew J Saykin, Lei Guo, Li Shen

PMID: 31634139
PMCID: PMC7156329
DOI: 10.1109/TCBB.2019.2947428

Multi-Task Sparse Canonical Correlation Analysis with Application to Multi-Modal Brain Imaging Genetics

Lei Du et al. IEEE/ACM Trans Comput Biol Bioinform. 2021 Jan-Feb.

. 2021 Jan-Feb;18(1):227-239.

doi: 10.1109/TCBB.2019.2947428. Epub 2021 Feb 3.

Authors

Lei Du, Kefei Liu, Xiaohui Yao, Shannon L Risacher, Junwei Han, Andrew J Saykin, Lei Guo, Li Shen

PMID: 31634139
PMCID: PMC7156329
DOI: 10.1109/TCBB.2019.2947428

Abstract

Brain imaging genetics studies the genetic basis of brain structures and functionalities via integrating genotypic data such as single nucleotide polymorphisms (SNPs) and imaging quantitative traits (QTs). In this area, both multi-task learning (MTL) and sparse canonical correlation analysis (SCCA) methods are widely used since they are superior to those independent and pairwise univariate analysis. MTL methods generally incorporate a few of QTs and could not select features from multiple QTs; while SCCA methods typically employ one modality of QTs to study its association with SNPs. Both MTL and SCCA are computational expensive as the number of SNPs increases. In this paper, we propose a novel multi-task SCCA (MTSCCA) method to identify bi-multivariate associations between SNPs and multi-modal imaging QTs. MTSCCA could make use of the complementary information carried by different imaging modalities. MTSCCA enforces sparsity at the group level via the G_2,1-norm, and jointly selects features across multiple tasks for SNPs and QTs via the l_2,1-norm. A fast optimization algorithm is proposed using the grouping information of SNPs. Compared with conventional SCCA methods, MTSCCA obtains better correlation coefficients and canonical weights patterns. In addition, MTSCCA runs very fast and easy-to-implement, indicating its potential power in genome-wide brain-wide imaging genetics.

PubMed Disclaimer

Figures

**Fig. 1.**
Illustration of the pairwise correlation coefficients and LD values (r² ≥ 0.2) of SNPs from Chromosome 19 of an ADNI database. (1) The three sub figures above show the correlation coefficients r among SNPs with number of 1,000, and 5,000, and 13,000. (2) The three sub figures below are the corresponding values of LD. All figures show that SNPs clearly form groups and the block diagonal structure always exists as the number of SNPs increases.

**Fig. 2.**
Illustration of the simplified covariance matrix X^⊤X, where X_{g_k} and X_{g_k+1} are two LD blocks, and $X_{g_{k}}^{⊤} X_{g_{k}}$ is abbreviated as (X^⊤X)_{g_k}. Since the correlation between the two blocks are very low ( $X_{g_{k}}^{⊤} X_{g_{k + 1}} \approx 0$ and $X_{g_{k + 1}}^{⊤} X_{g_{k}} \approx 0$ ), their covariance can be ignored.

**Fig. 3.**
Canonical weights u (mean value) estimated on synthetic data. The first row is the ground truth, and each remaining row corresponds to an SCCA method: (1) Two-view SCCA, (2) mSCCA (Multi-view SCCA), (3) MTSCCA (Multi-task SCCA). In each subfigure, the horizontal axis represents the indices of each u, and the vertical axis represents the estimated weight value.

**Fig. 4.**
Canonical weights V (mean value) estimated on synthetic data. The first row is the ground truth, and each remaining row corresponds to an SCCA method: (1) Two-view SCCA, (2) mSCCA (Multi-view SCCA), (3) MTSCCA (Multi-task SCCA). In each subfigure, the horizontal axis represents the indices of v_j (j = 1, 2), and the vertical axis represents the estimated weight value.

**Fig. 5.**
Performance comparison: The mean and standard deviation (SD) of the canonical correlation coefficients (CCCs) obtained from 5-fold cross-validation trials are plotted, where each error bar indicates ±0.5SD. The subtitle SNPs-AV45 means the CCCs are calculated between the SNPs data and the AV45-PET data.

**Fig. 6.**
Comparison of canonical weights in terms of each imaging modality across five trials. Each row corresponds to a SCCA method: (1) Two-view SCCA; (2) mSCCA; (3) MTSCCA. Within each panel, there are three rows corresponding to three type of imaging QTs, i.e. AV45, FDG and VBM.

See this image and copyright information in PMC

References

1. Saykin AJ, Shen L, Yao X, Kim S, Nho K, and et al., “Genetic studies of quantitative MCI and AD phenotypes in ADNI: Progress, opportunities, and plans,” Alzheimer’s & Dementia, vol. 11, no. 7, pp. 792–814, 2015. - PMC - PubMed
1. Shen L, Thompson PM, Potkin SG, Bertram L, Farrer LA, and et al., “Genetic analysis of quantitative phenotypes in ad and mci: imaging, cognition and biomarkers,” Brain Imaging and Behavior, vol. 8, no. 2, pp. 183–207, 2014. - PMC - PubMed
1. Mueller SG, Weiner MW, Thal LJ, Petersen RC, Jack C, Jagust W, Trojanowski JQ, Toga AW, and Beckett L, “The alzheimer’s disease neuroimaging initiative,” Neuroimaging Clinics of North America, vol. 15, no. 4, pp. 869–877, 2005. - PMC - PubMed
1. Lee S, Zhu J, and Xing EP, “Adaptive multi-task lasso: with application to eqtl detection,” in NIPS, 2010, pp. 1306–1314.
1. Wang H, Nie F, Huang H, Kim S, Nho K, Risacher SL, Saykin AJ, and Shen L, “Identifying quantitative trait loci via group-sparse multitask regression and feature selection: an imaging genetics study of the ADNI cohort,” Bioinformatics, vol. 28, no. 2, pp. 229–237, 2012. - PMC - PubMed

Publication types

Actions
Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Multi-Task Sparse Canonical Correlation Analysis with Application to Multi-Modal Brain Imaging Genetics

Multi-Task Sparse Canonical Correlation Analysis with Application to Multi-Modal Brain Imaging Genetics

Authors

Abstract

Figures

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials