Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Sep;48(9):1094-100.
doi: 10.1038/ng.3624. Epub 2016 Aug 1.

Tensor decomposition for multiple-tissue gene expression experiments

Affiliations

Tensor decomposition for multiple-tissue gene expression experiments

Victoria Hore et al. Nat Genet. 2016 Sep.

Abstract

Genome-wide association studies of gene expression traits and other cellular phenotypes have successfully identified links between genetic variation and biological processes. The majority of discoveries have uncovered cis-expression quantitative trait locus (eQTL) effects via mass univariate testing of SNPs against gene expression in single tissues. Here we present a Bayesian method for multiple-tissue experiments focusing on uncovering gene networks linked to genetic variation. Our method decomposes the 3D array (or tensor) of gene expression measurements into a set of latent components. We identify sparse gene networks that can then be tested for association against genetic variation across the genome. We apply our method to a data set of 845 individuals from the TwinsUK cohort with gene expression measured via RNA-seq analysis in adipose, lymphoblastoid cell lines (LCLs) and skin. We uncover several gene networks with a genetic basis and clear biological and statistical significance. Extensions of this approach will allow integration of different omics, environmental and phenotypic data sets.

PubMed Disclaimer

Figures

Figure 1
Figure 1. Graphical representation of the method.
The gene expression data tensor (top left) is decomposed into the product of an individual scores matrix, a tissue scores matrix and a gene loadings matrix (top right). Columns of the individual scores matrix are then used as phenotypes in a GWAS using SNP genotypes (bottom left) in order to uncover genetic variation correlated with the latent components.
Figure 2
Figure 2. MHC Class II regulation.
Figures a and b shows two components identifying a similar network in different tissues. (Top left) GWAS with the component’s individual scores vector as a phenotype. (Top right) Boxplot of individual scores stratified by genotypes at the lead GWAS SNP. Boxplots show the median, upper and lower quartiles, with whiskers extending to either 1.5 times the interquartile range (IQR), or to the most extreme data point if this is within 1.5 times IQR. (Bottom left) Gene loadings for the component. Only gene loadings with a PIP>0.5 are shown. (Bottom right) Tissue scores vector for the component shown as a barplot.
Figure 3
Figure 3. MHC Class I regulation.
(Top left) GWAS with the component’s individual scores vector as a phenotype. (Top right) Boxplot of individual scores stratified by genotypes at the lead GWAS SNP rs289749. (Bottom left) Gene loadings for the component. Only gene loadings with a PIP>0.5 are shown. (Bottom right) Tissue scores vector for the component shown as a barplot.
Figure 4
Figure 4. Histone RNA processing.
(Top left) GWAS with the component’s individual scores vector as a phenotype. (Top right) Boxplot of individual scores stratified by genotypes at the lead GWAS SNP rs6882616. (Bottom left) Gene loadings for the component. Only gene loadings with a PIP>0.5 are shown. (Bottom right) Tissue scores vector for the component shown as a barplot.
Figure 5
Figure 5. Type I Interferon Response.
(Top left) GWAS with the component’s individual scores vector as a phenotype. (Top right) Boxplot of individual scores stratified by genotypes at the lead GWAS SNP rs2401506. (Bottom left) Gene loadings for the component. Only gene loadings with a PIP>0.5 are shown. (Bottom right) Tissue scores vector for the component shown as a barplot.
Figure 6
Figure 6. Zinc finger gene network.
(Top left) GWAS with the component’s individual scores vector as a phenotype. (Top right) Boxplots of individual scores stratified by genotypes at the lead GWAS SNPs, rs17611866 and rs12630796. (Bottom left) Gene loadings for the component, with zinc finger genes on chr 19 highlighted in purple. Only gene loadings with a PIP>0.5 are shown. (Bottom right) Tissue scores vector for the component shown as a barplot.
Figure 7
Figure 7. Multi-omics data integration.
Graphical representation of a linked decomposition for several genomic assays. A matrix decomposition is applied to each data type. The matrix decompositions identify a different loadings matrix for each data type and a shared individual scores matrix.

References

    1. Stranger BE, et al. Population genomics of human gene expression. Nat Genet. 2007;39:1217–1224. - PMC - PubMed
    1. Degner JF, et al. DNase I sensitivity QTLs are a major determinant of human expression variation. Nature. 2012;482:390–394. - PMC - PubMed
    1. Kasowski M, et al. Extensive variation in chromatin states across humans. Science. 2013;342:750–752. - PMC - PubMed
    1. Battle A, et al. Genomic variation. Impact of regulatory variation from RNA to protein. Science. 2015;347:664–667. - PMC - PubMed
    1. Pai AA, Pritchard JK, Gilad Y. The genetic and mechanistic basis for variation in gene regulation. PLoS Genet. 2015;11:e1004857. - PMC - PubMed

Publication types

Substances