Joint analysis of expression levels and histological images identifies genes associated with tissue morphology
- PMID: 33707455
- PMCID: PMC7952575
- DOI: 10.1038/s41467-021-21727-x
Joint analysis of expression levels and histological images identifies genes associated with tissue morphology
Abstract
Histopathological images are used to characterize complex phenotypes such as tumor stage. Our goal is to associate features of stained tissue images with high-dimensional genomic markers. We use convolutional autoencoders and sparse canonical correlation analysis (CCA) on paired histological images and bulk gene expression to identify subsets of genes whose expression levels in a tissue sample correlate with subsets of morphological features from the corresponding sample image. We apply our approach, ImageCCA, to two TCGA data sets, and find gene sets associated with the structure of the extracellular matrix and cell wall infrastructure, implicating uncharacterized genes in extracellular processes. We find sets of genes associated with specific cell types, including neuronal cells and cells of the immune system. We apply ImageCCA to the GTEx v6 data, and find image features that capture population variation in thyroid and in colon tissues associated with genetic variants (image morphology QTLs, or imQTLs), suggesting that genetic variation regulates population variation in tissue morphological traits.
Conflict of interest statement
B.E.E. is on the SAB of Freenome, Celsius Therapeutics, and Creyon Bio, and is a consultant for Genomics plc and Freenome. The remaining authors have no competing interests to declare.
Figures






References
-
- Demir, C. & Yener, B. Automated Cancer Diagnosis Based on Histopathological Images: A Systematic Survey. Technical Report (Rensselaer Polytechnic Institute, 2005).
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Medical
Miscellaneous