Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Aug 15:166:66-73.
doi: 10.1016/j.ymeth.2019.03.004. Epub 2019 Mar 7.

Unsupervised classification of multi-omics data during cardiac remodeling using deep learning

Affiliations

Unsupervised classification of multi-omics data during cardiac remodeling using deep learning

Neo Christopher Chung et al. Methods. .

Abstract

Integration of multi-omics in cardiovascular diseases (CVDs) presents high potentials for translational discoveries. By analyzing abundance levels of heterogeneous molecules over time, we may uncover biological interactions and networks that were previously unidentifiable. However, to effectively perform integrative analysis of temporal multi-omics, computational methods must account for the heterogeneity and complexity in the data. To this end, we performed unsupervised classification of proteins and metabolites in mice during cardiac remodeling using two innovative deep learning (DL) approaches. First, long short-term memory (LSTM)-based variational autoencoder (LSTM-VAE) was trained on time-series numeric data. The low-dimensional embeddings extracted from LSTM-VAE were then used for clustering. Second, deep convolutional embedded clustering (DCEC) was applied on images of temporal trends. Instead of a two-step procedure, DCEC performes a joint optimization for image reconstruction and cluster assignment. Additionally, we performed K-means clustering, partitioning around medoids (PAM), and hierarchical clustering. Pathway enrichment analysis using the Reactome knowledgebase demonstrated that DL methods yielded higher numbers of significant biological pathways than conventional clustering algorithms. In particular, DCEC resulted in the highest number of enriched pathways, suggesting the strength of its unified framework based on visual similarities. Overall, unsupervised DL is shown to be a promising analytical approach for integrative analysis of temporal multi-omics.

Keywords: Cardiovascular; Clustering; Integrative analysis; Multi-omics; Time-series; Unsupervised deep learning.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.. Integrative Clustering Workflow for Temporal Multi-Omics Data.
Missing values in proteomics and metabolomics datasets are completed by fully conditional specification (FCS)-based multiple imputations. Cubic splines were employed to smooth the temporal trends. Images were also generated based on the temporal trends over 14 days. As a baseline, K-means clustering, hierarchical clustering (HC), partitioning around medoids (PAM)) were employed. In contrast, two deep neural network architectures for unsupervised learning were also implemented. First, long short-term memory (LSTM)-based variational autoencoder (LSTM-VAE) was used to extract low-dimensional embeddings from the time-series numeric data and K-means clustering was employed on the embeddings. Second, deep convolutional embedded clustering (DCEC) was employed to perform clustering of molecules based on image data. The results obtained from the clustering methods were fed to Reactome knowledgebase for pathway enrichments analyses.
Figure 2:
Figure 2:. 3D Visualization of 6 clusters in the Embedded Feature Space of LSTM-VAE.
Three axes represent the three dimensions of the latent space obtained from LSTM-VAE. Different colors denote six clusters identified by K-means clustering. The ellipsoids represent 80% of the concentration for each cluster.
Figure 3:
Figure 3:. t-SNE Visualization of DCEC Embeddings.
The t-SNE visualization of the embeddings, optimized for both reconstruction and cluster losses, clearly shows the separation of samples into six clusters. Each cluster obtained by DCEC is represented by a different color.
Figure 4.
Figure 4.. Cluster Centers for Different Clustering Methods.
Cluster centers for the each of the 6 clusters obtained by K-means, HC, PAM, LSTM-VAE and DCECE methods are plotted. Based on the temporal similarities of cluster centers across methods, we labeled the clusters as increase, decrease, increase-decrease, decrease-increase, late increase and late decrease. However, there are some obvious differences, which can be attributed to the differences in learning algorithms and input data types (e.g., time-series numeric or image).
Figure 5.
Figure 5.. Shared and Unique Molecules Across Different Clustering Methods in Merged Clusters.
In each merged cluster, majority of the molecules are shared by at least three clustering methods. All the methods have some unique molecules in each cluster due to differences in learning algorithm and/or input data type. Hierarchal clustering generally has a high concentration of unique molecules while LSTM-VAE has the least.

Similar articles

Cited by

  • Guidelines for mechanistic modeling and analysis in cardiovascular research.
    Colebank MJ, Oomen PA, Witzenburg CM, Grosberg A, Beard DA, Husmeier D, Olufsen MS, Chesler NC. Colebank MJ, et al. Am J Physiol Heart Circ Physiol. 2024 Aug 1;327(2):H473-H503. doi: 10.1152/ajpheart.00766.2023. Epub 2024 Jun 21. Am J Physiol Heart Circ Physiol. 2024. PMID: 38904851 Free PMC article. Review.
  • Identifying the Socioeconomic, Demographic, and Political Determinants of Social Mobility and Their Effects on COVID-19 Cases and Deaths: Evidence From US Counties.
    Jalali N, Tran NK, Sen A, Morita PP. Jalali N, et al. JMIR Infodemiology. 2022 Mar 3;2(1):e31813. doi: 10.2196/31813. eCollection 2022 Jan-Jun. JMIR Infodemiology. 2022. PMID: 35287305 Free PMC article.
  • Discovery of drug-omics associations in type 2 diabetes with generative deep-learning models.
    Allesøe RL, Lundgaard AT, Hernández Medina R, Aguayo-Orozco A, Johansen J, Nissen JN, Brorsson C, Mazzoni G, Niu L, Biel JH, Leal Rodríguez C, Brasas V, Webel H, Benros ME, Pedersen AG, Chmura PJ, Jacobsen UP, Mari A, Koivula R, Mahajan A, Vinuela A, Tajes JF, Sharma S, Haid M, Hong MG, Musholt PB, De Masi F, Vogt J, Pedersen HK, Gudmundsdottir V, Jones A, Kennedy G, Bell J, Thomas EL, Frost G, Thomsen H, Hansen E, Hansen TH, Vestergaard H, Muilwijk M, Blom MT, 't Hart LM, Pattou F, Raverdy V, Brage S, Kokkola T, Heggie A, McEvoy D, Mourby M, Kaye J, Hattersley A, McDonald T, Ridderstråle M, Walker M, Forgie I, Giordano GN, Pavo I, Ruetten H, Pedersen O, Hansen T, Dermitzakis E, Franks PW, Schwenk JM, Adamski J, McCarthy MI, Pearson E, Banasik K, Rasmussen S, Brunak S; IMI DIRECT Consortium. Allesøe RL, et al. Nat Biotechnol. 2023 Mar;41(3):399-408. doi: 10.1038/s41587-022-01520-x. Epub 2023 Jan 2. Nat Biotechnol. 2023. PMID: 36593394 Free PMC article.
  • Cardiovascular informatics: building a bridge to data harmony.
    Caufield JH, Sigdel D, Fu J, Choi H, Guevara-Gonzalez V, Wang D, Ping P. Caufield JH, et al. Cardiovasc Res. 2022 Feb 21;118(3):732-745. doi: 10.1093/cvr/cvab067. Cardiovasc Res. 2022. PMID: 33751044 Free PMC article. Review.
  • Depression and suicide risk prediction models using blood-derived multi-omics data.
    Bhak Y, Jeong HO, Cho YS, Jeon S, Cho J, Gim JA, Jeon Y, Blazyte A, Park SG, Kim HM, Shin ES, Paik JW, Lee HW, Kang W, Kim A, Kim Y, Kim BC, Ham BJ, Bhak J, Lee S. Bhak Y, et al. Transl Psychiatry. 2019 Oct 17;9(1):262. doi: 10.1038/s41398-019-0595-2. Transl Psychiatry. 2019. PMID: 31624227 Free PMC article.

References

    1. Benjamin EJ; Virani SS; Callaway CW; Chamberlain AM; Chang AR; Cheng S; Chiuve SE; Cushman M; Delling FN; Deo R Heart disease and stroke statistics—2018 update: a report from the American Heart Association. Circulation 2018, 137, e67–e492. - PubMed
    1. Fernandes M; Patel A; Husi H C/VDdb: A multi-omics expression profiling database for a knowledge-driven approach in cardiovascular disease (CVD). PloS one 2018, 13, e0207371. - PMC - PubMed
    1. Lau E; Wu JC Omics, Big Data, and Precision Medicine in Cardiovascular Sciences; Am Heart Assoc, 2018; ISBN 0009-7330. - PMC - PubMed
    1. Perez-Riverol Y; Bai M; da Veiga Leprevost F; Squizzato S; Park YM; Haug K; Carroll AJ; Spalding D; Paschall J; Wang M Discovering and linking public omics data sets using the Omics Discovery Index. Nature biotechnology 2017, 35, 406. - PMC - PubMed
    1. Lau E; Cao Q; Lam MP; Wang J; Ng DC; Bleakley BJ; Lee JM; Liem DA; Wang D; Hermjakob H Integrated omics dissection of proteome dynamics during cardiac remodeling. Nature communications 2018, 9, 120. - PMC - PubMed

Publication types

MeSH terms