Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Apr 3;39(4):btad113.
doi: 10.1093/bioinformatics/btad113.

Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment

Affiliations

Interpretable meta-learning of multi-omics data for survival analysis and pathway enrichment

Hyun Jae Cho et al. Bioinformatics. .

Abstract

Motivation: Despite the success of recent machine learning algorithms' applications to survival analysis, their black-box nature hinders interpretability, which is arguably the most important aspect. Similarly, multi-omics data integration for survival analysis is often constrained by the underlying relationships and correlations that are rarely well understood. The goal of this work is to alleviate the interpretability problem in machine learning approaches for survival analysis and also demonstrate how multi-omics data integration improves survival analysis and pathway enrichment. We use meta-learning, a machine-learning algorithm that is trained on a variety of related datasets and allows quick adaptations to new tasks, to perform survival analysis and pathway enrichment on pan-cancer datasets. In recent machine learning research, meta-learning has been effectively used for knowledge transfer among multiple related datasets.

Results: We use meta-learning with Cox hazard loss to show that the integration of TCGA pan-cancer data increases the performance of survival analysis. We also apply advanced model interpretability method called DeepLIFT (Deep Learning Important FeaTures) to show different sets of enriched pathways for multi-omics and transcriptomics data. Our results show that multi-omics cancer survival analysis enhances performance compared with using transcriptomics or clinical data alone. Additionally, we show a correlation between variable importance assignment from DeepLIFT and gene coenrichment, suggesting that genes with higher and similar contribution scores are more likely to be enriched together in the same enrichment sets.

Availability and implementation: https://github.com/berkuva/TCGA-omics-integration.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Our meta-learning model’s parameters θ are trained over a distribution of source cancer types, and the corresponding loss L are generated. θ are optimized by gradient descent, or back-propagation of Ltaski
Figure 2.
Figure 2.
The process of studying the correlation between genes with similarly high DeepLIFT contribution scores and their likelihood of being enriched together in the same STRING enrichment sets

Similar articles

Cited by

References

    1. Alhmoud JF, Woolley JF, Al Moustafa A-E. et al. DNA damage/repair management in cancers. Cancers 2020;12:1050. - PMC - PubMed
    1. Alipanahi B, Delong A, Weirauch MT. et al. Predicting the sequence specificities of DNA- and RNA-binding proteins by deep learning. Nat Biotechnol 2015;33:831–8. - PubMed
    1. Bai Y, Wang G, Fu W. et al. Circulating essential metals and lung cancer: risk assessment and potential molecular effects. Environ Int 2019;127:685–93. - PubMed
    1. Chalise P, Fridley BL.. Integrative clustering of multi-level ‘omic data based on non-negative matrix factorization algorithm. PLoS One 2017;12:e0176278. - PMC - PubMed
    1. Chang YS, Tu SJ, Chen YC et al.Mutation profile of non-small cell lung cancer revealed by next generation sequencing. Respir Res 2021;22(1):3. 10.1186/s12931-020-01608-5. - DOI - PMC - PubMed