Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jul 21;15(1):26407.
doi: 10.1038/s41598-025-99164-9.

A deep ensemble framework for human essential gene prediction by integrating multi-omics data

Affiliations

A deep ensemble framework for human essential gene prediction by integrating multi-omics data

Xue Zhang et al. Sci Rep. .

Abstract

Essential genes are necessary for the survival or reproduction of a living organism. The prediction and analysis of gene essentiality can advance our understanding of basic life and human diseases, and further boost the development of new drugs. We propose a snapshot ensemble deep neural network method, DeEPsnap, to predict human essential genes. DeEPsnap integrates the features derived from DNA and protein sequence data with the features extracted or learned from four types of functional data: gene ontology, protein complex, protein domain, and protein-protein interaction networks. More than 200 features from these biological data are extracted/learned which are integrated together to train a series of cost-sensitive deep neural networks. The proposed snapshot mechanism enables us to train multiple models without increasing extra training effort and cost. The experimental results of 10-fold cross-validation show that DeEPsnap can accurately predict human gene essentiality with an average AUROC of 96.16%, AUPRC of 93.83%, and accuracy of 92.36%. The comparative experiments show that DeEPsnap outperforms several popular traditional machine learning models and deep learning models, while all those models show promising performance using the features we created for DeEPsnap. We demonstrated that the proposed method, DeEPsnap, is effective for predicting human essential genes.

Keywords: Deep learning; Essential gene prediction; Multi-omics data integration; Snapshot ensemble.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors have declared that no competing interests exist.

Figures

Fig. 1
Fig. 1
The Flowchart of DeEPsnap.
Fig. 2
Fig. 2
Learning rate cycles for the snapshot ensemble models.
Fig. 3
Fig. 3
The ROC curves of DeEPsnap.
Fig. 4
Fig. 4
Biological process and pathway enrichment of the essential genes. (a) Bar plot of the top enriched BP terms; (b) Bar plot of the top enriched pathways in Reactome.

Similar articles

References

    1. Wang, T. et al. Identification and characterization of essential genes in the human genome. Science350, 1096–1101. 10.1126/science.aac7041 (2015). - PMC - PubMed
    1. Hart, T. et al. High-resolution CRISPR screens reveal fitness genes and genotype-specific cancer liabilities. Cell163, 1515–1526. 10.1016/j.cell.2015.11.015 (2015). - PubMed
    1. Grazziotin, A., Vidal, N. & Venancio, T. Uncovering major genomic features of essential genes in Bacteria and a methanogenic Archaea. FEBS J.282, 3395–3411. 10.1111/febs.13350 (2015). - PMC - PubMed
    1. Liao, B. & Zhang, J. Mouse duplicate genes are as essential as singletons. Trends Genet.23, 378–381. 10.1016/j.tig.2007.05.006 (2007). - PubMed
    1. Morgens, D., Deans, R., Li, A. & Bassik, M. Systematic comparison of CRISPR/Cas9 and RNAi screens for essential genes. Nat. Biotechnol.34, 634–636. 10.1038/nbt.3567 (2016). - PMC - PubMed

LinkOut - more resources