Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Aug 1;39(8):btad513.
doi: 10.1093/bioinformatics/btad513.

dRFEtools: dynamic recursive feature elimination for omics

Affiliations

dRFEtools: dynamic recursive feature elimination for omics

Kynon J M Benjamin et al. Bioinformatics. .

Abstract

Motivation: Advances in technology have generated larger omics datasets with potential applications for machine learning. In many datasets, however, cost and limited sample availability result in an excessively higher number of features as compared to observations. Moreover, biological processes are associated with networks of core and peripheral genes, while traditional feature selection approaches capture only core genes.

Results: To overcome these limitations, we present dRFEtools that implements dynamic recursive feature elimination (RFE), reducing computational time with high accuracy compared to standard RFE, expanding dynamic RFE to regression algorithms, and outputting the subsets of features that hold predictive power with and without peripheral features. dRFEtools integrates with scikit-learn (the popular Python machine learning platform) and thus provides new opportunities for dynamic RFE in large-scale omics data while enhancing its interpretability.

Availability and implementation: dRFEtools is freely available on PyPI at https://pypi.org/project/drfetools/ or on GitHub https://github.com/LieberInstitute/dRFEtools, implemented in Python 3, and supported on Linux, Windows, and Mac OS.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflicts of interest.

Figures

Figure 1.
Figure 1.
Schematic of dynamic recursive feature elimination for dRFEtools. (A) Graphical representation of core and peripheral features (Boyle et al., 2017). (B) Flowchart showing recursive elimination process, where scikit-learn model can be either classification or regression. We ranked the dropped features and saved them for downstream analysis. (C) Feature iterator code used to generate dynamic elimination. (D) Flowchart showing the elimination steps using the developmental or out-of-bag (OOB) set. (E) Example of LOWESS fitting on dRFEtools model performance multiple classification using simulated data using area under the receiver operating characteristic curve (ROC AUC).

References

    1. Boyle EA, , LiYI, , Pritchard JK.. An expanded view of complex traits: From polygenic to omnigenic. Cell 2017;169:1177–86. - PMC - PubMed
    1. Collado-Torres L, Burke EE, Peterson A. et al.; BrainSeq Consortium. Regional heterogeneity in gene expression, regulation, and coherence in the frontal cortex and hippocampus across development and schizophrenia. Neuron 2019;103:203–16.e8. - PMC - PubMed
    1. Jaffe AE, Straub RE, Shin JH. et al.; BrainSeq Consortium. Developmental and genetic regulation of the human cortex transcriptome illuminate schizophrenia pathogenesis. Nat. Neurosci. 2018;21:1117–25. - PMC - PubMed
    1. Matthews H, , HanisonJ, , Nirmalan N.. Omics"-informed drug and biomarker discovery: Opportunities, challenges and future perspectives. Proteomes 2016;4. - PMC - PubMed
    1. Nguyen H-N, Ohn S-Y. dRFE: dynamic recursive feature elimination for gene identification based on random forest. In: International conference on neural information processing. 2006; 4234.

Publication types