Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 8:14:1112914.
doi: 10.3389/fgene.2023.1112914. eCollection 2023.

Bayesian feature selection for radiomics using reliability metrics

Affiliations

Bayesian feature selection for radiomics using reliability metrics

Katherine Shoemaker et al. Front Genet. .

Abstract

Introduction: Imaging of tumors is a standard step in diagnosing cancer and making subsequent treatment decisions. The field of radiomics aims to develop imaging based biomarkers using methods rooted in artificial intelligence applied to medical imaging. However, a challenging aspect of developing predictive models for clinical use is that many quantitative features derived from image data exhibit instability or lack of reproducibility across different imaging systems or image-processing pipelines. Methods: To address this challenge, we propose a Bayesian sparse modeling approach for image classification based on radiomic features, where the inclusion of more reliable features is favored via a probit prior formulation. Results: We verify through simulation studies that this approach can improve feature selection and prediction given correct prior information. Finally, we illustrate the method with an application to the classification of head and neck cancer patients by human papillomavirus status, using as our prior information a reliability metric quantifying feature stability across different imaging systems.

Keywords: Bayesian modeling; classification; probit prior; quantitative imaging; radiomics; variable selection.

PubMed Disclaimer

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Figures

FIGURE 1
FIGURE 1
Overview of radiomics. Here, we use images from Aerts et al. (2014) to illustrate the initial step of tumor imaging and segmentation. Quantitative features can then be extracted that summarize the tumor boundary (shape features), the distribution of the pixel intensity values within the tumor (histogram features), and aspects of the spatial relations among pixels with differing intensities (texture features). More advanced features such as model- or tree-based summaries may also be computed. These features can then be used as input to approaches aimed at prediction of survival outcomes or classification methods such as mixture models, as illustrated generically in Step 3.
FIGURE 2
FIGURE 2
Schematic illustration of sparse Bayesian classification model. Feature-specific prior information is incorporated through the probit prior.
FIGURE 3
FIGURE 3
The posterior probability of inclusion for all variables from the proposed method, for one simulated data set. The first four variables are the discriminatory variables, and the remaining 100 are the noise variables.
FIGURE 4
FIGURE 4
Classification accuracy: ROC curves for the RVS model, the Bayesian model with a neutral prior, lasso logistic regression, and SVM on an example simulated test set, as described in Section 4.1.2.
FIGURE 5
FIGURE 5
Classification accuracy: ROC curves for the RVS model, the Bayesian model with a neutral prior, lasso logistic regression, and SVM on a test set with unequal group sizes, as described in Section 4.1.3.

References

    1. Aerts H. J., Velazquez E. R., Leijenaar R. T., Parmar C., Grossmann P., Cavalho S., et al. (2014). Decoding tumour phenotype by noninvasive imaging using a quantitative radiomics approach. Nat. Commun. 5, 4006–4008. 10.1038/ncomms5006 - DOI - PMC - PubMed
    1. Ahn H., Lee H., Kim S., Hyun S. (2019). Pre-treatment 18F-FDG PET-based radiomics predict survival in resected non-small cell lung cancer. Clin. Radiol. 74, 467–473. 10.1016/j.crad.2019.02.008 - DOI - PubMed
    1. Barbieri M. M., Berger J. O. (2004). Optimal predictive model selection. Ann. Statistics 32, 870–897. 10.1214/009053604000000238 - DOI
    1. Bhadra A., Datta J., Polson N. G., Willard B. (2017). The horseshoe+ estimator of ultra-sparse signals. Bayesian Anal. 12, 1105–1131. 10.1214/16-ba1028 - DOI
    1. Blei D. M., Kucukelbir A., McAuliffe J. D. (2017). Variational inference: A review for statisticians. J. Am. Stat. Assoc. 112, 859–877. 10.1080/01621459.2017.1285773 - DOI

LinkOut - more resources