Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Apr 3;19(4):e0300170.
doi: 10.1371/journal.pone.0300170. eCollection 2024.

Machine learning for differentiating lung squamous cell cancer from adenocarcinoma using Clinical-Metabolic characteristics and 18F-FDG PET/CT radiomics

Affiliations

Machine learning for differentiating lung squamous cell cancer from adenocarcinoma using Clinical-Metabolic characteristics and 18F-FDG PET/CT radiomics

Yalin Zhang et al. PLoS One. .

Abstract

Noninvasive differentiation between the squamous cell carcinoma (SCC) and adenocarcinoma (ADC) subtypes of non-small cell lung cancer (NSCLC) could benefit patients who are unsuitable for invasive diagnostic procedures. Therefore, this study evaluates the predictive performance of a PET/CT-based radiomics model. It aims to distinguish between the histological subtypes of lung adenocarcinoma and squamous cell carcinoma, employing four different machine learning techniques. A total of 255 Non-Small Cell Lung Cancer (NSCLC) patients were retrospectively analyzed and randomly divided into the training (n = 177) and validation (n = 78) sets, respectively. Radiomics features were extracted, and the Least Absolute Shrinkage and Selection Operator (LASSO) method was employed for feature selection. Subsequently, models were constructed using four distinct machine learning techniques, with the top-performing algorithm determined by evaluating metrics such as accuracy, sensitivity, specificity, and the area under the curve (AUC). The efficacy of the various models was appraised and compared using the DeLong test. A nomogram was developed based on the model with the best predictive efficiency and clinical utility, and it was validated using calibration curves. Results indicated that the logistic regression classifier had better predictive power in the validation cohort of the radiomic model. The combined model (AUC 0.870) exhibited superior predictive power compared to the clinical model (AUC 0.848) and the radiomics model (AUC 0.774). In this study, we discovered that the combined model, refined by the logistic regression classifier, exhibited the most effective performance in classifying the histological subtypes of NSCLC.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

Fig 1
Fig 1. The workflow of this study.
Fig 2
Fig 2. Number and ratio of handcrafted features.
A show CT features, B show PET features.
Fig 3
Fig 3. Radiomic features selected using a LASSO regression model for subgroups.
A-C The coefficients of each feature in the most predictive feature subset. The abscissa is the coefficient, and the ordinate shows the reserved features. The larger the coefficient is, the more predictive effect of the feature is. A shows feature selected in the clinic model, B shows feature selected in the RS model, C shows feature selected in the combined model, D MSE of 10 fold cross validation. E Coefficients of 10 fold cross validation.
Fig 4
Fig 4. Comparison of receiver operating characteristic (ROC) curves for predicting subtype of pathology.
A shows the ROC curve of LR in the training cohort; B shows the ROC curve of LR in the validation cohort.
Fig 5
Fig 5. Clinical utility of prediction models.
A shows Nomogram of a clinical radiomics model developed based on a logistic regression model for the training cohort. gender 1:male 2:female. B,C show that Decision curve analysis (DCA) was conducted for the prediction model based on the logistic regression model in the training (B) and validation cohorts (C). D,E show Calibration curves of the nomogram based on the logistic regression model in the training (D) and validation cohorts (E).

References

    1. Sung H, Ferlay J, Siegel RL, Laversanne M, Soerjomataram I, Jemal A, et al.. Global Cancer Statistics 2020: GLOBOCAN Estimates of Incidence and Mortality Worldwide for 36 Cancers in 185 Countries. CA: A Cancer Journal for Clinicians. 2021; 209–249. doi: 10.3322/caac.21660 - DOI - PubMed
    1. Bradley JD, Paulus R, Komaki R, Masters G, Blumenschein G, Schild S, et al.. Standard-dose versus high-dose conformal radiotherapy with concurrent and consolidation carboplatin plus paclitaxel with or without cetuximab for patients with stage IIIA or IIIB non-small-cell lung cancer (RTOG 0617): a randomised, two-by-two factorial phase 3 study. The Lancet Oncology. 2015;16: 187–199. doi: 10.1016/S1470-2045(14)71207-0 - DOI - PMC - PubMed
    1. Takamochi K, Ohmiya H, Itoh M, Mogushi K, Saito T, Hara K, et al.. Novel biomarkers that assist in accurate discrimination of squamous cell carcinoma from adenocarcinoma of the lung. BMC Cancer. 2016;16. doi: 10.1186/s12885-016-2792-1 - DOI - PMC - PubMed
    1. Bryan S, Masoud H, Weir Hannah K, Woods R, Lockwood G, Smith Lisa F, et al.. Cancer in Canada: Stage at diagnosis. Health Reports, Health Reports. 2018. - PubMed
    1. Cooper WA, O’Toole S, Boyer M, Horvath L, Mahar A. What’s new in non-small cell lung cancer for pathologists the importance of accurate subtyping, EGFR mutations and ALK rearrangements. Pathology. 2011;43: 103–115. doi: 10.1097/PAT.0b013e328342629d - DOI - PubMed

MeSH terms

Substances