Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2023 Mar 18;13(6):1167.
doi: 10.3390/diagnostics13061167.

Phenotyping the Histopathological Subtypes of Non-Small-Cell Lung Carcinoma: How Beneficial Is Radiomics?

Affiliations

Phenotyping the Histopathological Subtypes of Non-Small-Cell Lung Carcinoma: How Beneficial Is Radiomics?

Giovanni Pasini et al. Diagnostics (Basel). .

Abstract

The aim of this study was to investigate the usefulness of radiomics in the absence of well-defined standard guidelines. Specifically, we extracted radiomics features from multicenter computed tomography (CT) images to differentiate between the four histopathological subtypes of non-small-cell lung carcinoma (NSCLC). In addition, the results that varied with the radiomics model were compared. We investigated the presence of the batch effects and the impact of feature harmonization on the models' performance. Moreover, the question on how the training dataset composition influenced the selected feature subsets and, consequently, the model's performance was also investigated. Therefore, through combining data from the two publicly available datasets, this study involves a total of 152 squamous cell carcinoma (SCC), 106 large cell carcinoma (LCC), 150 adenocarcinoma (ADC), and 58 no other specified (NOS). Through the matRadiomics tool, which is an example of Image Biomarker Standardization Initiative (IBSI) compliant software, 1781 radiomics features were extracted from each of the malignant lesions that were identified in CT images. After batch analysis and feature harmonization, which were based on the ComBat tool and were integrated in matRadiomics, the datasets (the harmonized and the non-harmonized) were given as an input to a machine learning modeling pipeline. The following steps were articulated: (i) training-set/test-set splitting (80/20); (ii) a Kruskal-Wallis analysis and LASSO linear regression for the feature selection; (iii) model training; (iv) a model validation and hyperparameter optimization; and (v) model testing. Model optimization consisted of a 5-fold cross-validated Bayesian optimization, repeated ten times (inner loop). The whole pipeline was repeated 10 times (outer loop) with six different machine learning classification algorithms. Moreover, the stability of the feature selection was evaluated. Results showed that the batch effects were present even if the voxels were resampled to an isotropic form and whether feature harmonization correctly removed them, even though the models' performances decreased. Moreover, the results showed that a low accuracy (61.41%) was reached when differentiating between the four subtypes, even though a high average area under curve (AUC) was reached (0.831). Further, a NOS subtype was classified as almost completely correct (true positive rate ~90%). The accuracy increased (77.25%) when only the SCC and ADC subtypes were considered, as well as when a high AUC (0.821) was obtained-although harmonization decreased the accuracy to 58%. Moreover, the features that contributed the most to models' performance were those extracted from wavelet decomposed and Laplacian of Gaussian (LoG) filtered images and they belonged to the texture feature class.. In conclusion, we showed that our multicenter data were affected by batch effects, that they could significantly alter the models' performance, and that feature harmonization correctly removed them. Although wavelet features seemed to be the most informative features, an absolute subset could not be identified since it changed depending on the training/testing splitting. Moreover, performance was influenced by the chosen dataset and by the machine learning methods, which could reach a high accuracy in binary classification tasks, but could underperform in multiclass problems. It is, therefore, essential that the scientific community propose a more systematic radiomics approach, focusing on multicenter studies, with clear and solid guidelines to facilitate the translation of radiomics to clinical practice.

Keywords: CT; NSCLC; harmonization; machine learning; multicenter; non-small-cell lung carcinoma; phenotyping; radiomics.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Data preparation workflow.
Figure 2
Figure 2
Feature extraction workflow.
Figure 3
Figure 3
Final dataset subdivision into the 4-classes dataset, 3-classes dataset, and the 2-classes dataset.
Figure 4
Figure 4
Batch analysis and harmonization pipeline.
Figure 5
Figure 5
Modeling pipeline. The feature selection process is in yellow and the model building process is in purple.
Figure 6
Figure 6
The PCA scatter plot on the (left) and the tSNE scatter plot on the (right) for the 4-classes dataset.
Figure 7
Figure 7
The PCA scatter plot on the (left) and the tSNE scatter plot on the (right) for the 3-classes dataset.
Figure 8
Figure 8
The PCA scatter plot on the (left) and the tSNE scatter plot on the (right) for the 2-classes dataset.
Figure 9
Figure 9
The PCA scatter plot after harmonization. The 4-classes are on the top-left, the 3-classes are on the top-right, the 2-classes are on the bottom-left.
Figure 10
Figure 10
Accuracy heatmaps: (a) The 4-classes datasets, (b) 3-classes datasets, and (c) 2-classes datasets.
Figure 11
Figure 11
AUC heatmaps: (a) The 4-classes datasets, (b) 3-classes datasets, and (c) 2-classes datasets.
Figure 12
Figure 12
Accuracy heatmaps for the harmonized datasets: (a) The 4-classes datasets, (b) 3-classes datasets, and (c) 2-classes datasets.
Figure 13
Figure 13
AUC heatmaps for the harmonized datasets: (a) The 4-classes datasets, (b) 3-classes datasets, and (c) 2-classes datasets.
Figure 14
Figure 14
ROC curves: (top) The 4-classes datasets, (middle) 3-classes datasets, and (bottom) 2-classes datasets.
Figure 14
Figure 14
ROC curves: (top) The 4-classes datasets, (middle) 3-classes datasets, and (bottom) 2-classes datasets.
Figure 15
Figure 15
ROC curves for the harmonized datasets: (top) The 4-classes datasets, (middle) 3-classes datasets, and (bottom) 2-classes datasets.
Figure 15
Figure 15
ROC curves for the harmonized datasets: (top) The 4-classes datasets, (middle) 3-classes datasets, and (bottom) 2-classes datasets.

References

    1. Siegel R.L., Miller K.D., Fuchs H.E., Jemal A. Cancer statistics. CA Cancer J. Clin. 2022;72:7–33. doi: 10.3322/caac.21708. - DOI - PubMed
    1. Dalmartello M., La Vecchia C., Bertuccio P., Boffetta P., Levi F., Negri E., Malvezzi M. European cancer mortality predictions for the year 2022 with focus on ovarian cancer. Ann. Oncol. 2022;33:330–339. doi: 10.1016/j.annonc.2021.12.007. - DOI - PubMed
    1. Duma N., Santana-Davila R., Molina J.R. Non-Small Cell Lung Cancer: Epidemiology, Screening, Diagnosis, and Treatment. Mayo Clin. Proc. 2019;94:1623–1640. doi: 10.1016/j.mayocp.2019.01.013. - DOI - PubMed
    1. Travis W.D., Brambilla E., Nicholson A.G., Yatabe Y., Austin J.H.M., Beasley M.B., Chirieac L.R., Dacic S., Duhig E., Flieder D.B., et al. The 2015 World Health Organization Classification of Lung Tumors: Impact of Genetic, Clinical and Radiologic Advances Since the 2004 Classification. J. Thorac. Oncol. 2015;10:1243–1260. doi: 10.1097/JTO.0000000000000630. - DOI - PubMed
    1. Xing P., Zhu Y., Wang L., Hui Z., Liu S., Ren J., Zhang Y., Song Y., Liu C., Huang Y., et al. What are the clinical symptoms and physical signs for non-small cell lung cancer before diagnosis is made? A nation-wide multicenter 10-year retrospective study in China. Cancer Med. 2019;8:4055–4069. doi: 10.1002/cam4.2256. - DOI - PMC - PubMed