Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Oct 10;14(1):23784.
doi: 10.1038/s41598-024-72013-x.

Greylag goose optimization and multilayer perceptron for enhancing lung cancer classification

Affiliations

Greylag goose optimization and multilayer perceptron for enhancing lung cancer classification

El-Sayed M Elkenawy et al. Sci Rep. .

Abstract

Lung cancer is an important global health problem, and it is defined by abnormal growth of the cells in the tissues of the lung, mostly leading to significant morbidity and mortality. Its timely identification and correct staging are very important for proper therapy and prognosis. Different computational methods have been used to enhance the precision of lung cancer classification, among which optimization algorithms such as Greylag Goose Optimization (GGO) are employed. These algorithms have the purpose of improving the performance of machine learning models that are presented with a large amount of complex data, selecting the most important features. As per lung cancer classification, data preparation is one of the most important steps, which contains the operations of scaling, normalization, and handling gap factor to ensure reasonable and reliable input data. In this domain, the use of GGO includes refining feature selection, which mainly focuses on enhancing the classification accuracy compared to other binary format optimization algorithms, like bSC, bMVO, bPSO, bWOA, bGWO, and bFOA. The efficiency of the bGGO algorithm in choosing the optimal features for improved classification accuracy is an indicator of the possible application of this method in the field of lung cancer diagnosis. The GGO achieved the highest accuracy with MLP model performance at 98.4%. The feature selection and classification results were assessed using statistical analysis, which utilized the Wilcoxon signed-rank test and ANOVA. The results were also accompanied by a set of graphical illustrations that ensured the adequacy and efficiency of the adopted hybrid method (GGO + MLP).

Keywords: Classification; Feature selection; GGO; Lung cancer; MLP; Optimization.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Comparison between healthy lung and cancer lung.
None
Algorithm 1: GGO algorithm.
None
Algorithm 2: bGGO algorithm.
Fig. 2
Fig. 2
The proposed lung cancer classification framework.
Fig. 3
Fig. 3
Scatter plot for each feature in the dataset.
Fig. 4
Fig. 4
A correlation matrix between features in the dataset.
Fig. 5
Fig. 5
The Average Error of the Results Acquired using bGGO, the Proposed Feature Selection Technique.
Fig. 6
Fig. 6
Analysis plots of the obtained outcomes based on bGGO, the proposed feature selection technique.
Fig. 7
Fig. 7
Assessing the accuracy of the GGO + MLP approach and optimization algorithms using the MLP model, considering the objective function.
Fig. 8
Fig. 8
Histograms of the accuracy results achieved by GGO + MLP approach as well as alternative combinations of optimization techniques with MLP models.
Fig. 9
Fig. 9
Analysis plots of the obtained results using the proposed GGO + MLP approach.
Fig. 10
Fig. 10
KDE plot of accuracy of reference models results.
Fig. 11
Fig. 11
Box plot with swarm overlay: accuracy by reference models.
Fig. 12
Fig. 12
Pairplot with Regression Lines for Reference Models Results.
Fig. 13
Fig. 13
Regression Plot: Accuracy vs. F-Score for Reference Models Results.
Fig. 14
Fig. 14
KDE plot of accuracy of optimization results.
Fig. 15
Fig. 15
Box plot with swarm overlay: accuracy by optimization models.
Fig. 16
Fig. 16
Performance of the Reference models using Accuracy, Sensitivity (TRP), Specificity(TNP) by Model, Pvalue (PPV), Pvalue (NPV), FScore by Optimization Models.

Similar articles

Cited by

References

    1. Asuntha, A. & Srinivasan, A. Deep learning for lung Cancer detection and classification. Multimed. Tools Appl.79, 7731–7762 (2020).
    1. Chaturvedi, P., Jhamb, A., Vanani, M. & Nemade, V. Prediction and classification of lung cancer using machine learning techniques. In IOP Conference Series: Materials Science and Engineering 12059 (IOP Publishing, 2021).
    1. Zhu, Y. et al. Feature selection and performance evaluation of support vector machine (SVM)-based classifier for differentiating benign and malignant pulmonary nodules by computed tomography. J. Digit. Imaging23, 51–65 (2010). - PMC - PubMed
    1. Nanglia, P., Kumar, S., Mahajan, A. N., Singh, P. & Rathee, D. A hybrid algorithm for lung cancer classification using SVM and Neural Networks. ICT Express7(3), 335–341 (2021).
    1. Elshewey, A. M. et al. A Novel WD-SARIMAX model for temperature forecasting using daily Delhi climate dataset. Sustainability15(1), 757 (2022).