Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Mar 28;15(7):867.
doi: 10.3390/diagnostics15070867.

A New Pes Planus Automatic Diagnosis Method: ViT-OELM Hybrid Modeling

Affiliations

A New Pes Planus Automatic Diagnosis Method: ViT-OELM Hybrid Modeling

Derya Avcı. Diagnostics (Basel). .

Abstract

Background/Objectives: Pes planus (flat feet) is a condition characterized by flatter than normal soles of the foot. In this study, a Vision Transformer (ViT)-based deep learning architecture is proposed to automate the diagnosis of pes planus. The model analyzes foot images and classifies them into two classes, as "pes planus" and "not pes planus". In the literature, models based on Convolutional neural networks (CNNs) can automatically perform such classification, regression, and prediction processes, but these models cannot capture long-term addictions and general conditions. Methods: In this study, the pes planus dataset, which is openly available on the Kaggle database, was used. This paper suggests a ViT-OELM hybrid model for automatic diagnosis from the obtained pes planus images. The suggested ViT-OELM hybrid model includes an attention mechanism for feature extraction from the pes planus images. A total of 1000 features obtained for each sample image from this attention mechanism are used as inputs for an Optimum Extreme Learning Machine (OELM) classifier using various activation functions, and are classified. Results: In this study, the performance of this suggested ViT-OELM hybrid model is compared with some other studies, which used the same pes planus database. These comparison results are given. The suggested ViT-OELM hybrid model was trained for binary classification. The performance metrics were computed in testing phase. The model showed 98.04% accuracy, 98.04% recall, 98.05% precision, and an F-1 score of 98.03%. Conclusions: Our suggested ViT-OELM hybrid model demonstrates superior performance compared to those of other studies, which used the same dataset, in the literature.

Keywords: ViT-OELM modeling; automatic diagnosis; optimum extreme learning machine (OELM); pes planus; vision transformer (ViT).

PubMed Disclaimer

Conflict of interest statement

The author declares no conflicts of interest.

Figures

Figure 1
Figure 1
Block diagram of our suggested approach using the ViT-OELM for pes planus diagnosis. In here, numbers and * represent patch and position respectively.
Figure 2
Figure 2
Example images from both pes planus and not pes planus classes in the dataset.
Figure 3
Figure 3
Block diagram of the Transformer encoder–decoder.
Figure 4
Figure 4
Block diagram of the MHA mechanism.
Figure 5
Figure 5
Block diagram of the attention mechanism used in this paper.
Figure 6
Figure 6
The hyperparameter selection and optimization process of the suggested ViT-OELM hybrid model.
Figure 7
Figure 7
The training process for the suggested VİT-OELM hybrid model.
Figure 8
Figure 8
Confusion matrix for binary-class scenario.
Figure 9
Figure 9
The ViT-OELM hybrid model’s performance curves. (a) Accuracy values. (b) Loss values.
Figure 10
Figure 10
(a) ROC Curve, (b) Precision–Recall Curve.

Similar articles

Cited by

References

    1. Gül Y., Yaman S., Avcı D., Çilengir A.H., Balaban M., Güler H. A novel deep transfer learning-based approach for automated Pes Planus diagnosis using X-ray image. Diagnostics. 2023;13:1662. doi: 10.3390/diagnostics13091662. - DOI - PMC - PubMed
    1. Danaci C., Avci D., Tuncer S.A. Diagnosis of pes planus from X-ray images: Enhanced feature selection with deep learning and machine learning techniques. Biomed. Signal Process. Control. 2025;106:107769. doi: 10.1016/j.bspc.2025.107769. - DOI
    1. Gül Y., Yaman S., Avcı D., Çilengir A.H., Balaban M., Güler H. Kaggle. 2023. [(accessed on 10 January 2025)]. Available online: https://www.kaggle.com/datasets/suleyman32/pesplanus-two-class-dataset.
    1. Huang X., Shan J., Vaidya V. Lung nodule detection in CT using 3D convolutional neural networks; Proceedings of the 2017 IEEE 14th International Symposium on Biomedical Imaging (ISBI 2017); Melbourne, VIC, Australia. 18–21 April 2017.
    1. Gopatoti A., Vijayalakshmi P. CXGNet: A tri-phase chest X-ray image classification for COVID-19 diagnosis using deep CNN with enhanced grey-wolf optimizer. Biomed. Signal Process. Control. 2022;77:103860. doi: 10.1016/j.bspc.2022.103860. - DOI - PMC - PubMed

LinkOut - more resources