Vis Comput Ind Biomed Art. 2025 Jan 17;8(1):3.

doi: 10.1186/s42492-024-00183-6.

Explainable machine learning framework for cataracts recognition using visual features

Xiao Wu^#¹, Lingxi Hu^#¹ ², Zunjie Xiao¹, Xiaoqing Zhang¹ ³, Risa Higashita⁴ ⁵ ⁶, Jiang Liu⁷ ⁸ ⁹

Affiliations

¹ Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055, Guangdong, China.
² School of Computer Science, University of Birmingham, Birmingham, B15 2TT, United Kingdom.
³ Center for High Performance Computing and Shenzhen Key Laboratory of Intelligent Bioinformatics, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, Shenzhen, 518055, Guangdong, China.
⁴ Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055, Guangdong, China. risa@mail.sustech.edu.cn.
⁵ Tomey Corporation, Nagoya, 4510051, Japan. risa@mail.sustech.edu.cn.
⁶ Changchun University, Changchun, 130022, Jilin, China. risa@mail.sustech.edu.cn.
⁷ Research Institute of Trustworthy Autonomous Systems and Department of Computer Science and Engineering, Southern University of Science and Technology, Shenzhen, 518055, Guangdong, China. liuj@sustech.edu.cn.
⁸ School of Computer Science, University of Nottingham Ningbo China, Ningbo, 315100, Zhejiang, China. liuj@sustech.edu.cn.
⁹ Changchun University, Changchun, 130022, Jilin, China. liuj@sustech.edu.cn.

^# Contributed equally.

PMID: 39821539
PMCID: PMC11748710
DOI: 10.1186/s42492-024-00183-6

Xiao Wu et al. Vis Comput Ind Biomed Art. 2025.

Abstract

Cataract is the leading cause of blindness and visual impairment globally. Deep neural networks (DNNs) have achieved promising cataract recognition performance on anterior segment optical coherence tomography (AS-OCT) images; however, their predictions are difficult to explain, which limits their clinical application. In contrast, visual features extracted from original AS-OCT images and their transformed forms (e.g., AS-OCT-based histograms) are readily explainable but have not been fully exploited. Motivated by these observations, an explainable machine learning framework is proposed to recognize cataract severity levels automatically from AS-OCT images. It consists of three stages: visual feature extraction, feature importance explanation and selection, and recognition. First, intensity histogram and intensity-based statistical methods are applied to extract visual features from original AS-OCT images and AS-OCT-based histograms. Subsequently, the SHapley Additive exPlanations (SHAP) and Pearson correlation coefficient (PCC) methods are applied to analyze feature importance and select significant visual features. Finally, an ensemble multi-class ridge regression (EMRR) method is applied to recognize the cataract severity levels based on the selected visual features. Experiments on a clinical AS-OCT-NC dataset demonstrate that the proposed framework not only achieves performance competitive with DNNs but also offers good explainability, meeting the requirements of clinical diagnostic practice.
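The recognition stage can be illustrated with a minimal sketch of one ridge regression model per severity level, fitted in closed form. This is not the authors' implementation: the one-vs-rest target encoding, the regularization strength `lam`, the helper names, and the two-feature toy data are all assumptions made for illustration.

```python
def solve_linear(a, b):
    """Solve a @ x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    # Augmented matrix, so row operations act on both sides at once.
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def fit_ridge(X, y, lam=1.0):
    """Closed-form ridge: w = (X^T X + lam * I)^-1 X^T y.

    A constant 1.0 is appended to each row as a bias term; for simplicity
    the bias is regularized too (an assumption of this sketch).
    """
    Xb = [row + [1.0] for row in X]
    d = len(Xb[0])
    gram = [[sum(r[i] * r[j] for r in Xb) for j in range(d)] for i in range(d)]
    for i in range(d):
        gram[i][i] += lam
    rhs = [sum(r[i] * t for r, t in zip(Xb, y)) for i in range(d)]
    return solve_linear(gram, rhs)


def fit_one_vs_rest(X, labels, classes, lam=1.0):
    """Fit one ridge model per class on a 0/1 indicator target."""
    return {
        c: fit_ridge(X, [1.0 if lab == c else 0.0 for lab in labels], lam)
        for c in classes
    }


def predict(models, x):
    """Return the class whose ridge model gives the largest output."""
    xb = x + [1.0]
    scores = {c: sum(wi * vi for wi, vi in zip(w, xb)) for c, w in models.items()}
    return max(scores, key=scores.get)


# Made-up two-feature toy data, not taken from the AS-OCT-NC dataset.
X = [[0.0, 0.0], [0.1, 0.0], [1.0, 0.0], [0.9, 0.1], [0.0, 1.0], [0.1, 0.9]]
labels = ["normal", "normal", "mild", "mild", "severe", "severe"]
models = fit_one_vs_rest(X, labels, ["normal", "mild", "severe"], lam=0.01)
```

Calling `predict(models, [0.95, 0.05])` then returns the label of the model with the highest output for that feature vector. Keeping each per-level model linear means its weights can be inspected directly, which fits the explainability goal stated above.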

Keywords: Anterior segment optical coherence tomography; Explainable; Machine learning; Nuclear cataract; Visual feature.

Conflict of interest statement

Declarations. Competing interests: No potential competing interest was reported by the authors.

Figures

**Fig. 1**
Three representative AS-OCT images and their corresponding AS-OCT-based histograms for three NC severity levels: a normal, b mild, and c severe. The AS-OCT-based histograms are built by counting the pixel values in the AS-OCT images. The number of pixels in each interval is also referred to as the intensity. The original AS-OCT images of different NC severity levels look very similar, whereas their AS-OCT-based histograms differ significantly.
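The histogram construction described in this caption can be sketched as follows. The bin count, the 8-bit gray-level range, and the particular summary statistics are assumptions; the 23 histogram-based features used in the paper are not listed here.

```python
def intensity_histogram(pixels, bins=256, max_value=255):
    """Count how many pixels fall into each of `bins` equal-width intervals.

    `pixels` is a flat list of gray values (assumed 0..max_value), e.g. the
    pixels of the segmented nucleus region.
    """
    counts = [0] * bins
    width = (max_value + 1) / bins
    for p in pixels:
        idx = min(int(p / width), bins - 1)  # clamp the top edge into the last bin
        counts[idx] += 1
    return counts


def histogram_stats(counts):
    """A few illustrative statistics over the per-interval pixel counts."""
    n = len(counts)
    mean = sum(counts) / n
    variance = sum((c - mean) ** 2 for c in counts) / n
    peak = max(counts)
    return {
        "mean": mean,
        "variance": variance,
        "max": peak,
        "peak_bin": counts.index(peak),  # first interval with the highest count
    }


# Hypothetical pixel values, grouped into four intervals of width 64.
counts = intensity_histogram([0, 0, 10, 255, 128, 130], bins=4)
stats = histogram_stats(counts)
```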

**Fig. 2**
Flowchart of the proposed explainable machine learning framework. Given an AS-OCT image, a deep segmentation network is first applied to segment the nucleus region automatically. Second, 23 histogram-based statistical features are extracted from the AS-OCT-based histogram and four clinical intensity-based statistical features are extracted from the original AS-OCT image. Subsequently, the relative importance of the features is analyzed and an informative feature set is selected based on SHAP and PCC. Finally, the ensemble multi-class ridge regression (EMRR) is applied to recognize the cataract severity level.

**Fig. 3**
RR models for different NC severity levels: a normal, b mild, and c severe. The EMRR first calculates $P(y_{i})$ for each level based on the corresponding model output $y_{i}$. Subsequently, it selects the level with the largest probability, here $P(y_{i}) = 0.525$, as the final output, which is severe.
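The decision rule in this caption can be sketched as below. The caption does not say how $P(y_{i})$ is obtained from the model output $y_{i}$, so the softmax normalization used here is an assumption, and the example scores are hypothetical rather than the values behind the figure.

```python
import math


def softmax(scores):
    """Map per-level ridge outputs y_i to probabilities that sum to 1."""
    top = max(scores.values())  # subtract the max for numerical stability
    exps = {level: math.exp(v - top) for level, v in scores.items()}
    total = sum(exps.values())
    return {level: e / total for level, e in exps.items()}


def emrr_decide(scores):
    """Return the level with the largest probability, plus all probabilities."""
    probs = softmax(scores)
    level = max(probs, key=probs.get)
    return level, probs


# Hypothetical outputs of the three per-level ridge models for one image.
level, probs = emrr_decide({"normal": 0.1, "mild": 0.3, "severe": 0.9})
```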

**Fig. 4**
SHAP values between the 27 visual features and NC severity levels: a normal, b mild, c severe, and d overall. The visual features contribute differently to recognizing the different NC severity levels, and the height of each denotes its importance.
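For a linear model such as ridge regression, SHAP values have a simple closed form when features are treated as independent: the contribution of feature $j$ for one sample is $w_j (x_j - \mathbb{E}[x_j])$. The sketch below uses that formula to rank features by mean absolute contribution. It does not use the `shap` library, and whether the authors used this explainer or another one is not stated here; the weights and data are made up.

```python
def linear_shap(weights, X):
    """Per-sample SHAP values for f(x) = w . x + b under feature independence.

    Each value is w_j * (x_j - mean_j); the bias b cancels out.
    """
    n = len(X)
    means = [sum(row[j] for row in X) / n for j in range(len(weights))]
    return [
        [w * (v - mu) for w, v, mu in zip(weights, row, means)]
        for row in X
    ]


def mean_abs_importance(shap_values):
    """Global importance: mean absolute SHAP value of each feature."""
    n = len(shap_values)
    d = len(shap_values[0])
    return [sum(abs(row[j]) for row in shap_values) / n for j in range(d)]


# Hypothetical weights of one ridge model and three samples with two features.
weights = [2.0, -1.0]
X = [[1.0, 0.0], [3.0, 0.0], [2.0, 4.0]]
values = linear_shap(weights, X)
importance = mean_abs_importance(values)
```

The values for each sample sum to the difference between that sample's prediction and the mean prediction, which is the additivity property that makes SHAP attributions comparable across features.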

**Fig. 5**
PCC matrix between features. The coefficients range from -1 to 1 and indicate the degree of correlation. Values close to -1 or 1 indicate a strong correlation, whereas values close to 0 indicate a weak correlation.
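A PCC matrix like the one in this figure can be used to discard redundant features. The sketch below computes the Pearson coefficient and greedily keeps features in order of importance, skipping any that correlate strongly with one already kept. The threshold of 0.9, the greedy rule, and the feature names and values are assumptions, not the selection procedure reported in the paper.

```python
import math


def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    if sx == 0.0 or sy == 0.0:
        return 0.0  # a constant feature carries no correlation information
    return cov / (sx * sy)


def drop_redundant(features, ranked_names, threshold=0.9):
    """Keep features in ranked order unless |PCC| with a kept one >= threshold."""
    kept = []
    for name in ranked_names:
        if all(abs(pearson(features[name], features[k])) < threshold for k in kept):
            kept.append(name)
    return kept


# Hypothetical feature columns; "median" is almost a rescaled copy of "mean".
features = {
    "mean": [1.0, 2.0, 3.0, 4.0],
    "median": [2.0, 4.0, 6.0, 8.1],
    "skew": [1.0, -1.0, 1.0, -1.0],
}
selected = drop_redundant(features, ["mean", "median", "skew"])
```

Here `ranked_names` is assumed to come from a prior importance ranking, such as mean absolute SHAP values, so that the more informative member of a correlated pair is the one retained.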

**Fig. 6**
Training loss and ACC of different DNNs with a learning rate of 0.001

**Fig. 7**
Confusion matrices of the EMRR, LR, and four other DNNs on AS-OCT-NC dataset with a learning rate of 0.001

**Fig. 8**
Performance comparison of machine learning methods based on histogram-based statistical features in different intensity ranges

**Fig. 9**
Performance comparison of machine learning methods based on histogram-based statistical features in different intensity intervals
