Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2025 Dec 2:rs.3.rs-8002154.
doi: 10.21203/rs.3.rs-8002154/v1.

Deep Learning Detection of Retinitis Pigmentosa Inheritance Forms through Synthetic Data Expansion of a Rare Disease Dataset

Affiliations

Deep Learning Detection of Retinitis Pigmentosa Inheritance Forms through Synthetic Data Expansion of a Rare Disease Dataset

Elizabeth E Hwang et al. Res Sq. .

Abstract

Accurate classification of inheritance patterns is an integral part of diagnosis and genetic counseling for inherited retinal diseases (IRDs). Traditionally reliant on pedigree analysis, clinical phenotyping, and genetic testing, this process is often constrained by incomplete family history, ambiguous presentations, limited access to genetic testing, and inconclusive genetic test results. Deep learning (DL) applied to fundus imaging presents a promising approach for automated inference of inheritance modes; however, development has been hindered by the low prevalence of IRDs and the scarcity of annotated datasets. In this study, we focus on retinitis pigmentosa (RP), a highly heterogeneous disorder in both clinical presentation and genetic etiology. We present a first-in-class deep learning approach that leverages Vision Transformer (ViT) models to distinguish autosomal from X-linked RP using color fundus photography. To overcome challenges posed by limited data, we introduce an innovative variational autoencoder-based data expansion strategy, which improves inheritance pattern classification based on color fundus photos from 0.67 AUC to 0.79 AUC. Our findings demonstrate the potential of deep learning to uncover subtle phenotypic differences linked to genetic inheritance and introduce a novel training data augmentation method to render deep learning accessible to rare diseases.

PubMed Disclaimer

Conflict of interest statement

Additional Declarations: No competing interests reported. Competing Interests: All authors declare no financial or non-financial competing interests.

Figures

Figure 1
Figure 1
Two-way combinatorial synthetic image generation using variational autoencoder.
Figure 2
Figure 2
Examples of Synthetic Images based on Autosomal Dominant (AD) RP images. Left to right A/B ratios: 0.1, 0.3, 0.5, 0.7, 0.9.
Figure 3
Figure 3
Workflow for Data Expansion and Vision Transformer Model Training. VAE = Variational Autoencoder, ViT = Vision Transformer.
Figure 4
Figure 4
Patients mean age (panel A) and median symptom duration at time of imaging (panel B). AD = Autosomal Dominant, AR = Autosomal Recessive, XL = X-linked Recessive and X-linked Carrier (combined for anonymization purposes). ** denotes p-value < 0.005 and *** denotes p-value < 0.0001
Figure 5
Figure 5
A. Receiver Operating Characteristic (ROC) curve with pooled AUC reported for RP ViT base (unexpanded) model. B. Confusion matrix for the base model.
Figure 6
Figure 6
A. Receiver Operating Characteristic (ROC) curves for RP-ViT base and Gen 1 (random noise) expanded models. B. Confusion matrix for Gen 1 (random noise)-expanded models.
Figure 7
Figure 7
Receiver Operating Characteristic (ROC) curves for RP-ViT base, Gen 1 random noise-expanded, and Gen 2 pair-wise combinatorial expanded models. B. Confusion matrix for Gen 2 RP-ViT model.

References

    1. Abramoff M. D., Lavin P. T., Birch M., Shah N. & Folk J. C. Pivotal trial of an autonomous AI-based diagnostic system for detection of diabetic retinopathy in primary care offices. NPJ Digit. Med. 1, 39. 10.1038/s41746-018-0040-6 (2018). - DOI - PMC - PubMed
    1. Aggarwal R. et al. Diagnostic accuracy of deep learning in medical imaging: a systematic review and meta-analysis. NPJ Digit. Med. 4(1), 65. 10.1038/s41746-021-00438-z (2021). - DOI - PMC - PubMed
    1. Allwein E. L., Schapire R. E. & Singer Y. Reducing Multiclass to Binary: A Unifying Approach for Margin Classifier. J. Mach. Learn. Res. 1, 113–141 (2000). https://www.jmlr.org/papers/v1/allwein00a.html
    1. Berk A. et al. Learning from small data: Classifying sex from retinal images via deep learning. PLoS One. 18(8), e0289211. 10.1371/journal.pone.0289211 (2023). - DOI - PMC - PubMed
    1. Britten-Jones A. C. et al. Patient experiences and perceived value of genetic testing in inherited retinal diseases: a cross-sectional survey. Sci. Rep. 14(1), 5403. 10.1038/s41598-024-56121-2 (2024). - DOI - PMC - PubMed

Publication types

LinkOut - more resources