. 2025 Jan 2;16(1):60.

doi: 10.1038/s41467-024-55198-7.

Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology

Anand E Rajesh^#^{1

2}, Abraham Olvera-Barrios^#³, Alasdair N Warwick^{3

4}, Yue Wu^{1

2}, Kelsey V Stuart³, Mahantesh I Biradar³, Chuin Ying Ung⁵, Anthony P Khawaja^{3

6}, Robert Luben^{3

6}, Paul J Foster³, Charles R Cleland^{7

8}, William U Makupa⁸, Alastair K Denniston⁹, Matthew J Burton^{3

7}, Andrew Bastawrous^{8

10}, Pearse A Keane³, Mark A Chia³, Angus W Turner¹¹, Cecilia S Lee^{1

2}, Adnan Tufail³, Aaron Y Lee^{1

2}, Catherine Egan¹²; UK Biobank Eye and Vision Consortium

Collaborators, Affiliations

Collaborators

UK Biobank Eye and Vision Consortium:
Naomi Allen, Tariq Aslam, Denize Atan, Konstantinos Balaskas, Sarah Barman, Jenny Barrett, Paul Bishop, Graeme Black, Tasanee Braithwaite, Roxana Carare, Usha Chakravarthy, Michelle Chan, Sharon Chua, Alexander Day, Parul Desai, Baljean Dhillon, Andrew Dick, Alexander Doney, Sarah Ennis, John Gallacher, David Ted Garway-Heath, Jane Gibson, Jeremy Guggenheim, Chris Hammond, Alison Hardcastle, Simon Harding, Ruth Hogg, Pirro Hysi, Gerassimos Lascaratos, Thomas Littlejohns, Andrew Lotery, Phil Luthert, Tom MacGillivray, Sarah Mackie, Savita Madhusudhan, Bernadette McGuinness, Gareth McKay, Martin McKibbin, Tony Moore, James Morgan, Eoin O'Sullivan, Richard Oram, Chris Owen, Praveen Patel, Euan Paterson, Tunde Peto, Axel Petzold, Nikolas Pontikos, Jugnoo Rahi, Alicja Rudnicka, Naveed Sattar, Jay Self, Panagiotis Sergouniotis, Sobha Sivaprasad, David Steel, Irene Stratton, Nicholas Strouthidis, Cathie Sudlow, Zihan Sun, Robyn Tapp, Dhanes Thomas, Emanuele Trucco, Ananth Viswanathan, Veronique Vitart, Mike Weedon, Katie Williams, Cathy Williams, Jayne Woodside, Max Yates, Yalin Zheng

Affiliations

¹ Department of Ophthalmology, University of Washington, Seattle, WA, USA.
² The Roger and Angie Karalis Johnson Retina Center, Seattle, WA, USA.
³ NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology, London, UK.
⁴ University College London Institute of Cardiovascular Science, London, UK.
⁵ Guy's and St Thomas' NHS Foundation Trust, London, UK.
⁶ MRC Epidemiology Unit, University of Cambridge, Cambridge, UK.
⁷ International Centre for Eye Health, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK.
⁸ Eye Department, Kilimanjaro Christian Medical Centre, Moshi, United Republic of Tanzania.
⁹ NIHR Birmingham Biomedical Research Centre, Birmingham, UK.
¹⁰ PEEK Vision, Berkhamsted, UK.
¹¹ Lions Eye Institute, University of Western Australia, Nedlands, WA, Australia.
¹² NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology, London, UK. cathy.egan@nhs.net.

^# Contributed equally.

PMID: 39746957
PMCID: PMC11696055
DOI: 10.1038/s41467-024-55198-7

Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology

Anand E Rajesh et al. Nat Commun. 2025.

. 2025 Jan 2;16(1):60.

doi: 10.1038/s41467-024-55198-7.

Authors

Collaborators

UK Biobank Eye and Vision Consortium:
Naomi Allen, Tariq Aslam, Denize Atan, Konstantinos Balaskas, Sarah Barman, Jenny Barrett, Paul Bishop, Graeme Black, Tasanee Braithwaite, Roxana Carare, Usha Chakravarthy, Michelle Chan, Sharon Chua, Alexander Day, Parul Desai, Baljean Dhillon, Andrew Dick, Alexander Doney, Sarah Ennis, John Gallacher, David Ted Garway-Heath, Jane Gibson, Jeremy Guggenheim, Chris Hammond, Alison Hardcastle, Simon Harding, Ruth Hogg, Pirro Hysi, Gerassimos Lascaratos, Thomas Littlejohns, Andrew Lotery, Phil Luthert, Tom MacGillivray, Sarah Mackie, Savita Madhusudhan, Bernadette McGuinness, Gareth McKay, Martin McKibbin, Tony Moore, James Morgan, Eoin O'Sullivan, Richard Oram, Chris Owen, Praveen Patel, Euan Paterson, Tunde Peto, Axel Petzold, Nikolas Pontikos, Jugnoo Rahi, Alicja Rudnicka, Naveed Sattar, Jay Self, Panagiotis Sergouniotis, Sobha Sivaprasad, David Steel, Irene Stratton, Nicholas Strouthidis, Cathie Sudlow, Zihan Sun, Robyn Tapp, Dhanes Thomas, Emanuele Trucco, Ananth Viswanathan, Veronique Vitart, Mike Weedon, Katie Williams, Cathy Williams, Jayne Woodside, Max Yates, Yalin Zheng

Affiliations

¹ Department of Ophthalmology, University of Washington, Seattle, WA, USA.
² The Roger and Angie Karalis Johnson Retina Center, Seattle, WA, USA.
³ NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology, London, UK.
⁴ University College London Institute of Cardiovascular Science, London, UK.
⁵ Guy's and St Thomas' NHS Foundation Trust, London, UK.
⁶ MRC Epidemiology Unit, University of Cambridge, Cambridge, UK.
⁷ International Centre for Eye Health, Faculty of Infectious and Tropical Diseases, London School of Hygiene & Tropical Medicine, London, UK.
⁸ Eye Department, Kilimanjaro Christian Medical Centre, Moshi, United Republic of Tanzania.
⁹ NIHR Birmingham Biomedical Research Centre, Birmingham, UK.
¹⁰ PEEK Vision, Berkhamsted, UK.
¹¹ Lions Eye Institute, University of Western Australia, Nedlands, WA, Australia.
¹² NIHR Biomedical Research Centre, Moorfields Eye Hospital NHS Foundation Trust & University College London Institute of Ophthalmology, London, UK. cathy.egan@nhs.net.

^# Contributed equally.

PMID: 39746957
PMCID: PMC11696055
DOI: 10.1038/s41467-024-55198-7

Abstract

Few metrics exist to describe phenotypic diversity within ophthalmic imaging datasets, with researchers often using ethnicity as a surrogate marker for biological variability. We derived a continuous, measured metric, the retinal pigment score (RPS), that quantifies the degree of pigmentation from a colour fundus photograph of the eye. RPS was validated using two large epidemiological studies with demographic and genetic data (UK Biobank and EPIC-Norfolk Study) and reproduced in a Tanzanian, an Australian, and a Chinese dataset. A genome-wide association study (GWAS) of RPS from UK Biobank identified 20 loci with known associations with skin, iris and hair pigmentation, of which eight were replicated in the EPIC-Norfolk cohort. There was a strong association between RPS and ethnicity, however, there was substantial overlap between each ethnicity and the respective distributions of RPS scores. RPS decouples traditional demographic variables from clinical imaging characteristics. RPS may serve as a useful metric to quantify the diversity of the training, validation, and testing datasets used in the development of AI algorithms to ensure adequate inclusion and explainability of the model performance, critical in evaluating all currently deployed AI models. The code to derive RPS is publicly available at: https://github.com/uw-biomedical-ml/retinal-pigmentation-score .

PubMed Disclaimer

Conflict of interest statement

Competing interests: A.P.K. has acted as a paid consultant or lecturer to Abbvie, Aerie, Allergan, Google Health, Heidelberg Engineering, Novartis, Reichert, Santen,Thea and Topcon. A.Y.L. reports support from the US Food and Drug Administration, grants from Santen, Carl Zeiss Meditec, and Novartis, personal fees from Genentech, Topcon, and Verana Health, outside of the submitted work; This article does not reflect the opinions of the Food and Drug Administration. A.T. report grants from Bayer and Novartis and personal fees from Abbvie, Allegro, Annexon, Apellis, Bayer, Heidelberg Engineering, Iveric Bio, Kanghong, Novartis, Oxurion, Roche/Genentech, Thea. C.E. reports personal fees from Heidelberg Engineering, Boehringer Ingelheim, and Inozyme pharmaceuticals outside of the submitted work. P.A.K. has acted as a consultant for Retina Consultants of America, Topcon, Roche, Boehringer-Ingleheim, and Bitfount and is an equity owner in Big Picture Medical. He has received speaker fees from Zeiss, Novartis, Gyroscope, Boehringer-Ingleheim, Apellis, Roche, Abbvie, Topcon, and Hakim Group. He has received travel support from Bayer, Topcon, and Roche. He has attended advisory boards for Topcon, Bayer, Boehringer-Ingleheim, RetinAI, and Novartis. P.J.F. has acted as a consultant for Alphasights, GLG, Google Health, Guidepoint, PwC, Santen. A.B. is Founder and CEO of not-for-profit Peek Vision and receives a salary. The remaining authors declare no competing interests.

Figures

**Fig. 1. Schematic showing the method to generate the retinal pigmentation score (RPS) from a colour fundus image.**
Input images are fed into the deep learning algorithm to generate segmentation masks. These are added together to make a retinal background mask, which is then transformed into L,a,b colorspace. The chromaticity vectors are then extracted and transformed by the principal component analysis model to create the RPS. Created with Biorender.com.

**Fig. 2. Representative fundus photos with associated RPS.**
a Randomly sampled colour fundus photographs from each UK Biobank self-reported ethnicity and from the Tanzanian, Australian, and Chinese (ODIR) datasets, sorted by quintiles of retinal pigment score (RPS) across the entire distribution of RPS for the UK Biobank cohort. The RGB colour of the pixel value that is converted into RPS as well as the RPS is shown at the bottom of each fundus photograph. Black spaces represent when there are no suitable images within the respective ethnicity subgroup and quintile b Normalised kernel density estimation plot of the distribution of RPS for all participants grouped by self-reported ethnicity as reported in the UK Biobank as well as the Tanzanian, Australian, and Chinese (ODIR) datasets. Relative frequencies are normalised so the area under each curve is equal for each ethnicity subgroup. The subpanel consists of examples where for a given RPS and the a,b values in the CIELAB colour space are constant but the L vector changes. The x-axis is shared in both subpanels. Source data are provided as a Source Data file.

**Fig. 3. Manhattan plot of GWAS results from the discovery cohort (UKBiobank, n = 37067).**
The Y-axis represents the two-sided p-values from the linear mixed effects model. Lead variants identified by GCTA-COJO are annotated with the nearest gene. Points are truncated at −log10(p) = 70 for clarity. The dashed red line indicates genome-wide significance (p = 5 × 10⁻⁸) which is adjusted for multiple comparisons and the p-values are two-sided and calculated with the z-statistic. Source data are provided as a Source Data file.

**Fig. 4. Comparison of betas for lead variants identified from the discovery and replication cohort.**
Comparison of betas expressed as change in standard deviation of mean RPS for lead variants identified from the discovery (UK Biobank, n = 37067) genome-wide association study (GWAS) with their corresponding betas in the replication (EPIC-Norfolk, n = 4273) analysis, with 95% confidence intervals. Betas in the cohort were calculated using a generalised linear mixed model, adjusting for age, sex and the first ten principal components. P-values are two-sided, calculated from the z-statistic and corrected for multiple comparisons. Variants meeting the Bonferroni-adjusted replication significance threshold (p = 0.05/ variants) in the EPIC-Norfolk GWAS are shaded black. The nearest gene is annotated for variants achieving genome-wide significance. Source data are provided as a Source Data file.

See this image and copyright information in PMC

Update of

Ethnicity is not biology: retinal pigment score to evaluate biological variability from ophthalmic imaging using machine learning.
Rajesh AE, Olvera-Barrios A, Warwick AN, Wu Y, Stuart KV, Biradar M, Ung CY, Khawaja AP, Luben R, Foster PJ, Lee CS, Tufail A, Lee AY, Egan C; EPIC Norfolk, UK Biobank Eye and Vision Consortium. Rajesh AE, et al. medRxiv [Preprint]. 2023 Jul 6:2023.06.28.23291873. doi: 10.1101/2023.06.28.23291873. medRxiv. 2023. Update in: Nat Commun. 2025 Jan 2;16(1):60. doi: 10.1038/s41467-024-55198-7. PMID: 37461664 Free PMC article. Updated. Preprint.

References

1. Flaxman, S. R. et al. Global causes of blindness and distance vision impairment 1990-2020: a systematic review and meta-analysis. Lancet Glob. Health5, e1221–e1234 (2017). - DOI - PubMed
1. Wong, W. L. et al. Global prevalence of age-related macular degeneration and disease burden projection for 2020 and 2040: a systematic review and meta-analysis. Lancet Glob. Health2, e106–e116 (2014). - DOI - PubMed
1. Teo, Z. L. et al. Global prevalence of diabetic retinopathy and projection of burden through 2045: Systematic review and meta-analysis. Ophthalmology128, 1580–1591 (2021). - DOI - PubMed
1. Lee, A. Y. et al. Multicenter, head-to-head, real-world validation study of seven automated artificial intelligence diabetic retinopathy screening systems. Diabetes Care44, 1168–1175 (2021). - DOI - PMC - PubMed
1. Tufail, A. et al. Automated diabetic retinopathy image assessment software: diagnostic accuracy and cost-effectiveness compared with human graders. Ophthalmology124, 343–351 (2017). - DOI - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
- Nature Publishing Group
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology

Collaborators

Affiliations

Machine learning derived retinal pigment score from ophthalmic imaging shows ethnicity is not biology

Authors

Collaborators

Affiliations

Abstract

Conflict of interest statement

Figures

Update of

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources