BRSET: A Brazilian Multilabel Ophthalmological Dataset of Retina Fundus Photos
- PMID: 38991014
- PMCID: PMC11239107
- DOI: 10.1371/journal.pdig.0000454
BRSET: A Brazilian Multilabel Ophthalmological Dataset of Retina Fundus Photos
Abstract
Introduction: The Brazilian Multilabel Ophthalmological Dataset (BRSET) addresses the scarcity of publicly available ophthalmological datasets in Latin America. BRSET comprises 16,266 color fundus retinal photos from 8,524 Brazilian patients, aiming to enhance data representativeness, serving as a research and teaching tool. It contains sociodemographic information, enabling investigations into differential model performance across demographic groups.
Methods: Data from three São Paulo outpatient centers yielded demographic and medical information from electronic records, including nationality, age, sex, clinical history, insulin use, and duration of diabetes diagnosis. A retinal specialist labeled images for anatomical features (optic disc, blood vessels, macula), quality control (focus, illumination, image field, artifacts), and pathologies (e.g., diabetic retinopathy). Diabetic retinopathy was graded using International Clinic Diabetic Retinopathy and Scottish Diabetic Retinopathy Grading. Validation used a ConvNext model trained during 50 epochs using a weighted cross entropy loss to avoid overfitting, with 70% training (20% validation), and 30% testing subsets. Performance metrics included area under the receiver operating curve (AUC) and Macro F1-score. Saliency maps were calculated for interpretability.
Results: BRSET comprises 65.1% Canon CR2 and 34.9% Nikon NF5050 images. 61.8% of the patients are female, and the average age is 57.6 (± 18.26) years. Diabetic retinopathy affected 15.8% of patients, across a spectrum of disease severity. Anatomically, 20.2% showed abnormal optic discs, 4.9% abnormal blood vessels, and 28.8% abnormal macula. A ConvNext V2 model was trained and evaluated BRSET in four prediction tasks: "binary diabetic retinopathy diagnosis (Normal vs Diabetic Retinopathy)" (AUC: 97, F1: 89); "3 class diabetic retinopathy diagnosis (Normal, Proliferative, Non-Proliferative)" (AUC: 97, F1: 82); "diabetes diagnosis" (AUC: 91, F1: 83); "sex classification" (AUC: 87, F1: 70).
Discussion: BRSET is the first multilabel ophthalmological dataset in Brazil and Latin America. It provides an opportunity for investigating model biases by evaluating performance across demographic groups. The model performance of three prediction tasks demonstrates the value of the dataset for external validation and for teaching medical computer vision to learners in Latin America using locally relevant data sources.
Copyright: © 2024 Nakayama et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures



Update of
-
BRSET: A Brazilian Multilabel Ophthalmological Dataset of Retina Fundus Photos.medRxiv [Preprint]. 2024 Jan 23:2024.01.23.24301660. doi: 10.1101/2024.01.23.24301660. medRxiv. 2024. Update in: PLOS Digit Health. 2024 Jul 11;3(7):e0000454. doi: 10.1371/journal.pdig.0000454. PMID: 38343827 Free PMC article. Updated. Preprint.
Similar articles
-
Automated machine learning model for fundus image classification by health-care professionals with no coding experience.Sci Rep. 2024 May 6;14(1):10395. doi: 10.1038/s41598-024-60807-y. Sci Rep. 2024. PMID: 38710726 Free PMC article.
-
BRSET: A Brazilian Multilabel Ophthalmological Dataset of Retina Fundus Photos.medRxiv [Preprint]. 2024 Jan 23:2024.01.23.24301660. doi: 10.1101/2024.01.23.24301660. medRxiv. 2024. Update in: PLOS Digit Health. 2024 Jul 11;3(7):e0000454. doi: 10.1371/journal.pdig.0000454. PMID: 38343827 Free PMC article. Updated. Preprint.
-
A portable retina fundus photos dataset for clinical, demographic, and diabetic retinopathy prediction.Sci Data. 2025 Feb 22;12(1):323. doi: 10.1038/s41597-025-04627-3. Sci Data. 2025. PMID: 39987104 Free PMC article.
-
Prognostic factors for the development and progression of proliferative diabetic retinopathy in people with diabetic retinopathy.Cochrane Database Syst Rev. 2023 Feb 22;2(2):CD013775. doi: 10.1002/14651858.CD013775.pub2. Cochrane Database Syst Rev. 2023. PMID: 36815723 Free PMC article. Review.
-
Diabetic retinopathy techniques in retinal images: A review.Artif Intell Med. 2019 Jun;97:168-188. doi: 10.1016/j.artmed.2018.10.009. Epub 2018 Nov 16. Artif Intell Med. 2019. PMID: 30448367 Review.
Cited by
-
Automated machine learning model for fundus image classification by health-care professionals with no coding experience.Sci Rep. 2024 May 6;14(1):10395. doi: 10.1038/s41598-024-60807-y. Sci Rep. 2024. PMID: 38710726 Free PMC article.
-
Selecting the Right AI Algorithm for the Job: A Guide for Navigating the AI Jungle in Ophthalmology.Ophthalmol Ther. 2025 Aug;14(8):1637-1647. doi: 10.1007/s40123-025-01191-2. Epub 2025 Jul 2. Ophthalmol Ther. 2025. PMID: 40601203 Free PMC article. No abstract available.
-
External Validation of Deep Learning Models for Classifying Etiology of Retinal Hemorrhage Using Diverse Fundus Photography Datasets.Bioengineering (Basel). 2024 Dec 29;12(1):20. doi: 10.3390/bioengineering12010020. Bioengineering (Basel). 2024. PMID: 39851294 Free PMC article.
-
The retinal age gap: an affordable and highly accessible biomarker for population-wide disease screening across the globe.Proc Biol Sci. 2025 May;292(2046):20242233. doi: 10.1098/rspb.2024.2233. Epub 2025 May 7. Proc Biol Sci. 2025. PMID: 40328303
-
Assessment of Clinical Metadata on the Accuracy of Retinal Fundus Image Labels in Diabetic Retinopathy in Uganda: Case-Crossover Study Using the Multimodal Database of Retinal Images in Africa.JMIR Form Res. 2024 Sep 18;8:e59914. doi: 10.2196/59914. JMIR Form Res. 2024. PMID: 39293049 Free PMC article.
References
-
- Bhaskaranand M, Ramachandra C, Bhat S, Cuadros J, Nittala MG, Sadda SR, et al.. The value of automated diabetic retinopathy screening with the EyeArt system: A study of more than 100,000 consecutive encounters from people with diabetes. Diabetes Technol Ther. 2019;21: 635–643. doi: 10.1089/dia.2019.0164 - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources