Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Jun 17;11(1):641.
doi: 10.1038/s41597-024-03387-w.

BCN20000: Dermoscopic Lesions in the Wild

Affiliations

BCN20000: Dermoscopic Lesions in the Wild

Carlos Hernández-Pérez et al. Sci Data. .

Abstract

Advancements in dermatological artificial intelligence research require high-quality and comprehensive datasets that mirror real-world clinical scenarios. We introduce a collection of 18,946 dermoscopic images spanning from 2010 to 2016, collated at the Hospital Clínic in Barcelona, Spain. The BCN20000 dataset aims to address the problem of unconstrained classification of dermoscopic images of skin cancer, including lesions in hard-to-diagnose locations such as those found in nails and mucosa, large lesions which do not fit in the aperture of the dermoscopy device, and hypo-pigmented lesions. Our dataset covers eight key diagnostic categories in dermoscopy, providing a diverse range of lesions for artificial intelligence model training. Furthermore, a ninth out-of-distribution (OOD) class is also present on the test set, comprised of lesions which could not be distinctively classified as any of the others. By providing a comprehensive collection of varied images, BCN20000 helps bridge the gap between the training data for machine learning models and the day-to-day practice of medical practitioners. Additionally, we present a set of baseline classifiers based on state-of-the-art neural networks, which can be extended by other researchers for further experimentation.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Fig. 1
Fig. 1
Dermoscopic images showcasing diagnostic challenges in BCN20000: (a) Lesion on a nail, (d) Lesion on mucosal tissue. (b) and (e) Lesions too extensive for the dermoscopy device aperture, illustrating size-related diagnostic obstacles. (c) and (f) present hypopigmented lesions.
Fig. 2
Fig. 2
Samples from the BCN20000 dataset corresponding to (a) nevus, (b) melanoma, (c) basal cell carcinoma, (d) solar lentigo/ seborrheic keratosis, (e) actinic keratosis, (f) squamous cell carcinoma, (g) dermatofibroma and (h) vascular lesion.
Fig. 3
Fig. 3
BCN20000 Dataset Preparation Pipeline. The process begins with the collection of images and metadata. A neural network is then employed to classify and separate between the image types. Patient identifiers are extracted from ‘Sticker pictures’ using a YOLOv3 network. Dermatoscopy’s diagnosis are revised by multiple reviewers for quality assurance. The resultant BCN20000 dataset is composed exclusively of dermoscopic images and metadata, divided into training and testing sets.
Fig. 4
Fig. 4
Example of the sticker detection algorithm. Left: original image (blurred for privacy), right: detection of the YOLOv3 Architecture.
Fig. 5
Fig. 5
Comparison between original (a,b,c) and processed dermoscopic images (d,e,f) using the proposed image cropping algorithm.

References

    1. Tripp MK, Watson M, Balk SJ, Swetter SM, Gershenwald JE. State of the science on prevention and screening to reduce melanoma incidence and mortality: The time is now. CA: a cancer journal for clinicians. 2016;66:460–480. - PMC - PubMed
    1. Bleyer A, Viny A, Barr R. Cancer in 15-to 29-year-olds by primary site. The oncologist. 2006;11:590–601. doi: 10.1634/theoncologist.11-6-590. - DOI - PubMed
    1. Hanly P, Soerjomataram I, Sharp L. Measuring the societal burden of cancer: The cost of lost productivity due to premature cancer-related mortality in e urope. International journal of cancer. 2015;136:E136–E145. doi: 10.1002/ijc.29105. - DOI - PubMed
    1. Argenziano G, et al. Dermoscopy of pigmented skin lesions: results of a consensus meeting via the internet. Journal of the American Academy of Dermatology. 2003;48:679–693. doi: 10.1067/mjd.2003.281. - DOI - PubMed
    1. Kittler H, Pehamberger H, Wolff K, Binder M. Diagnostic accuracy of dermoscopy. The lancet oncology. 2002;3:159–165. doi: 10.1016/S1470-2045(02)00679-4. - DOI - PubMed