Comparative Study
J Am Acad Dermatol. 2018 Feb;78(2):270-277.e1.
doi: 10.1016/j.jaad.2017.08.016. Epub 2017 Sep 29.

Results of the 2016 International Skin Imaging Collaboration International Symposium on Biomedical Imaging challenge: Comparison of the accuracy of computer algorithms to dermatologists for the diagnosis of melanoma from dermoscopic images


Michael A Marchetti et al. J Am Acad Dermatol. 2018 Feb.

Abstract

Background: Computer vision may aid in melanoma detection.

Objective: We sought to compare melanoma diagnostic accuracy of computer algorithms to dermatologists using dermoscopic images.

Methods: We conducted a cross-sectional study using 100 randomly selected dermoscopic images (50 melanomas, 44 nevi, and 6 lentigines) from an international computer vision melanoma challenge dataset (n = 379), along with individual algorithm results from 25 teams. We used 5 methods (nonlearned and machine learning) to combine individual automated predictions into "fusion" algorithms. In a companion study, 8 dermatologists classified the lesions in the 100 images as either benign or malignant.
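The "fusion" step described above combines the per-image probability scores of several individual algorithms into a single prediction. As a minimal illustration of the nonlearned variety, the sketch below averages scores across algorithms; the data and function name are hypothetical and not taken from the study.

```python
# Hypothetical sketch of a nonlearned "fusion" algorithm: averaging the
# per-image melanoma probability scores of several individual algorithms.
# Scores range from 0 (benign) to 1 (malignant), as in the study.

def mean_fusion(scores_per_algorithm):
    """Average per-image probability scores across algorithms.

    scores_per_algorithm: list of lists, one inner list of per-image
    scores per algorithm; all inner lists have the same length.
    """
    n_algorithms = len(scores_per_algorithm)
    n_images = len(scores_per_algorithm[0])
    return [
        sum(alg[i] for alg in scores_per_algorithm) / n_algorithms
        for i in range(n_images)
    ]

# Three toy algorithms scoring the same four images (illustrative data):
scores = [
    [0.9, 0.2, 0.6, 0.1],
    [0.8, 0.3, 0.7, 0.2],
    [0.7, 0.1, 0.8, 0.3],
]
fused = mean_fusion(scores)
# fused[0] = (0.9 + 0.8 + 0.7) / 3 = 0.8
```

The study's machine-learning fusion methods (e.g., the "Greedy" algorithm in Figure 1) would replace the simple mean with learned weights, but the input/output shape is the same.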

Results: The average sensitivity and specificity of dermatologists in classification were 82% and 59%, respectively. At 82% sensitivity, dermatologist specificity was similar to that of the top challenge algorithm (59% vs. 62%, P = .68) but lower than that of the best-performing fusion algorithm (59% vs. 76%, P = .02). The receiver operating characteristic area of the top fusion algorithm was greater than the mean receiver operating characteristic area of dermatologists (0.86 vs. 0.71, P = .001).
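The two metrics reported above can be made concrete. Sensitivity and specificity follow from thresholding the probability scores, and receiver operating characteristic area equals the probability that a randomly chosen melanoma scores higher than a randomly chosen benign lesion (the rank-based Mann-Whitney formulation). The sketch below uses toy labels and scores, not the study's data.

```python
# Illustrative computation (not the study's code) of the reported metrics:
# sensitivity/specificity at a score threshold, and ROC area via the
# rank-based (Mann-Whitney) formulation.

def sens_spec(y_true, scores, threshold):
    """Sensitivity and specificity when scores >= threshold are called malignant."""
    tp = sum(1 for y, s in zip(y_true, scores) if y == 1 and s >= threshold)
    fn = sum(1 for y, s in zip(y_true, scores) if y == 1 and s < threshold)
    tn = sum(1 for y, s in zip(y_true, scores) if y == 0 and s < threshold)
    fp = sum(1 for y, s in zip(y_true, scores) if y == 0 and s >= threshold)
    return tp / (tp + fn), tn / (tn + fp)

def roc_auc(y_true, scores):
    """Probability that a random positive outscores a random negative (ties count half)."""
    pos = [s for y, s in zip(y_true, scores) if y == 1]
    neg = [s for y, s in zip(y_true, scores) if y == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Toy example: three melanomas (label 1) and three benign lesions (label 0).
y = [1, 1, 1, 0, 0, 0]
s = [0.9, 0.8, 0.4, 0.5, 0.3, 0.2]
sens, spec = sens_spec(y, s, 0.5)
auc = roc_auc(y, s)
```

Comparing a dermatologist (a single sensitivity/specificity point) with an algorithm (a full curve) is done by fixing the algorithm's threshold to match the readers' 82% sensitivity and then comparing specificities, as in the results above.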

Limitations: The dataset lacked the full spectrum of skin lesions encountered in clinical practice, particularly banal lesions. Readers and algorithms were not provided clinical data (eg, age or lesion history/symptoms). Results obtained using our study design cannot be extrapolated to clinical practice.

Conclusion: Deep learning computer vision systems classified melanoma dermoscopy images with accuracy that exceeded some but not all dermatologists.

Keywords: International Skin Imaging Collaboration; International Symposium on Biomedical Imaging; computer algorithm; computer vision; dermatologist; machine learning; melanoma; reader study; skin cancer.

PubMed Disclaimer

Conflict of interest statement

Conflicts of interest: None declared.

Figures

Figure 1. Algorithm probability scores
Mean probability score for the top five algorithms and the best fusion algorithm (Greedy) by lesion diagnosis (i.e., benign nevi or lentigines, melanoma in situ, and invasive melanoma). Probability scores from computer algorithms ranged from 0 to 1, with scores closer to 0 indicating a greater probability of a benign diagnosis and scores closer to 1 indicating a greater probability of a malignant diagnosis. The upper and lower bounds of the boxed area represent the 25th and 75th percentiles, the line transecting the box is the median value, and whiskers indicate the 5th and 95th percentiles. Dots that fall outside the whiskers indicate extreme or outlier values.
Figure 2. Diagnostic accuracy of algorithms and dermatologists for melanoma on the 100-image dataset
Receiver operating characteristic curves demonstrating sensitivity and specificity for melanoma of (A) top five ranked individual algorithms and (B) five fusion algorithms, with melanoma classification and management performance of eight dermatologists indicated by small colored solid circles and triangles, respectively. Small colored solid circles and triangles of the same color indicate the performance of an individual dermatologist. The large transparent circle and triangle with black outline indicate the average diagnostic performance of dermatologists in classification and management, respectively.
