Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Nov 1;155(11):1291-1299.
doi: 10.1001/jamadermatol.2019.1375.

Accuracy of Computer-Aided Diagnosis of Melanoma: A Meta-analysis

Affiliations

Accuracy of Computer-Aided Diagnosis of Melanoma: A Meta-analysis

Vincent Dick et al. JAMA Dermatol. .

Abstract

Importance: The recent advances in the field of machine learning have raised expectations that computer-aided diagnosis will become the standard for the diagnosis of melanoma.

Objective: To critically review the current literature and compare the diagnostic accuracy of computer-aided diagnosis with that of human experts.

Data sources: The MEDLINE, arXiv, and PubMed Central databases were searched to identify eligible studies published between January 1, 2002, and December 31, 2018.

Study selection: Studies that reported on the accuracy of automated systems for melanoma were selected. Search terms included melanoma, diagnosis, detection, computer aided, and artificial intelligence.

Data extraction and synthesis: Evaluation of the risk of bias was performed using the QUADAS-2 tool, and quality assessment was based on predefined criteria. Data were analyzed from February 1 to March 10, 2019.

Main outcomes and measures: Summary estimates of sensitivity and specificity and summary receiver operating characteristic curves were the primary outcomes.

Results: The literature search yielded 1694 potentially eligible studies, of which 132 were included and 70 offered sufficient information for a quantitative analysis. Most studies came from the field of computer science. Prospective clinical studies were rare. Combining the results for automated systems gave a melanoma sensitivity of 0.74 (95% CI, 0.66-0.80) and a specificity of 0.84 (95% CI, 0.79-0.88). Sensitivity was lower in studies that used independent test sets than in those that did not (0.51; 95% CI, 0.34-0.69 vs 0.82; 95% CI, 0.77-0.86; P < .001); however, the specificity was similar (0.83; 95% CI, 0.71-0.91 vs 0.85; 95% CI, 0.80-0.88; P = .67). In comparison with dermatologists' diagnosis, computer-aided diagnosis showed similar sensitivities and a 10 percentage points lower specificity, but the difference was not statistically significant. Studies were heterogeneous and substantial risk of bias was found in all but 4 of the 70 studies included in the quantitative analysis.

Conclusions and relevance: Although the accuracy of computer-aided diagnosis for melanoma detection is comparable to that of experts, the real-world applicability of these systems is unknown and potentially limited owing to overfitting and the risk of bias of the studies at hand.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Dr Kittler reported nonfinancial support from Fotofinder and nonfinancial support from Derma Medical Systems outside the submitted work. Dr Tschandl reported grants from MetaOptima Technology Inc outside the submitted work. No other disclosures were reported.

Figures

Figure 1.
Figure 1.. Study Selection Process
Selection of studies according to inclusion and exclusion criteria at different stages of the meta-analysis.
Figure 2.
Figure 2.. Sensitivity and Specificity of 55 of 70 Included Studies
FN indicates false negative; FP, false positive; TN, true negative; and TP, true positive.
Figure 3.
Figure 3.. Sensitivity and Specificity of 15 of 70 Included Studies
FN indicates false negative; FP, false positive; TN, true negative; and TP, true positive.
Figure 4.
Figure 4.. Summary Receiver Operating Characteristic (ROC) Curves
Bivariate summary ROC curves comparing computer-aided diagnosis (CAD) and dermatologists for the detection of melanoma vs benign lesions in studies when both methods are available (A). Bivariate summary ROC curves comparing studies on automated systems for the detection of melanoma vs benign lesions using independent and nonindependent test sets (B), different CAD methods (C), and public or proprietary test data sets (D).

Similar articles

Cited by

References

    1. Rosado B, Menzies S, Harbauer A, et al. . Accuracy of computer diagnosis of melanoma: a quantitative meta-analysis. Arch Dermatol. 2003;139(3):361-367. doi:10.1001/archderm.139.3.361 - DOI - PubMed
    1. Dreiseitl S, Binder M, Hable K, Kittler H. Computer versus human diagnosis of melanoma: evaluation of the feasibility of an automated diagnostic system in a prospective clinical trial. Melanoma Res. 2009;19(3):180-184. doi:10.1097/CMR.0b013e32832a1e41 - DOI - PubMed
    1. Krizhevsky A, Sutskever I, Hinton GE. ImageNet classification with deep convolutional neural networks. Adv Neural Inf Process Syst. 2012;25(2):1097-1105.
    1. Esteva A, Kuprel B, Novoa RA, et al. . Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;542(7639):115-118. doi:10.1038/nature21056 - DOI - PMC - PubMed
    1. Whiting PF, Rutjes AW, Westwood ME, et al. ; QUADAS-2 Group . QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 2011;155(8):529-536. doi:10.7326/0003-4819-155-8-201110180-00009 - DOI - PubMed