Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Sep 27;6(1):180.
doi: 10.1038/s41746-023-00914-8.

Systematic review of deep learning image analyses for the diagnosis and monitoring of skin disease

Affiliations
Review

Systematic review of deep learning image analyses for the diagnosis and monitoring of skin disease

Shern Ping Choy et al. NPJ Digit Med. .

Abstract

Skin diseases affect one-third of the global population, posing a major healthcare burden. Deep learning may optimise healthcare workflows through processing skin images via neural networks to make predictions. A focus of deep learning research is skin lesion triage to detect cancer, but this may not translate to the wider scope of >2000 other skin diseases. We searched for studies applying deep learning to skin images, excluding benign/malignant lesions (1/1/2000-23/6/2022, PROSPERO CRD42022309935). The primary outcome was accuracy of deep learning algorithms in disease diagnosis or severity assessment. We modified QUADAS-2 for quality assessment. Of 13,857 references identified, 64 were included. The most studied diseases were acne, psoriasis, eczema, rosacea, vitiligo, urticaria. Deep learning algorithms had high specificity and variable sensitivity in diagnosing these conditions. Accuracy of algorithms in diagnosing acne (median 94%, IQR 86-98; n = 11), rosacea (94%, 90-97; n = 4), eczema (93%, 90-99; n = 9) and psoriasis (89%, 78-92; n = 8) was high. Accuracy for grading severity was highest for psoriasis (range 93-100%, n = 2), eczema (88%, n = 1), and acne (67-86%, n = 4). However, 59 (92%) studies had high risk-of-bias judgements and 62 (97%) had high-level applicability concerns. Only 12 (19%) reported participant ethnicity/skin type. Twenty-four (37.5%) evaluated the algorithm in an independent dataset, clinical setting or prospectively. These data indicate potential of deep learning image analysis in diagnosing and monitoring common skin diseases. Current research has important methodological/reporting limitations. Real-world, prospectively-acquired image datasets with external validation/testing will advance deep learning beyond the current experimental phase towards clinically-useful tools to mitigate rising health and cost impacts of skin disease.

PubMed Disclaimer

Conflict of interest statement

J.N.W.N.B. has attended advisory boards and/or spoken at sponsored symposia and/or received research funding from: AbbVie, Almirall, Amgen, Boehringer-Ingelheim, Bristol Myers Squibb, Celgene, Janssen, Leo, Lilly, Novartis, Samsung, Sun Pharma. C.H.S. reports departmental research funding as investigator in EU-IMI consortia involving multiple industry partners (see biomap-imi.eu and hippocrates-imi.eu for details). S.K.M. reports departmental income from Abbvie, Almirall, Eli Lilly, Leo, Novartis, Sanofi, UCB, outside the submitted work. There were no conflicts of interest reported by the remaining authors.

Figures

Fig. 1
Fig. 1. PRISMA flowchart of study records.
PRISMA flowchart showing the study selection process.
Fig. 2
Fig. 2. Summary of quality assessment results of all studies (using modified QUADAS-2).
Quality assessment of all included studies.
Fig. 3
Fig. 3. Type of deep learning algorithms included in the systematic review, by year of publication.
The number of different types of deep learning (DL) algorithms are presented by year of publication of the studies. As five studies used multiple DL algorithms, the total number of algorithms sum to 69 across the 64 included studies.

References

    1. Esteva A, et al. A guide to deep learning in healthcare. Nat. Med. 2019;25:24–29. doi: 10.1038/s41591-018-0316-z. - DOI - PubMed
    1. Du-Harpur X, Watt FM, Luscombe NM, Lynch MD. What is AI? Applications of artificial intelligence to dermatology. Br. J. Dermatol. 2020;183:423–30.. doi: 10.1111/bjd.18880. - DOI - PMC - PubMed
    1. LeCun Y, Bengio Y, Hinton G. Deep learning. Nature. 2015;521:436–444. doi: 10.1038/nature14539. - DOI - PubMed
    1. Brownlee J. What is the Difference Between Test and Validation Datasets? https://machinelearningmastery.com/difference-test-validation-datasets/ (2023).
    1. Nagendran M, et al. Artificial intelligence versus clinicians: systematic review of design, reporting standards, and claims of deep learning studies. BMJ. 2020;368:m689. doi: 10.1136/bmj.m689. - DOI - PMC - PubMed