. 2024 Mar 1;160(3):303-311.

doi: 10.1001/jamadermatol.2023.5550.

Federated Learning for Decentralized Artificial Intelligence in Melanoma Diagnostics

Sarah Haggenmüller¹, Max Schmitt¹, Eva Krieghoff-Henning¹, Achim Hekler¹, Roman C Maron¹, Christoph Wies¹, Jochen S Utikal^{2

3

4}, Friedegund Meier⁵, Sarah Hobelsberger⁵, Frank F Gellrich⁵, Mildred Sergon⁵, Axel Hauschild⁶, Lars E French^{7

8}, Lucie Heinzerling^{7

9}, Justin G Schlager⁷, Kamran Ghoreschi¹⁰, Max Schlaak¹⁰, Franz J Hilke¹⁰, Gabriela Poch¹⁰, Sören Korsing¹⁰, Carola Berking⁹, Markus V Heppt⁹, Michael Erdmann⁹, Sebastian Haferkamp¹¹, Konstantin Drexler¹¹, Dirk Schadendorf¹², Wiebke Sondermann¹², Matthias Goebeler¹³, Bastian Schilling¹³, Jakob N Kather¹⁴, Stefan Fröhling¹⁵, Titus J Brinker¹

Affiliations

¹ Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany.
² Department of Dermatology, Venereology and Allergology, University Medical Center Mannheim, Ruprecht-Karls University of Heidelberg, Mannheim, Germany.
³ Skin Cancer Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany.
⁴ DKFZ Hector Cancer Institute at the University Medical Center Mannheim, Mannheim, Germany.
⁵ Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany.
⁶ Department of Dermatology, University Hospital (UKSH), Kiel, Germany.
⁷ Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany.
⁸ Dr Phillip Frost Department of Dermatology and Cutaneous Surgery, Miller School of Medicine, University of Miami, Miami, Florida.
⁹ Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen-European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany.
¹⁰ Department of Dermatology, Venereology and Allergology, Charité-Universitätsmedizin Berlin, Corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.
¹¹ Department of Dermatology, University Hospital Regensburg, Regensburg, Germany.
¹² Department of Dermatology, Venereology and Allergology, University Hospital Essen, Essen, Germany.
¹³ Department of Dermatology, Venereology and Allergology, University Hospital Würzburg and National Center for Tumor Diseases (NCT) WERA, Würzburg, Germany.
¹⁴ Else Kroener Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany.
¹⁵ Department of Translational Medical Oncology, National Center for Tumor Diseases (NCT) Heidelberg and German Cancer Research Center (DKFZ), Heidelberg, Germany.

PMID: 38324293
PMCID: PMC10851139
DOI: 10.1001/jamadermatol.2023.5550

Federated Learning for Decentralized Artificial Intelligence in Melanoma Diagnostics

Sarah Haggenmüller et al. JAMA Dermatol. 2024.

. 2024 Mar 1;160(3):303-311.

doi: 10.1001/jamadermatol.2023.5550.

Authors

Affiliations

¹ Digital Biomarkers for Oncology Group, National Center for Tumor Diseases (NCT), German Cancer Research Center (DKFZ), Heidelberg, Germany.
² Department of Dermatology, Venereology and Allergology, University Medical Center Mannheim, Ruprecht-Karls University of Heidelberg, Mannheim, Germany.
³ Skin Cancer Unit, German Cancer Research Center (DKFZ), Heidelberg, Germany.
⁴ DKFZ Hector Cancer Institute at the University Medical Center Mannheim, Mannheim, Germany.
⁵ Skin Cancer Center at the University Cancer Center and National Center for Tumor Diseases Dresden, Department of Dermatology, University Hospital Carl Gustav Carus, Technische Universität Dresden, Dresden, Germany.
⁶ Department of Dermatology, University Hospital (UKSH), Kiel, Germany.
⁷ Department of Dermatology and Allergy, University Hospital, LMU Munich, Munich, Germany.
⁸ Dr Phillip Frost Department of Dermatology and Cutaneous Surgery, Miller School of Medicine, University of Miami, Miami, Florida.
⁹ Department of Dermatology, University Hospital Erlangen, Comprehensive Cancer Center Erlangen-European Metropolitan Region Nürnberg, CCC Alliance WERA, Erlangen, Germany.
¹⁰ Department of Dermatology, Venereology and Allergology, Charité-Universitätsmedizin Berlin, Corporate member of Freie Universität Berlin and Humboldt-Universität zu Berlin, Berlin, Germany.
¹¹ Department of Dermatology, University Hospital Regensburg, Regensburg, Germany.
¹² Department of Dermatology, Venereology and Allergology, University Hospital Essen, Essen, Germany.
¹³ Department of Dermatology, Venereology and Allergology, University Hospital Würzburg and National Center for Tumor Diseases (NCT) WERA, Würzburg, Germany.
¹⁴ Else Kroener Fresenius Center for Digital Health, Technical University Dresden, Dresden, Germany.
¹⁵ Department of Translational Medical Oncology, National Center for Tumor Diseases (NCT) Heidelberg and German Cancer Research Center (DKFZ), Heidelberg, Germany.

PMID: 38324293
PMCID: PMC10851139
DOI: 10.1001/jamadermatol.2023.5550

Abstract

Importance: The development of artificial intelligence (AI)-based melanoma classifiers typically calls for large, centralized datasets, requiring hospitals to give away their patient data, which raises serious privacy concerns. To address this concern, decentralized federated learning has been proposed, where classifier development is distributed across hospitals.

Objective: To investigate whether a more privacy-preserving federated learning approach can achieve comparable diagnostic performance to a classical centralized (ie, single-model) and ensemble learning approach for AI-based melanoma diagnostics.

Design, setting, and participants: This multicentric, single-arm diagnostic study developed a federated model for melanoma-nevus classification using histopathological whole-slide images prospectively acquired at 6 German university hospitals between April 2021 and February 2023 and benchmarked it using both a holdout and an external test dataset. Data analysis was performed from February to April 2023.

Exposures: All whole-slide images were retrospectively analyzed by an AI-based classifier without influencing routine clinical care.

Main outcomes and measures: The area under the receiver operating characteristic curve (AUROC) served as the primary end point for evaluating the diagnostic performance. Secondary end points included balanced accuracy, sensitivity, and specificity.

Results: The study included 1025 whole-slide images of clinically melanoma-suspicious skin lesions from 923 patients, consisting of 388 histopathologically confirmed invasive melanomas and 637 nevi. The median (range) age at diagnosis was 58 (18-95) years for the training set, 57 (18-93) years for the holdout test dataset, and 61 (18-95) years for the external test dataset; the median (range) Breslow thickness was 0.70 (0.10-34.00) mm, 0.70 (0.20-14.40) mm, and 0.80 (0.30-20.00) mm, respectively. The federated approach (0.8579; 95% CI, 0.7693-0.9299) performed significantly worse than the classical centralized approach (0.9024; 95% CI, 0.8379-0.9565) in terms of AUROC on a holdout test dataset (pairwise Wilcoxon signed-rank, P < .001) but performed significantly better (0.9126; 95% CI, 0.8810-0.9412) than the classical centralized approach (0.9045; 95% CI, 0.8701-0.9331) on an external test dataset (pairwise Wilcoxon signed-rank, P < .001). Notably, the federated approach performed significantly worse than the ensemble approach on both the holdout (0.8867; 95% CI, 0.8103-0.9481) and external test dataset (0.9227; 95% CI, 0.8941-0.9479).

Conclusions and relevance: The findings of this diagnostic study suggest that federated learning is a viable approach for the binary classification of invasive melanomas and nevi on a clinically representative distributed dataset. Federated learning can improve privacy protection in AI-based melanoma diagnostics while simultaneously promoting collaboration across institutions and countries. Moreover, it may have the potential to be extended to other image classification tasks in digital cancer histopathology and beyond.

PubMed Disclaimer

Conflict of interest statement

Conflict of Interest Disclosures: Ms Haggenmüller reported grants from Federal Ministry of Health, Berlin, Germany (grants: Skin Classification Project 2 [SCP2] and Tumor Behavior Prediction Initiative [TPI]; grant holder in both cases: Titus J. Brinker, German Cancer Research Center, Heidelberg, Germany) during the conduct of the study. Dr Krieghoff-Henning reported grants from German Federal Ministry of Health during the conduct of the study. Mr Hekler reported grants from German Federal Ministry of Health during the conduct of the study. Mr Maron reported grants from German Federal Ministry of Health during the conduct of the study. Prof Utikal reported personal fees from Amgen, Bristol Myers Squibb, GSK, Immunocore, LEO Pharma, Merck Sharp & Dohme, Novartis, Pierre Fabre, Roche, and Sanofi outside the submitted work. Prof Meier reported grants from Novartis and Roche; other (travel support or/and speaker’s fees or/and advisor’s honoraria) from BMS, MSD, and Pierre Fabre outside the submitted work. Dr Hobelsberger reported clinical trial support from Almirall and speaker’s honoraria from Almirall, UCB, and AbbVie and travel support from UCB, Janssen Cilag, Almirall, Novartis, Lilly, LEO Pharma, and AbbVie outside the submitted work. Prof Heinzerling reported other (clinical studies) from BMS, MSD, Pierre Fabre, Replimune, and Sanofi; personal fees from Biomedx, BMS, MSD, Sun, Pierre Fabre, Novartis, and Sanofi; and grants from Therakos outside the submitted work. Dr Schlaak reported personal fees from BMS, Novartis, Immunocore, Sun Pharma, MSD, Recordati, and Sanofi Aventis outside the submitted work. Prof Berking reported grants from BMG Bundesministerium für Gesundheit to institute during the conduct of the study; personal fees from BMS, MSD, InflaRx, Novartis, Sanofi, LEO Pharma, Almirall Hermal, Pierre Fabre, Immunocore, and Delcath outside the submitted work. Dr Sondermann reported grants from Almirall and Medi GmbH; and personal fees from AbbVie, BMS, Boehringer Ingelheim, Celgene, Janssen, LEO Pharma, Lilly, Novartis, Pfizer, Sanofi Genzyme, and UCB outside the submitted work. Prof Goebeler reported grants from DKFZ Heidelberg during the conduct of the study; grants (clinical study) from Argenx, Novartis, Janssen, and Galderma; personal fees from Almirall (consulting), Janssen (advisory board, speaker), GSK (advisory board, speaker), and Lilly (speaker) outside the submitted work. Prof Kather reported personal fees from Owkin, Panakeia, DoMore Diagnostics, Histofy, Roche, MSD, BMS, Eisai, Bayer, Fresenius, and Pfizer outside the submitted work. Dr Brinker reported being owner of Smart Health Heidelberg GmbH outside the submitted work. No other disclosures were reported.

Figures

**Figure 1.. Flowchart of the Slide Inclusion Process**
Slides were excluded from the analysis if there was no histopathologically confirmed label available or if the lesion proved to be neither invasive melanoma (IM) nor nevus (in situ tumors or other diagnoses, eg, basal cell carcinoma, squamous cell carcinoma). In addition, slides that exhibited fewer than 50 epidermal patches or other technical issues were removed.

**Figure 2.. Mean Area Under the Receiver Operating Characteristic Curve (AUROC) of the 3 Investigated Approaches**
Mean AUROCs on the holdout and external test dataset after 1000 iterations of bootstrapping, including the corresponding 95% CIs (shaded areas), are illustrated for the federated learning (FL) and the centralized approach (model Hfull) (A and B) and for the FL and the ensemble approach (C and D). AUC indicates area under the curve.

See this image and copyright information in PMC

Comment in

The Promise and Drawbacks of Federated Learning for Dermatology AI.
Kose K, Rotemberg V. Kose K, et al. JAMA Dermatol. 2024 Mar 1;160(3):269-270. doi: 10.1001/jamadermatol.2023.5410. JAMA Dermatol. 2024. PMID: 38324308 No abstract available.

References

1. McKinney SM, Sieniek M, Godbole V, et al. International evaluation of an AI system for breast cancer screening. Nature. 2020;577(7788):89-94. doi: 10.1038/s41586-019-1799-6 - DOI - PubMed
1. Bulten W, Kartasalo K, Chen PC, et al. ; PANDA challenge consortium . Artificial intelligence for diagnosis and Gleason grading of prostate cancer: the PANDA challenge. Nat Med. 2022;28(1):154-163. doi: 10.1038/s41591-021-01620-2 - DOI - PMC - PubMed
1. Mei X, Lee HC, Diao KY, et al. Artificial intelligence-enabled rapid diagnosis of patients with COVID-19. Nat Med. 2020;26(8):1224-1228. doi: 10.1038/s41591-020-0931-3 - DOI - PMC - PubMed
1. Esteva A, Kuprel B, Novoa RA, et al. Dermatologist-level classification of skin cancer with deep neural networks. Nature. 2017;542(7639):115-118. doi: 10.1038/nature21056 - DOI - PMC - PubMed
1. Haggenmüller S, Maron RC, Hekler A, et al. Skin cancer classification via convolutional neural networks: systematic review of studies involving human experts. Eur J Cancer. 2021;156:202-216. doi: 10.1016/j.ejca.2021.06.049 - DOI - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Medical
- MedlinePlus Health Information
Miscellaneous
- NCI CPTAC Assay Portal

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Federated Learning for Decentralized Artificial Intelligence in Melanoma Diagnostics

Affiliations

Federated Learning for Decentralized Artificial Intelligence in Melanoma Diagnostics

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Comment in

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Medical

Miscellaneous