Scorecard for synthetic medical data evaluation
- PMID: 40691520
- PMCID: PMC12280076
- DOI: 10.1038/s44172-025-00450-1
Scorecard for synthetic medical data evaluation
Abstract
Although the interest in synthetic medical data (SMD) for developing and testing artificial intelligence (AI) methods is growing, the absence of a comprehensive framework to evaluate the quality and applicability of SMD hinders its wider adoption. Here, we outline an evaluation framework designed to meet the unique requirements of medical applications. We also introduce SMD scorecard, a comprehensive report accompanying artificially generated datasets. This scorecard provides a quantitative assessment of SMD across seven criteria (7 Cs), complemented by a descriptive section that contains all relevant information about the dataset. The SMD scorecard provides a practical framework for evaluating and reporting the quality of synthetic data, which can benefit SMD developers and users.
Conflict of interest statement
Competing interests: The authors declare no competing interests.
Figures


References
-
- Sizikova, E. et al. Synthetic data in radiological imaging: Current state and future outlook. BJR∣ Artificial Intelligence ubae007 (2024).
-
- Borji, A. Pros and cons of gan evaluation measures: New developments. Computer Vis. Image Underst.215, 103329 (2022).
-
- Chang, Y. et al. A survey on evaluation of large language models. ACM Trans. Intell. Syst. Technol.15, 1–45 (2024).
-
- Dankar, F. K., Ibrahim, M. K. & Ismail, L. A multi-dimensional evaluation of synthetic data generators. IEEE Access10, 11147–11158 (2022).