AI Model Passport: Data and system traceability framework for transparent AI in health

Affiliations

¹ Institute of Computer Science, Foundation for Research and Technology Hellas (FORTH), Heraklion, Greece.
² Biomedical Research Institute, Foundation for Research and Technology Hellas (FORTH), Ioannina, Greece.
³ Institute of Information Science and Technologies (ISTI) National Research Council (CNR), Pisa, Italy.
⁴ Radiology Unit, Candiolo Cancer Institute, FPO-IRCCS, Candiolo, TO, Italy.
⁵ Computational Clinical Imaging Group, Champalimaud Foundation, Lisbon, Portugal.
⁶ Electrical and Computer Engineering, Hellenic Mediterranean University, Heraklion, Greece.
⁷ Unit of Medical Technology and Intelligent Information Systems, University of Ioannina, Ioannina, Greece.

PMID: 41113334
PMCID: PMC12528916
DOI: 10.1016/j.csbj.2025.09.041

Varvara Kalokyri et al. Comput Struct Biotechnol J. 2025 Oct 1;28:386-404. doi: 10.1016/j.csbj.2025.09.041. eCollection 2025.

Abstract

The increasing integration of Artificial Intelligence (AI) into health and biomedical systems necessitates robust frameworks for transparency, accountability, and ethical compliance. Existing frameworks often rely on human-readable, manual documentation, which limits scalability, comparability, and machine interpretability across projects and platforms. They also fail to provide a unique, verifiable identity for AI models that would establish their provenance and authenticity across systems and use cases, which in turn limits reproducibility and stakeholder trust. This paper introduces the concept of the AI Model Passport, a structured and standardized documentation framework that acts as a digital identity and verification tool for AI models. It captures the essential metadata needed to uniquely identify, verify, trace, and monitor AI models across their lifecycle, from data acquisition and preprocessing to model design, development, and deployment. In addition, an implementation of this framework is presented through AIPassport, an MLOps tool developed within the ProCAncer-I EU project for medical imaging applications. AIPassport automates metadata collection, ensures proper versioning, decouples results from source scripts, and integrates with various development environments. Its effectiveness is showcased through a lesion segmentation use case using data from the ProCAncer-I dataset, illustrating how the AI Model Passport enhances transparency, reproducibility, and regulatory readiness while reducing manual effort. This approach aims to set a new standard for fostering trust and accountability in AI-driven healthcare solutions, and aspires to serve as the basis for developing transparent and regulation-compliant AI systems across domains.

Keywords: AI; F.U.T.U.R.E. AI; FAIR; MLOps; Medical Imaging; Ontologies; Reproducibility; Traceability; Transparency.
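
The idea of a "unique, verifiable identity" built from lifecycle metadata can be illustrated with a minimal Python sketch. This is not the AIPassport schema or API: the `ModelPassport` class, its field names, the `verify` helper, and all values below are hypothetical placeholders chosen for illustration. The sketch assumes that an identifier can be derived by hashing a canonical serialization of the recorded metadata, so that any later change to the record yields a different identifier.

```python
import hashlib
import json
from dataclasses import asdict, dataclass, field, replace


def sha256_hex(data: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as a hex string."""
    return hashlib.sha256(data).hexdigest()


@dataclass(frozen=True)
class ModelPassport:
    """Hypothetical passport record; field names are illustrative only."""
    model_name: str
    model_version: str
    dataset_digest: str   # digest of the training data snapshot
    code_revision: str    # e.g. a version-control commit identifier
    hyperparameters: dict = field(default_factory=dict)
    metrics: dict = field(default_factory=dict)

    def canonical_json(self) -> str:
        # Sorted keys and fixed separators make the serialization
        # deterministic, so equal records always hash to the same value.
        return json.dumps(asdict(self), sort_keys=True, separators=(",", ":"))

    def passport_id(self) -> str:
        # The identifier is derived from the metadata itself.
        return sha256_hex(self.canonical_json().encode("utf-8"))


def verify(passport: ModelPassport, expected_id: str) -> bool:
    """Check that a passport record still matches a previously issued ID."""
    return passport.passport_id() == expected_id


# Placeholder values only; none of these come from the paper.
passport = ModelPassport(
    model_name="lesion-segmentation-demo",
    model_version="0.1.0",
    dataset_digest=sha256_hex(b"placeholder dataset bytes"),
    code_revision="hypothetical-commit-id",
    hyperparameters={"learning_rate": 0.001, "epochs": 10},
    metrics={"placeholder_metric": 0.0},
)
issued_id = passport.passport_id()

# Changing any recorded field produces a record that no longer verifies.
tampered = replace(passport, model_version="0.2.0")
print(verify(passport, issued_id), verify(tampered, issued_id))
```

Deriving the identifier from the content keeps verification independent of any central registry in this sketch; a production system would likely also sign the record and link it to versioned artifacts.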

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Figures

**Fig. 1**
AI model development lifecycle in the health domain.

**Fig. 2**
Class diagram representing the semantic structure of entities, activities, and agents involved in the AI data collection pipeline (on the left in purple) and data curation pipeline (on the right in green), extending PROV-O with domain-specific concepts.

**Fig. 3**
Workflow of the dataset specification step and use in the AI Model Passport framework.

**Fig. 4**
AI training and evaluation workflow provenance data representation.

**Fig. 5**
AIPassport infrastructure overview illustrating the integration of MLflow, DVC, Git, and MinIO to support versioning, experiment tracking, and metadata capture across the AI model development lifecycle.

**Fig. 6**
(a) Instance-level provenance graph capturing the data collection lifecycle for a patient case. The graph models the transformation of a raw patient record into an anonymized dataset and ultimately into a structured patient case, aligned with PROV-O and domain-specific clinical and imaging metadata. (b) A simplified pipeline-style view of the data collection provenance graph for non-technical readers.

**Fig. 7**
(a) Instance graph of an image segmentation task, showing the input image series, segmentation activity, involved agents and tools, and the generated segmentation mask with its provenance. (b) A simplified pipeline-style view of the data curation provenance graph for non-technical readers.

**Fig. 8**
ProCAncer-I DCAT-AP extension for the prostate mpMR imaging datasets based on HealthDCATAP.

**Fig. 9**
(a) An excerpt of the information logged on each AI data preprocessing step. (b) A simplified pipeline-style view of the data preprocessing provenance graph for non-technical readers.

**Fig. 10**
(a) Instance-level provenance of a training workflow, showing hyperparameters used, evaluation measures, and the resulting model, compliant with PROV-O and MLS. (b) A simplified pipeline-style view of the training workflow for non-technical readers.

**Fig. 11**
AI model passport marketplace.
