Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Oct 1:28:386-404.
doi: 10.1016/j.csbj.2025.09.041. eCollection 2025.

AI Model Passport: Data and system traceability framework for transparent AI in health

Affiliations

AI Model Passport: Data and system traceability framework for transparent AI in health

Varvara Kalokyri et al. Comput Struct Biotechnol J. .

Abstract

The increasing integration of Artificial Intelligence (AI) into health and biomedical systems necessitates robust frameworks for transparency, accountability, and ethical compliance. Existing frameworks often rely on human-readable, manual documentation which limits scalability, comparability, and machine interpretability across projects and platforms. They also fail to provide a unique, verifiable identity for AI models to ensure their provenance and authenticity across systems and use cases, limiting reproducibility and stakeholder trust. This paper introduces the concept of the AI Model Passport, a structured and standardized documentation framework that acts as a digital identity and verification tool for AI models. It captures essential metadata to uniquely identify, verify, trace and monitor AI models across their lifecycle - from data acquisition and preprocessing to model design, development and deployment. In addition, an implementation of this framework is presented through AIPassport, an MLOps tool developed within the ProCAncer-I EU project for medical imaging applications. AIPassport automates metadata collection, ensures proper versioning, decouples results from source scripts, and integrates with various development environments. Its effectiveness is showcased through a lesion segmentation use case using data from the ProCAncer-I dataset, illustrating how the AI Model Passport enhances transparency, reproducibility, and regulatory readiness while reducing manual effort. This approach aims to set a new standard for fostering trust and accountability in AI-driven healthcare solutions, aspiring to serve as the basis for developing transparent and regulation compliant AI systems across domains.

Keywords: AI; F.U.T.U.R.E. AI; FAIR; MLOps; Medical Imaging; Ontologies; Reproducibility; Traceability; Transparency.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper

Figures

Fig. 1
Fig. 1
AI model development lifecycle in the health domain.
Fig. 2
Fig. 2
Class diagram representing the semantic structure of entities, activities, and agents involved in the AI data collection pipeline (on the left in purple) and data curation pipeline (on the right in green), extending PROV-O with domain-specific concepts.
Fig. 3
Fig. 3
Workflow of the dataset specification step and use in the AI Model Passport framework.
Fig. 4
Fig. 4
AI training and evaluation workflow provenance data representation.
Fig. 5
Fig. 5
AIPassport infrastructure overview illustrating the integration of MLflow, DVC, Git, and MINIO to support versioning, experiment tracking, and metadata capture across the AI model development lifecycle.
Fig. 6
Fig. 6
(a) Instance-level provenance graph capturing the data collection lifecycle for a patient case. The graph models the transformation of a raw patient record into an anonymized dataset and ultimately into a structured patient case, aligned with PROV-O domain-specific clinical and imaging metadata. (b) A simplified pipeline-style view of the data collection provenance graph for non-technical readers.
Fig. 7
Fig. 7
(a) Instance graph of an image segmentation task, showing the input image series, segmentation activity, involved agents and tools, and the generated segmentation mask with its provenance. (b) A simplified pipeline-style view of the data curation provenance graph for non-technical readers.
Fig. 8
Fig. 8
ProCAncer-I DCAT-AP extension for the prostate mpMR imaging datasets based on HealthDCATAP.
Fig. 9
Fig. 9
(a) An excerpt of the information logged on each AI data preprocessing step. (b) A simplified pipeline-style view of the data preprocessing provenance graph for non-technical readers.
Fig. 10
Fig. 10
(a) Instance-level provenance of a training workflow, showing hyperparameters used, evaluation measures, and the resulting model, compliant with PROV-O and MLS. (b) A simplified pipeline-style view of the training workflow for non-technical readers.
Fig. 11
Fig. 11
AI model passport marketplace.

References

    1. World Health Organization, Ethics and governance of artificial intelligence for health: Who guidance, accessed: 2025-06-18 (2021). URL 〈https://www.who.int/publications/i/item/9789240029200〉.
    1. Gille F., Jobin A., Ienca M. What we talk about when we talk about trust: theory of trust for ai in healthcare. IntellBased Med. 2020;1
    1. Caspers J. Translation of predictive modeling and ai into clinics: a question of trust. Eur Radiol. 2021;31(7):4947–4948. doi: 10.1007/s00330-021-07977-9. URL https://doi.org/10.1007/s00330-021-07977-9. - DOI - PMC - PubMed
    1. EU High Level Expert Group, Ethics guidelines for trustworthy ai, 〈https://tinyurl〉. com/4tej3t38, accessed: 2023-10-23 (2019).
    1. European Commission, Proposal for a regulation of the european parliament and of the council laying down harmonised rules on artificial intelligence (artificial intelligence act), com/2021/206, 〈https://tinyurl.com/4aa9d6e7〉, accessed: 2023-10-23 (2021).

LinkOut - more resources