Metadata for Data dIscoverability aNd Study rEplicability in obseRVAtional Studies (MINERVA): Development and Pilot of a Metadata List and Catalogue in Europe
- PMID: 39145406
- DOI: 10.1002/pds.5871
Metadata for Data dIscoverability aNd Study rEplicability in obseRVAtional Studies (MINERVA): Development and Pilot of a Metadata List and Catalogue in Europe
Abstract
Purpose: Metadata for data dIscoverability aNd study rEplicability in obseRVAtional studies (MINERVA), a European Medicines Agency-funded project (EUPAS39322), defined a set of metadata to describe real-world data sources (RWDSs) and piloted metadata collection in a prototype catalogue to assist investigators from data source discoverability through study conduct.
Methods: A list of metadata was created from a review of existing metadata catalogues and recommendations, structured interviews, a stakeholder survey, and a technical workshop. The prototype was designed to comply with the FAIR principles (findable, accessible, interoperable, reusable), using MOLGENIS software. Metadata collection was piloted by 15 data access partners (DAPs) from across Europe.
Results: A total of 442 metadata variables were defined in six domains: institutions (organizations connected to a data source); data banks (data collections sustained by an organization); data sources (collections of linkable data banks covering a common underlying population); studies; networks (of institutions); and common data models (CDMs). A total of 26 institutions were recorded in the prototype. Each DAP populated the metadata of one data source and its selected data banks. The number of data banks varied by data source; the most common data banks were hospital administrative records and pharmacy dispensation records (10 data sources each). Quantitative metadata were successfully extracted from three data sources conforming to different CDMs and entered into the prototype.
Conclusions: A metadata list was finalized, a prototype was successfully populated, and a good practice guide was developed. Setting up and maintaining a metadata catalogue on RWDSs will require substantial effort to support discoverability of data sources and reproducibility of studies in Europe.
Keywords: FAIR; catalogue; metadata; observational studies; real‐world data sources; reproducibility.
© 2024 The Author(s). Pharmacoepidemiology and Drug Safety published by John Wiley & Sons Ltd.
References
-
- X. Kurz, “Darwin EU—First Experience and Regulatory Use Cases,” Data Analytics and Methods Taskforce, European Medicines Agency. EMA/HMA Big Data Stakeholder Forum 2022, accessed December 15, 2022, https://www.ema.europa.eu/en/documents/presentation/session‐2‐1‐darwin‐e....
-
- M. Wilkinson, M. Dumontier, I. Aalbersberg, et al., “The FAIR Guiding Principles for Scientific Data Management and Stewardship,” Scientific Data 15, no. 3 (2016): 160018, https://doi.org/10.1038/sdata.2016.18.
-
- Heads of Medicines Agencies, European Medicines Agency, “HMA–EMA Joint Big Data Taskforce Phase II Report: Evolving Data‐Driven Regulation,” accessed October 11, 2022, https://www.ema.europa.eu/en/documents/other/hma‐ema‐joint‐big‐data‐task....
-
- Heads of Medicines Agencies, European Medicines Agency, “Priority Recommendations of the HMA–EMA Joint Big Data Task Force,” HMA–EMA Big Data Steering Committee Group, accessed January 9, 2023, https://www.ema.europa.eu/en/documents/other/priority‐recommendations‐hm....
-
- Heads of Medicines Agencies, European Medicines Agency, “Big Data Steering Group,” accessed January 9, 2023, https://www.ema.europa.eu/system/files/documents/work‐programme/workplan....
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources