AAPS J. 2025 Dec 5;28(1):28.
doi: 10.1208/s12248-025-01168-w.

Improvements in Data Quality Can Boost Efficiency and Reduce Development Costs: A Pharmacometric CRO's Perspective

Amparo de la Peña et al. AAPS J.

Abstract

Drug development can take up to 15 years and cost as much as $11 billion (USD), and it relies heavily on high-quality data. The goal of this investigation of contract research organizations (CROs) was to assess the impact of data management activities (such as curation, quality assessment, and integration) on model-informed drug development (MIDD) deliverables. A survey was sent to a diverse sample of CROs to evaluate their baseline experience with assessing the quality of sponsor-provided data and the time required to create analysis-ready datasets. It was distributed to 44 colleagues from 32 companies offering pharmacometrics services, including data management. The survey included 11 questions: 9 were multiple choice and 2 were open-ended. Responses were gathered anonymously to ensure confidentiality and intellectual property protection and were later shared with all participants. Of the 17 survey respondents, most develop data specifications and create analysis-ready datasets. The majority (65%) said the data they received from sponsors was rarely (< 10%) immediately usable because of improper formatting and quality issues such as missing data and inconsistencies. Over 50% cited lack of definition/specifications as the primary reason. Cleaning client data takes CROs 3 to 24 h per dataset; assuming an average programming cost of $250/hour, this costs between $750 and $6000 per dataset. Significant time is therefore spent on rectifying poor-quality data. Automated data quality assessments can make quality checks more efficient, though automation alone cannot resolve all quality issues. Better communication, collaboration, and systematic approaches to data quality issues, including automation and AI, are essential to further improve data quality.
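
The per-dataset cost range quoted in the abstract follows directly from the assumed hourly rate and the reported range of cleaning times. A minimal sketch of that arithmetic is given below; the function names and the 10-dataset project are hypothetical illustrations, not figures or methods from the survey.

```python
# Sketch of the cost arithmetic in the abstract. The hourly rate and the
# 3 to 24 h range come from the abstract; everything else is illustrative.

HOURLY_RATE_USD = 250          # average programming cost assumed in the abstract
MIN_HOURS, MAX_HOURS = 3, 24   # reported range of cleaning time per dataset


def cleaning_cost(hours, rate=HOURLY_RATE_USD):
    """Return the cost in USD of cleaning one dataset for `hours` hours."""
    return hours * rate


def project_cost_range(n_datasets, rate=HOURLY_RATE_USD):
    """Return (low, high) cleaning cost in USD for a project.

    Hypothetical helper: assumes every dataset falls within the
    3 to 24 h range reported in the abstract.
    """
    low = n_datasets * cleaning_cost(MIN_HOURS, rate)
    high = n_datasets * cleaning_cost(MAX_HOURS, rate)
    return low, high


low = cleaning_cost(MIN_HOURS)
high = cleaning_cost(MAX_HOURS)
print(f"Per-dataset cleaning cost: ${low:,} to ${high:,}")

# Hypothetical project with 10 sponsor-provided datasets.
project_low, project_high = project_cost_range(10)
print(f"10-dataset project: ${project_low:,} to ${project_high:,}")
```

Under these assumptions the per-dataset figures reproduce the $750 and $6000 bounds stated in the abstract; the project-level range is only as good as the assumption that each dataset falls within the reported time range.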

Keywords: AI; CROs; data quality; model-informed drug development; pharmacometrics.

Conflict of interest statement

Declarations. Conflict of interest: The authors have no conflicts of interest to declare related to this research. JFK, AdlP, and RLH are employees and minor stockholders of Simulations Plus, Inc. JSB is an employee of Aridhia Bioinformatics.

