Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;7(3):e33677.
doi: 10.1371/journal.pone.0033677. Epub 2012 Mar 16.

The database for aggregate analysis of ClinicalTrials.gov (AACT) and subsequent regrouping by clinical specialty

Affiliations

The database for aggregate analysis of ClinicalTrials.gov (AACT) and subsequent regrouping by clinical specialty

Asba Tasneem et al. PLoS One. 2012.

Abstract

Background: The ClinicalTrials.gov registry provides information regarding characteristics of past, current, and planned clinical studies to patients, clinicians, and researchers; in addition, registry data are available for bulk download. However, issues related to data structure, nomenclature, and changes in data collection over time present challenges to the aggregate analysis and interpretation of these data in general and to the analysis of trials according to clinical specialty in particular. Improving usability of these data could enhance the utility of ClinicalTrials.gov as a research resource.

Methods/principal results: The purpose of our project was twofold. First, we sought to extend the usability of ClinicalTrials.gov for research purposes by developing a database for aggregate analysis of ClinicalTrials.gov (AACT) that contains data from the 96,346 clinical trials registered as of September 27, 2010. Second, we developed and validated a methodology for annotating studies by clinical specialty, using a custom taxonomy employing Medical Subject Heading (MeSH) terms applied by an NLM algorithm, as well as MeSH terms and other disease condition terms provided by study sponsors. Clinical specialists reviewed and annotated MeSH and non-MeSH disease condition terms, and an algorithm was created to classify studies into clinical specialties based on both MeSH and non-MeSH annotations. False positives and false negatives were evaluated by comparing algorithmic classification with manual classification for three specialties.

Conclusions/significance: The resulting AACT database features study design attributes parsed into discrete fields, integrated metadata, and an integrated MeSH thesaurus, and is available for download as Oracle extracts (.dmp file and text format). This publicly-accessible dataset will facilitate analysis of studies and permit detailed characterization and analysis of the U.S. clinical trials enterprise as a whole. In addition, the methodology we present for creating specialty datasets may facilitate other efforts to analyze studies by specialty groups.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1. A schematic representation of the database for Aggregate Analysis of ClinicalTrials.Gov (AACT) with its key enhancements.
Figure 2
Figure 2. High-level Entity-Relationship Diagram (ERD) for AACT.
Figure 3
Figure 3. Percentage of interventional studies with complete data by registration year for selected data elements.
Figure 4
Figure 4. An overview of methodology and process of developing clinical specialty datasets.
The INTERVENTIONS, CONDITIONS, and KEYWORDS tables consist of disease condition terms provided by data submitters that include both MeSH and non-MeSH terms. The INTERVENTION_BROWSE and CONDITION_BROWSE tables are populated by MeSH terms generated by NLM algorithm (a) Process illustrating how MeSH terms are created in ClinicalTrials.gov. Tables and data shown here does not represent entire ClinicalTrials.gov database (b) Process illustrating the annotation and validation of disease conditions (c) Process illustrating the creation of specialty datasets.
Figure 5
Figure 5. MeSH trees for acromegaly.
Source: 2010 online MeSH thesaurus (available: http://www.nlm.nih.gov/cgi/mesh/2010/MB_cgi).
Figure 6
Figure 6. Rules for deciding whether a given study belongs to a given specialty.

References

    1. Public Law 105–115. The Food and Drug Administration Modernization Act. Available: http://www.fda.gov/RegulatoryInformation/Legislation/FederalFoodDrugandC.... Accessed 2012 Jan 9.
    1. ClinicalTrials.gov Web site. About ClinicalTrials.gov. Available: http://www.clinicaltrials.gov/ct2/info/about. Accessed 2011 May 6.
    1. DeAngelis CD, Drazen JM, Frizelle FA, Haug C, Hoey J, et al. Clinical trial registration: a statement from the International Committee of Medical Journal Editors. JAMA. 2004;292:1363–1364. - PubMed
    1. Zarin DA, Tse T, Williams RJ, Califf RM, Ide NC. The ClinicalTrials.gov results database–update and key issues. N Engl J Med. 2011;364:852–60. - PMC - PubMed
    1. Clinical Trials Transformation Initiative Web site. Welcome to CTTI. Available: https://www.trialstransformation.org. Accessed 2012 Jan 9.

Publication types

MeSH terms