Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2023 Sep 16;38(10):1035-1042.
doi: 10.1007/s10654-023-01047-8. Online ahead of print.

Statistical inference and effect measures in abstracts of randomized controlled trials, 1975-2021. A systematic review

Affiliations
Review

Statistical inference and effect measures in abstracts of randomized controlled trials, 1975-2021. A systematic review

Andreas Stang et al. Eur J Epidemiol. .

Abstract

Objective: To examine the time trend of statistical inference, statistical reporting style of results, and effect measures from the abstracts of randomized controlled trials (RCTs).

Study desgin and settings: We downloaded 385,867 PubMed abstracts of RCTs from 1975 to 2021. We used text-mining to detect reporting of statistical inference (p-values, confidence intervals, significance terminology), statistical reporting style of results, and effect measures for binary outcomes, including time-to-event measures. We validated the text mining algorithms by random samples of abstracts.

Results: A total of 320 676 abstracts contained statistical inference. The percentage of abstracts including statistical inference increased from 65% (1975) to 87% (2006) and then decreased slightly. From 1975 to 1990, the sole reporting of language regarding statistical significance was predominant. Since 1990, reporting of p-values without confidence intervals has been the most common reporting style. Reporting of confidence intervals increased from 0.5% (1975) to 29% (2021). The two most common effect measures for binary outcomes were hazard ratios and odds ratios. Number needed to treat and number needed to harm are reported in less than 5% of abstracts with binary endpoints.

Conclusions: Reporting of statistical inference in abstracts of RCTs has increased over time. Increasingly, p-values and confidence intervals are reported rather than just mentioning the presence of "statistical significance". The reporting of odds ratios comes with the liability that the untrained reader will interpret them as risk ratios, which is often not justified, especially in RCTs.

Keywords: Confidence intervals; Randomized controlled trials; Statistics; Statistics and numerical data.

PubMed Disclaimer

Conflict of interest statement

The authors have no relevant financial or non-financial interests to disclose.

Figures

Fig. 1
Fig. 1
Annual number of abstracts of randomized controlled trials and percentage of abstracts containing statistical inference, 1975–2021 red graph: estimated proportion of abstracts on RCTs that include statistical inference, by year (LOESS with smooth = 0.2) with 95% confidence interval bands; black dots: observed proportion, by year; dashed line: total number of abstracts of RCTs per year
Fig. 2
Fig. 2
Estimated time trends, 1975–2021, of the statistical reporting style in abstracts on randomized controlled trials (non-hierarchical) Blue indicates reporting of confidence intervals; green indicates reporting of numerical p-values or p thresholds, (e.g. p < 0.01); red indicates reporting of significance language. All trend lines are LOESS smoothed with inverse-variance weighting
Fig. 3
Fig. 3
Flexibly estimated time trends 1975–2021 of the statistical reporting style in abstracts on randomized controlled trials (hierarchical) blue graphs indicate reporting of confidence intervals; green graphs indicate reporting of numerical p-values or p thresholds (e.g. p < 0.01) without confidence intervals; red graphs indicate reporting of significance without confidence intervals and p-values; all trend lines are LOESS smoothed with inverse-variance weighting

References

    1. Rothman KJ. A show of confidence. N Engl J Med. 1978;299:1362–3. doi: 10.1056/NEJM197812142992410. - DOI - PubMed
    1. Stang A, Poole C, Bender R. Common problems related to the use of number needed to treat. J ClinEpidemiol. 2010;63:820–5. - PubMed
    1. Langman MJ. Towards estimation and confidence intervals. Br Med J (Clin Res Ed) 1986;292:716. doi: 10.1136/bmj.292.6522.716. - DOI - PMC - PubMed
    1. Begg C, Cho M, Eastwood S, Horton R, Moher D, Olkin I, et al. Improving the quality of reporting of randomized controlled trials. The CONSORT statement. JAMA. 1996;276:637–9. doi: 10.1001/jama.1996.03540080059030. - DOI - PubMed
    1. Altman DG. Better reporting of randomised controlled trials: the CONSORT statement. BMJ. 1996;313:570–1. doi: 10.1136/bmj.313.7057.570. - DOI - PMC - PubMed

LinkOut - more resources