Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 Mar 4:14:34.
doi: 10.1186/1471-2288-14-34.

The thresholds for statistical and clinical significance - a five-step procedure for evaluation of intervention effects in randomised clinical trials

Affiliations

The thresholds for statistical and clinical significance - a five-step procedure for evaluation of intervention effects in randomised clinical trials

Janus Christian Jakobsen et al. BMC Med Res Methodol. .

Abstract

Background: Thresholds for statistical significance are insufficiently demonstrated by 95% confidence intervals or P-values when assessing results from randomised clinical trials. First, a P-value only shows the probability of getting a result assuming that the null hypothesis is true and does not reflect the probability of getting a result assuming an alternative hypothesis to the null hypothesis is true. Second, a confidence interval or a P-value showing significance may be caused by multiplicity. Third, statistical significance does not necessarily result in clinical significance. Therefore, assessment of intervention effects in randomised clinical trials deserves more rigour in order to become more valid.

Methods: Several methodologies for assessing the statistical and clinical significance of intervention effects in randomised clinical trials were considered. Balancing simplicity and comprehensiveness, a simple five-step procedure was developed.

Results: For a more valid assessment of results from a randomised clinical trial we propose the following five-steps: (1) report the confidence intervals and the exact P-values; (2) report Bayes factor for the primary outcome, being the ratio of the probability that a given trial result is compatible with a 'null' effect (corresponding to the P-value) divided by the probability that the trial result is compatible with the intervention effect hypothesised in the sample size calculation; (3) adjust the confidence intervals and the statistical significance threshold if the trial is stopped early or if interim analyses have been conducted; (4) adjust the confidence intervals and the P-values for multiplicity due to number of outcome comparisons; and (5) assess clinical significance of the trial results.

Conclusions: If the proposed five-step procedure is followed, this may increase the validity of assessments of intervention effects in randomised clinical trials.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A figure showing how Bayes factor will change according to different observed effects. The red left vertical line represents the null hypothesis (an effect of null), the right green vertical line represents an alternative hypothesis to the null hypothesis with an effect of 1.0. The black curve shows that Bayes factor will be 1.0 when the observed effect size if exactly half of the effect size of the alternative hypothesis; and the curve shows that Bayes factor will decease with increasing observed effect sizes.

References

    1. Jakobsen JC, Gluud C. The necessity of randomized clinical trials. Br J Med Res. 2013;3(4):1453–1468.
    1. Johnson VE. Revised standards for statistical evidence. Proc Natl Acad Sci USA. 2013;110(48):19313–19317. doi: 10.1073/pnas.1313476110. - DOI - PMC - PubMed
    1. Fisher R. Statistical methods and scientific induction. J R Stat Soc Ser B. 1955;17(1):69–78.
    1. Gigerenzer G. Mindless statistics. J Socio Econ. 2004;33(5):587–606. doi: 10.1016/j.socec.2004.09.033. - DOI
    1. Hald A. A history of mathematical statistics from 1750 to 1930. New York: John Wiley & Sons; 1998.

MeSH terms