Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Apr 21;8(4):201925.
doi: 10.1098/rsos.201925.

The multiplicity of analysis strategies jeopardizes replicability: lessons learned across disciplines

Affiliations

The multiplicity of analysis strategies jeopardizes replicability: lessons learned across disciplines

Sabine Hoffmann et al. R Soc Open Sci. .

Abstract

For a given research question, there are usually a large variety of possible analysis strategies acceptable according to the scientific standards of the field, and there are concerns that this multiplicity of analysis strategies plays an important role in the non-replicability of research findings. Here, we define a general framework on common sources of uncertainty arising in computational analyses that lead to this multiplicity, and apply this framework within an overview of approaches proposed across disciplines to address the issue. Armed with this framework, and a set of recommendations derived therefrom, researchers will be able to recognize strategies applicable to their field and use them to generate findings more likely to be replicated in future studies, ultimately improving the credibility of the scientific process.

Keywords: interdisciplinary perspective; metaresearch; open science; replicability crisis; uncertainty.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
The multiplicity of analysis strategies arising from data preprocessing, model and method choices to obtain an estimate of the parameter of interest θ and values of the outcome variable Y for two research questions in epidemiology and hydroclimatology, respectively.
Figure 2.
Figure 2.
Sources of uncertainty in explanatory, mechanistic predictive and agnostic predictive modelling. Data preprocessing, parameter, model and method uncertainty are epistemic sources of uncertainty arising from a lack of knowledge in the specification of the analysis strategy. Measurement and sampling uncertainty are random sources of uncertainty that lead to variability in the results when the same analysis strategy is applied on different datasets. The model structure describes the association between the p input variables X1,X2,,Xp and the outcome of interest Y. θ is a parameter and e represents a probabilistic error term.
Figure 3.
Figure 3.
The impact of random sources of uncertainty and of the multiplicity of possible analysis strategies on the replicability of research findings. The result of interest is the parameter θ in explanatory modelling, the outcome Y in mechanistic predictive modelling and the predictive performance in agnostic predictive modelling. The yellow colour represents the results of the chosen analysis strategy—a strategy selected because it presents the most ‘favourable’ results. It is clear that the traditional confidence interval (given by the bars around the estimate ‘x’), which only takes into account sampling uncertainty, is inadequate in capturing the true uncertainty in the estimate.
Figure 4.
Figure 4.
Overview of solutions to the replication crisis which address the multiplicity of analysis strategies by reducing, reporting, integrating or accepting uncertainty. For an interactive version of this graphic with assorted references see https://shiny.psy.lmu.de/multiplicity/index.html.

References

    1. Loken E, Gelman A. 2017. Measurement error and the replication crisis. Science 355, 584-585. (10.1126/science.aal3618) - DOI - PubMed
    1. Baker M. 2016. 1,500 scientists lift the lid on reproducibility: survey sheds light on the `crisis' rocking research. Nature 533, 452-455. (10.1038/533452a) - DOI - PubMed
    1. Gelman A, Loken E. 2014. The statistical crisis in science. Am. Sci. 102, 460-465. (10.1511/2014.111.460) - DOI
    1. Begley CG, Ellis LM. 2012. Drug development: raise standards for preclinical cancer research. Nature 483, 531-533. (10.1038/483531a) - DOI - PubMed
    1. Ioannidis JPA et al. 2009. Repeatability of published microarray gene expression analyses. Nat. Genet. 41, 149-155. (10.1038/ng.295) - DOI - PubMed

LinkOut - more resources