Causal analyses of existing databases: no power calculations required
- PMID: 34461211
- PMCID: PMC8882204
- DOI: 10.1016/j.jclinepi.2021.08.028
Causal analyses of existing databases: no power calculations required
Abstract
Observational databases are often used to study causal questions. Before being granted access to data or funding, researchers may need to prove that "the statistical power of their analysis will be high." Analyses expected to have low power, and hence result in imprecise estimates, will not be approved. This restrictive attitude towards observational analyses is misguided. A key misunderstanding is the belief that the goal of a causal analysis is to "detect" an effect. Causal effects are not binary signals that are either detected or undetected; causal effects are numerical quantities that need to be estimated. Because the goal is to quantify the effect as unbiasedly and precisely as possible, the solution to observational analyses with imprecise effect estimates is not avoiding observational analyses with imprecise estimates, but rather encouraging the conduct of many observational analyses. It is preferable to have multiple studies with imprecise estimates than having no study at all. After several studies become available, we will meta-analyze them and provide a more precise pooled effect estimate. Therefore, the justification to withhold an observational analysis of preexisting data cannot be that our estimates will be imprecise. Ethical arguments for power calculations before conducting a randomized trial which place individuals at risk are not transferable to observational analyses of existing databases. If a causal question is important, analyze your data, publish your estimates, encourage others to do the same, and then meta-analyze. The alternative is an unanswered question.
Keywords: Causal analysis; Causal inference; Meta-analysis; Observational analysis; Observational studies; Sample size; Statistical power; Statistical significance.
Copyright © 2021 Elsevier Inc. All rights reserved.
Conflict of interest statement
The author declares that there is no conflict of interest.
Comment in
-
Sample size considerations are needed for the causal analyses of existing databases.J Clin Epidemiol. 2022 Jan;141:212. doi: 10.1016/j.jclinepi.2021.09.024. Epub 2021 Sep 20. J Clin Epidemiol. 2022. PMID: 34551318 No abstract available.
-
Causal analyses of existing databases: the importance of understanding what can be achieved with your data before analysis (commentary on Hernán).J Clin Epidemiol. 2022 Feb;142:261-263. doi: 10.1016/j.jclinepi.2021.09.026. Epub 2021 Sep 22. J Clin Epidemiol. 2022. PMID: 34560253 No abstract available.
-
European non-commercial sponsors showed substantial variation in results reporting to the EU trial registry.J Clin Epidemiol. 2022 Feb;142:161-170. doi: 10.1016/j.jclinepi.2021.11.005. Epub 2021 Nov 10. J Clin Epidemiol. 2022. PMID: 34767965
-
A few things to consider when deciding whether or not to conduct underpowered research.J Clin Epidemiol. 2022 Apr;144:194-197. doi: 10.1016/j.jclinepi.2021.11.038. Epub 2021 Dec 4. J Clin Epidemiol. 2022. PMID: 34875377 No abstract available.
-
Causal analysis of existing databases: no power calculations required. Responses to Campbell, Morris and Mansournia, et al.J Clin Epidemiol. 2022 Apr;144:193. doi: 10.1016/j.jclinepi.2021.11.039. Epub 2021 Dec 4. J Clin Epidemiol. 2022. PMID: 34875379 Free PMC article. No abstract available.
References
-
- Jick H, Walker AM, Watkins RN, D’Ewart DC, Hunter JR, Danford A, Madsen S, Dinan BJ, Rothman KJ. Oral contraceptives and breast cancer. American Journal of Epidemiology 1980; 112(5):577– 585. - PubMed
-
-
Greenland S. Invited Commentary: The Need for Cognitive Science in Methodology. American Journal of Epidemiology 2017; 186(6); 639–645
This paper discusses how null-hypothesis significance testing combines three cognitive problems (dichotomania, nullism, and statistical reification) to produce highly distorted interpretation and reporting of study results.
-
-
-
Hernán MA. Does water kill? A call for less casual causal inferences. Annals of Epidemiology 2016; 26: 674–680.
a.This paper emphasizes that the goal of data analysis for causal inference is to quantify causal effects as validly and precisely as possible, rather than to identify whether something is “a cause.”
-
-
-
Lash TL. The Harm Done to Reproducibility by the Culture of Null Hypothesis Significance Testing. American Journal of Epidemiology 2017; 186(6):627–635.
This paper describes how the reported reproducibility crisis in science is partly due to the culture of null hypothesis significance testing that dominates statistical analysis and inference.
-
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
