Data-Driven Hypothesis Generation in Clinical Research: What We Learned from a Human Subject Study?
- PMID: 39211055
- PMCID: PMC11361316
- DOI: 10.18103/mra.v12i2.5132
Data-Driven Hypothesis Generation in Clinical Research: What We Learned from a Human Subject Study?
Abstract
Hypothesis generation is an early and critical step in any hypothesis-driven clinical research project. Because it is not yet a well-understood cognitive process, the need to improve the process goes unrecognized. Without an impactful hypothesis, the significance of any research project can be questionable, regardless of the rigor or diligence applied in other steps of the study, e.g., study design, data collection, and result analysis. In this perspective article, the authors provide a literature review on the following topics first: scientific thinking, reasoning, medical reasoning, literature-based discovery, and a field study to explore scientific thinking and discovery. Over the years, scientific thinking has shown excellent progress in cognitive science and its applied areas: education, medicine, and biomedical research. However, a review of the literature reveals the lack of original studies on hypothesis generation in clinical research. The authors then summarize their first human participant study exploring data-driven hypothesis generation by clinical researchers in a simulated setting. The results indicate that a secondary data analytical tool, VIADS-a visual interactive analytic tool for filtering, summarizing, and visualizing large health data sets coded with hierarchical terminologies, can shorten the time participants need, on average, to generate a hypothesis and also requires fewer cognitive events to generate each hypothesis. As a counterpoint, this exploration also indicates that the quality ratings of the hypotheses thus generated carry significantly lower ratings for feasibility when applying VIADS. Despite its small scale, the study confirmed the feasibility of conducting a human participant study directly to explore the hypothesis generation process in clinical research. This study provides supporting evidence to conduct a larger-scale study with a specifically designed tool to facilitate the hypothesis-generation process among inexperienced clinical researchers. A larger study could provide generalizable evidence, which in turn can potentially improve clinical research productivity and overall clinical research enterprise.
Keywords: Clinical research; data-driven hypothesis generation; medical informatics; scientific hypothesis generation; translational research; visualization.
Figures



Similar articles
-
The Roles of a Secondary Data Analytics Tool and Experience in Scientific Hypothesis Generation in Clinical Research: Protocol for a Mixed Methods Study.JMIR Res Protoc. 2022 Jul 18;11(7):e39414. doi: 10.2196/39414. JMIR Res Protoc. 2022. PMID: 35736798 Free PMC article.
-
Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools.medRxiv [Preprint]. 2023 Oct 31:2023.05.30.23290719. doi: 10.1101/2023.05.30.23290719. medRxiv. 2023. Update in: J Clin Transl Sci. 2024 Jan 04;8(1):e13. doi: 10.1017/cts.2023.708. PMID: 37333271 Free PMC article. Updated. Preprint.
-
Data-driven hypothesis generation among inexperienced clinical researchers: A comparison of secondary data analyses with visualization (VIADS) and other tools.J Clin Transl Sci. 2024 Jan 4;8(1):e13. doi: 10.1017/cts.2023.708. eCollection 2024. J Clin Transl Sci. 2024. PMID: 38384898 Free PMC article.
-
Dietary glycation compounds - implications for human health.Crit Rev Toxicol. 2024 Sep;54(8):485-617. doi: 10.1080/10408444.2024.2362985. Epub 2024 Aug 16. Crit Rev Toxicol. 2024. PMID: 39150724
-
[Aiming for zero blindness].Nippon Ganka Gakkai Zasshi. 2015 Mar;119(3):168-93; discussion 194. Nippon Ganka Gakkai Zasshi. 2015. PMID: 25854109 Review. Japanese.
Cited by
-
The quality of data-driven hypotheses generated by inexperienced clinical researchers: A case study.medRxiv [Preprint]. 2024 Aug 13:2024.08.12.24311877. doi: 10.1101/2024.08.12.24311877. medRxiv. 2024. Update in: Health Informatics J. 2025 Jul-Sep;31(3):14604582251353587. doi: 10.1177/14604582251353587. PMID: 39185523 Free PMC article. Updated. Preprint.
-
Using think-aloud protocol to identify cognitive events while generating data-driven scientific hypotheses by inexperienced clinical researchers.AMIA Annu Symp Proc. 2025 May 22;2024:561-570. eCollection 2024. AMIA Annu Symp Proc. 2025. PMID: 40417518 Free PMC article.
References
-
- Pruzan P. Research Methodology: The Aims, Practices and Ethics of Science. Springer International Publishing Switzerland; 2016.
-
- Hicks CM. Research methods for clinical therapists: Applied project design and analysis. 1999; - PubMed
-
- Supino P, Borer J. Principles of research methodology: A guide for clinical investigators. 2012;
-
- Browner W, Newman T, Cummings S, et al. Designing Clinical Research. 5th ed. Wolters Kluwer; 2023.
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials