Use of automatic SQL generation interface to enhance transparency and validity of health-data analysis
- PMID: 35874460
- PMCID: PMC9306316
- DOI: 10.1016/j.imu.2022.100996
Use of automatic SQL generation interface to enhance transparency and validity of health-data analysis
Abstract
Analysis of health data typically requires development of queries using structured query language (SQL) by a data-analyst. As the SQL queries are manually created, they are prone to errors. In addition, accurate implementation of the queries depends on effective communication with clinical experts, that further makes the analysis error prone. As a potential resolution, we explore an alternative approach wherein a graphical interface that automatically generates the SQL queries is used to perform the analysis. The latter allows clinical experts to directly perform complex queries on the data, despite their unfamiliarity with SQL syntax. The interface provides an intuitive understanding of the query logic which makes the analysis transparent and comprehensible to the clinical study-staff, thereby enhancing the transparency and validity of the analysis. This study demonstrates the feasibility of using a user-friendly interface that automatically generate SQL for analysis of health data. It outlines challenges that will be useful for designing user-friendly tools to improve transparency and reproducibility of data analysis.
Keywords: Databases; Graphical user-interface; Reproducibility of analysis; Structured query language; Validity of analysis.
Conflict of interest statement
Declaration of competing interest The authors declare that they have no conflict of interest.
Figures
References
-
- Benson MD, McPartlin M, Matta L, et al. A remote lipid management program improves appropriate statin use and cholesterol levels across a wide population of high cardiovascular risk patients. J Am Coll Cardiol 2018;71:A1762.
Grants and funding
LinkOut - more resources
Full Text Sources
Research Materials