Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022:31:100996.
doi: 10.1016/j.imu.2022.100996. Epub 2022 Jun 25.

Use of automatic SQL generation interface to enhance transparency and validity of health-data analysis

Affiliations

Use of automatic SQL generation interface to enhance transparency and validity of health-data analysis

Kavishwar B Wagholikar et al. Inform Med Unlocked. 2022.

Abstract

Analysis of health data typically requires development of queries using structured query language (SQL) by a data-analyst. As the SQL queries are manually created, they are prone to errors. In addition, accurate implementation of the queries depends on effective communication with clinical experts, that further makes the analysis error prone. As a potential resolution, we explore an alternative approach wherein a graphical interface that automatically generates the SQL queries is used to perform the analysis. The latter allows clinical experts to directly perform complex queries on the data, despite their unfamiliarity with SQL syntax. The interface provides an intuitive understanding of the query logic which makes the analysis transparent and comprehensible to the clinical study-staff, thereby enhancing the transparency and validity of the analysis. This study demonstrates the feasibility of using a user-friendly interface that automatically generate SQL for analysis of health data. It outlines challenges that will be useful for designing user-friendly tools to improve transparency and reproducibility of data analysis.

Keywords: Databases; Graphical user-interface; Reproducibility of analysis; Structured query language; Validity of analysis.

PubMed Disclaimer

Conflict of interest statement

Declaration of competing interest The authors declare that they have no conflict of interest.

Figures

Fig. 1.
Fig. 1.
In the conventional approach the data-analyst developed a SQL query to generate the report, while in the proposed auto-SQL approach the data-analyst first denormalized the study database and then the domain expert used the i2b2-webclient graphical-user-interface that automatically generated the SQL for performing the analysis.
Fig. 2.
Fig. 2.
Graphical query interface from the i2b2 platform. The criteria for querying can be easily constructed by dragging terms from the hierarchical tree structure on the left to the widgets on the right. The SQL query is automatically generated in the back-end, which enables clinical staff that are not familiar with SQL to perform complex queries on the data.

References

    1. Benson MD, McPartlin M, Matta L, et al. A remote lipid management program improves appropriate statin use and cholesterol levels across a wide population of high cardiovascular risk patients. J Am Coll Cardiol 2018;71:A1762.
    1. Blood AJ, Fischer CM, Fera LE, et al. Rationale and design of a navigator-driven remote optimization of guideline-directed medical therapy in patients with heart failure with reduced ejection fraction. Clin Cardiol 2020;43:4–13. - PMC - PubMed
    1. Wagholikar KB, Fischer CM, Goodson AP, et al. Phenotyping to facilitate accrual for a cardiovascular intervention. J Clin Med Res 2019;11:458–63. - PMC - PubMed
    1. Gordon WJ, Blood AJ, Chaney K, et al. Workflow automation for a virtual hypertension management program. Appl Clin Inf 2021;12:1041–8. - PMC - PubMed
    1. Murphy SN, Weber G, Mendis M, et al. Serving the enterprise and beyond with informatics for integrating biology and the bedside (i2b2). J Am Med Inf Assoc 2010;17:124–30. - PMC - PubMed