Automated data extraction tool (DET) for external applications in radiotherapy
- PMID: 36659909
- PMCID: PMC9842687
- DOI: 10.1016/j.tipsro.2022.12.001
Automated data extraction tool (DET) for external applications in radiotherapy
Abstract
Oncological Information Systems (OIS) manage information in radiotherapy (RT) departments. Due to database structure limitations, stored information can rarely be directly used except for vendor-specific purposes. Our aim is to enable the use of such data in various external applications by creating a tool for automatic data extraction, cleaning and formatting.
Methods and materials: We used OIS data from a nine-linac RT department in Sweden (70 weeks, 2015-16). Extracted data included patients' referrals and appointments with details for RT sub-tasks. The data extraction tool to prepare the data for external use was built in C# programming language. It used excel-automation queries to remove unassigned/duplicated values, substitute missing data and perform application-specific calculations. Descriptive statistics were used to verify the output with the manually prepared dataset from the corresponding time period.
Results: From the initial raw data, 2030 (51 %)/907 (23 %) patients had known curative and palliative treatment intent for 84 different cancer diagnoses. After removal of incomplete entries, 373 (10 %) patients had unknown treatment intents which were substituted based on the known curative/palliative ratio. Automatically- and manuallyprepared datasets differed < 1 % for Mould, Treatment planning, Quality assurance and ± 5 % for Fractions and Magnetic resonance imaging with overestimations in 80/140 (57 %) entries by the tool.
Conclusion: We successfully implemented a software tool to prepare ready-to-use OIS datasets for external applications. Our evaluations showed overall results close to the manually-prepared dataset. The time taken to prepare the dataset using our automated strategy can reduce the time for manual preparation from weeks to seconds.
Keywords: Automation; Data cleaning; Data extraction; Radiotherapy.
© 2022 The Author(s).
Conflict of interest statement
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Figures




References
-
- Babashov V., Aivas I., Begen M.A., Cao J.Q., Rodrigues G., D'Souza D., et al. Reducing Patient Waiting Times for Radiation Therapy and Improving the Treatment Planning Process: a Discrete-event Simulation Model (Radiation Treatment Planning) Clin Oncol (R Coll Radiol) 2017;29(6):385–391. doi: 10.1016/j.clon.2017.01.039. - DOI - PubMed
-
- Lindberg J., Gurjar M., Holmstrom P., Hallberg S., Bjork-Eriksson T., Olsson C.E. Resource planning principles for the radiotherapy process using simulations applied to a longer vacation period use case. Tech Innov Patient Support Radiat Oncol. 2021;20:17–22. doi: 10.1016/j.tipsro.2021.10.001. - DOI - PMC - PubMed
LinkOut - more resources
Full Text Sources