Development and Evaluation of a Digital Scribe: Conversation Summarization Pipeline for Emergency Department Counseling Sessions towards Reducing Documentation Burden

doi:10.1101/2023.12.06.23299573

This is a preprint.

It has not yet been peer reviewed by a journal.

The National Library of Medicine is running a pilot to include preprints that result from research funded by NIH in PMC and PubMed.

[Preprint]. 2023 Dec 7:2023.12.06.23299573.

doi: 10.1101/2023.12.06.23299573.

Development and Evaluation of a Digital Scribe: Conversation Summarization Pipeline for Emergency Department Counseling Sessions towards Reducing Documentation Burden

Emre Sezgin^{1

2}, Joseph Sirrianni¹, Kelly Kranz¹

Affiliations

PMID: 38106162
PMCID: PMC10723557
DOI: 10.1101/2023.12.06.23299573

Development and Evaluation of a Digital Scribe: Conversation Summarization Pipeline for Emergency Department Counseling Sessions towards Reducing Documentation Burden

Emre Sezgin et al. medRxiv. 2023.

[Preprint]. 2023 Dec 7:2023.12.06.23299573.

doi: 10.1101/2023.12.06.23299573.

Authors

Emre Sezgin^{1

2}, Joseph Sirrianni¹, Kelly Kranz¹

Affiliations

¹ Nationwide Children's Hospital, Columbus OH.
² Ohio State University College of Medicine, Columbus OH.

PMID: 38106162
PMCID: PMC10723557
DOI: 10.1101/2023.12.06.23299573

Abstract

Objective: We present a proof-of-concept digital scribe system as an ED clinical conversation summarization pipeline and report its performance.

Materials and methods: We use four pre-trained large language models to establish the digital scribe system: T5-small, T5-base, PEGASUS-PubMed, and BART-Large-CNN via zero-shot and fine-tuning approaches. Our dataset includes 100 referral conversations among ED clinicians and medical records. We report the ROUGE-1, ROUGE-2, and ROUGE-L to compare model performance. In addition, we annotated transcriptions to assess the quality of generated summaries.

Results: The fine-tuned BART-Large-CNN model demonstrates greater performance in summarization tasks with the highest ROUGE scores (F1_ROUGE-1=0.49, F1_ROUGE-2=0.23, F1_ROUGE-L=0.35) scores. In contrast, PEGASUS-PubMed lags notably (F1_ROUGE-1=0.28, F1_ROUGE-2=0.11, F1_ROUGE-L=0.22). BART-Large-CNN's performance decreases by more than 50% with the zero-shot approach. Annotations show that BART-Large-CNN performs 71.4% recall in identifying key information and a 67.7% accuracy rate.

Discussion: The BART-Large-CNN model demonstrates a high level of understanding of clinical dialogue structure, indicated by its performance with and without fine-tuning. Despite some instances of high recall, there is variability in the model's performance, particularly in achieving consistent correctness, suggesting room for refinement. The model's recall ability varies across different information categories.

Conclusion: The study provides evidence towards the potential of AI-assisted tools in reducing clinical documentation burden. Future work is suggested on expanding the research scope with larger language models, and comparative analysis to measure documentation efforts and time.

Keywords: Emergency department; Natural Language Processing; Text Summarization; clinical conversation; documentation burden; pre-trained language model.

PubMed Disclaimer

Conflict of interest statement

Conflict of interest None declared.

Figures

**Figure 2.**
A histogram of the number of tokens per transcript. The tokens were generated for this graph using the BART tokenizer.[40] The vertical line represents the maximum input length of the models, 1024 tokens, and 82% of transcripts clusters to the left of this line.

**Figure 3.**
Histograms showing information recalled (without consideration of correctness) [right] and correctly recalled information [left] by a generated summary that appeared in the ground truth summary.

**Figure 4.**
Example generated and nurse note samples with LINK and CORRECT annotations

See this image and copyright information in PMC

References

1. Quiroz JC, Laranjo L, Kocaballi AB, et al. Challenges of developing a digital scribe to reduce clinical documentation burden. NPJ Digit Med 2019;2:114. doi:10.1038/s41746-019-0190-1 - DOI - PMC - PubMed
1. Chandawarkar A, Chaparro JD. Burnout in clinicians. Curr Probl Pediatr Adolesc Health Care 2021;51:101104. doi:10.1016/j.cppeds.2021.101104 - DOI - PMC - PubMed
1. Joukes E, Abu-Hanna A, Cornet R, et al. Time Spent on Dedicated Patient Care and Documentation Tasks Before and After the Introduction of a Structured and Standardized Electronic Health Record. Appl Clin Inform 2018;9:46–53. doi:10.1055/s-0037-1615747 - DOI - PMC - PubMed
1. Moukarzel A, Michelet P, Durand A-C, et al. Burnout Syndrome among Emergency Department Staff: Prevalence and Associated Factors. Biomed Res Int 2019;2019:6462472. doi:10.1155/2019/6462472 - DOI - PMC - PubMed
1. Moy AJ, Hobensack M, Marshall K, et al. Understanding the perceived role of electronic health records and workflow fragmentation on clinician documentation burden in emergency departments. J Am Med Inform Assoc 2023;30:797–808. doi:10.1093/jamia/ocad038 - DOI - PMC - PubMed

Publication types

Actions

Grants and funding

UM1 TR004548/TR/NCATS NIH HHS/United States

LinkOut - more resources

Full Text Sources

[1] Quiroz JC, Laranjo L, Kocaballi AB, et al. Challenges of developing a digital scribe to reduce clinical documentation burden. NPJ Digit Med 2019;2:114. doi:10.1038/s41746-019-0190-1 - DOI - PMC - PubMed

[2] Quiroz JC, Laranjo L, Kocaballi AB, et al. Challenges of developing a digital scribe to reduce clinical documentation burden. NPJ Digit Med 2019;2:114. doi:10.1038/s41746-019-0190-1 - DOI - PMC - PubMed

[3] Chandawarkar A, Chaparro JD. Burnout in clinicians. Curr Probl Pediatr Adolesc Health Care 2021;51:101104. doi:10.1016/j.cppeds.2021.101104 - DOI - PMC - PubMed

[4] Chandawarkar A, Chaparro JD. Burnout in clinicians. Curr Probl Pediatr Adolesc Health Care 2021;51:101104. doi:10.1016/j.cppeds.2021.101104 - DOI - PMC - PubMed

[5] Joukes E, Abu-Hanna A, Cornet R, et al. Time Spent on Dedicated Patient Care and Documentation Tasks Before and After the Introduction of a Structured and Standardized Electronic Health Record. Appl Clin Inform 2018;9:46–53. doi:10.1055/s-0037-1615747 - DOI - PMC - PubMed

[6] Joukes E, Abu-Hanna A, Cornet R, et al. Time Spent on Dedicated Patient Care and Documentation Tasks Before and After the Introduction of a Structured and Standardized Electronic Health Record. Appl Clin Inform 2018;9:46–53. doi:10.1055/s-0037-1615747 - DOI - PMC - PubMed

[7] Moukarzel A, Michelet P, Durand A-C, et al. Burnout Syndrome among Emergency Department Staff: Prevalence and Associated Factors. Biomed Res Int 2019;2019:6462472. doi:10.1155/2019/6462472 - DOI - PMC - PubMed

[8] Moukarzel A, Michelet P, Durand A-C, et al. Burnout Syndrome among Emergency Department Staff: Prevalence and Associated Factors. Biomed Res Int 2019;2019:6462472. doi:10.1155/2019/6462472 - DOI - PMC - PubMed

[9] Moy AJ, Hobensack M, Marshall K, et al. Understanding the perceived role of electronic health records and workflow fragmentation on clinician documentation burden in emergency departments. J Am Med Inform Assoc 2023;30:797–808. doi:10.1093/jamia/ocad038 - DOI - PMC - PubMed

[10] Moy AJ, Hobensack M, Marshall K, et al. Understanding the perceived role of electronic health records and workflow fragmentation on clinician documentation burden in emergency departments. J Am Med Inform Assoc 2023;30:797–808. doi:10.1093/jamia/ocad038 - DOI - PMC - PubMed

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

This is a preprint.

Development and Evaluation of a Digital Scribe: Conversation Summarization Pipeline for Emergency Department Counseling Sessions towards Reducing Documentation Burden

Affiliations

Development and Evaluation of a Digital Scribe: Conversation Summarization Pipeline for Emergency Department Counseling Sessions towards Reducing Documentation Burden

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

References

Publication types

Grants and funding

LinkOut - more resources

Full Text Sources

This is a preprint.

Abstract

Conflict of interest statement

Figures

Similar articles

References

Publication types

Related information

Grants and funding

LinkOut - more resources

Full Text Sources