Automatic speech recognition performance for digital scribes: a performance comparison between general-purpose and specialized models tuned for patient-clinician conversations
- PMID: 37128439
- PMCID: PMC10148344
Automatic speech recognition performance for digital scribes: a performance comparison between general-purpose and specialized models tuned for patient-clinician conversations
Abstract
One promising solution to address physician data entry needs is through the development of so-called "digital scribes," or tools which aim to automate clinical documentation via automatic speech recognition (ASR) of patient-clinician conversations. Evaluation of specialized ASR models in this domain, useful for understanding feasibility and development opportunities, has been difficult because most models have been under development. Following the commercial release of such models, we report an independent evaluation of four models, two general-purpose, and two for medical conversation with a corpus of 36 primary care conversations. We identify word error rates (WER) of 8.8%-10.5% and word-level diarization error rates (WDER) ranging from 1.8%-13.9%, which are generally lower than previous reports. The findings indicate that, while there is room for improvement, the performance of these specialized models, at least under ideal recording conditions, may be amenable to the development of downstream applications which rely on ASR of patient-clinician conversations.
©2022 AMIA - All rights reserved.
Similar articles
-
"Mm-hm," "Uh-uh": are non-lexical conversational sounds deal breakers for the ambient clinical documentation technology?J Am Med Inform Assoc. 2023 Mar 16;30(4):703-711. doi: 10.1093/jamia/ocad001. J Am Med Inform Assoc. 2023. PMID: 36688526 Free PMC article.
-
Complete and Resilient Documentation for Operational Medical Environments Leveraging Mobile Hands-free Technology in a Systems Approach: Experimental Study.JMIR Mhealth Uhealth. 2021 Oct 12;9(10):e32301. doi: 10.2196/32301. JMIR Mhealth Uhealth. 2021. PMID: 34636729 Free PMC article.
-
A systematic comparison of contemporary automatic speech recognition engines for conversational clinical speech.AMIA Annu Symp Proc. 2018 Dec 5;2018:683-689. eCollection 2018. AMIA Annu Symp Proc. 2018. PMID: 30815110 Free PMC article.
-
How does medical scribes' work inform development of speech-based clinical documentation technologies? A systematic review.J Am Med Inform Assoc. 2020 May 1;27(5):808-817. doi: 10.1093/jamia/ocaa020. J Am Med Inform Assoc. 2020. PMID: 32181812 Free PMC article.
-
The digital scribe in clinical practice: a scoping review and research agenda.NPJ Digit Med. 2021 Mar 26;4(1):57. doi: 10.1038/s41746-021-00432-5. NPJ Digit Med. 2021. PMID: 33772070 Free PMC article.
Cited by
-
The Utility and Implications of Ambient Scribes in Primary Care.JMIR AI. 2024 Oct 4;3:e57673. doi: 10.2196/57673. JMIR AI. 2024. PMID: 39365655 Free PMC article.
-
Inspired Spine Smart Universal Resource Identifier (SURI): An Adaptive AI Framework for Transforming Multilingual Speech Into Structured Medical Reports.Cureus. 2025 Mar 26;17(3):e81243. doi: 10.7759/cureus.81243. eCollection 2025 Mar. Cureus. 2025. PMID: 40291306 Free PMC article.
-
Evaluating the Usability, Technical Performance, and Accuracy of Artificial Intelligence Scribes for Primary Care: Competitive Analysis.JMIR Hum Factors. 2025 Jul 23;12:e71434. doi: 10.2196/71434. JMIR Hum Factors. 2025. PMID: 40700466 Free PMC article.
References
-
- Shortliffe EH. Biomedical Informatics: Computer Applications in Health Care and Biomedicine. 3rd ed. New York, NY: Springer; 2006.
-
- Taking action against clinician burnout. Washington, D.C., DC: National Academies Press; 2020. National Academies of Sciences, Engineering, and Medicine, National Academy of Medicine, Committee on Systems Approaches to Improve Patient Care by Supporting Clinician Well-Being.
-
- Massachusetts medical society: A crisis in health care: A call to action on physician burnout. https://www.massmed.org/Publications/Research,-Studies,-and-Reports/A-Cr... (accessed 29 Jul 2022)