Exploring collaborative caption editing to augment video-based learning
- PMID: 35855355
- PMCID: PMC9285185
- DOI: 10.1007/s11423-022-10137-5
Exploring collaborative caption editing to augment video-based learning
Abstract
Captions play a major role in making educational videos accessible to all and are known to benefit a wide range of learners. However, many educational videos either do not have captions or have inaccurate captions. Prior work has shown the benefits of using crowdsourcing to obtain accurate captions in a cost-efficient way, though there is a lack of understanding of how learners edit captions of educational videos either individually or collaboratively. In this work, we conducted a user study where 58 learners (in a course of 387 learners) participated in the editing of captions in 89 lecture videos that were generated by Automatic Speech Recognition (ASR) technologies. For each video, different learners conducted two rounds of editing. Based on editing logs, we created a taxonomy of errors in educational video captions (e.g., Discipline-Specific, General, Equations). From the interviews, we identified individual and collaborative error editing strategies. We then further demonstrated the feasibility of applying machine learning models to assist learners in editing. Our work provides practical implications for advancing video-based learning and for educational video caption editing.
Keywords: Caption transcription; Collaborative editing; Lecture video caption editing; Technology-assisted editing.
© Association for Educational Communications and Technology 2022.
Figures





Similar articles
-
Listen or Read? The Impact of Proficiency and Visual Complexity on Learners' Reliance on Captions.Behav Sci (Basel). 2025 Apr 17;15(4):542. doi: 10.3390/bs15040542. Behav Sci (Basel). 2025. PMID: 40282163 Free PMC article.
-
Towards a More Inclusive Learning Environment: The Importance of Providing Captions That Are Suited to Learners' Language Proficiency in the UDL Classroom.Stud Health Technol Inform. 2022 Sep 2;297:533-540. doi: 10.3233/SHTI220884. Stud Health Technol Inform. 2022. PMID: 36073435
-
ICT multimedia learning affordances: role and impact on ESL learners' writing accuracy development.Heliyon. 2021 Jul 10;7(7):e07517. doi: 10.1016/j.heliyon.2021.e07517. eCollection 2021 Jul. Heliyon. 2021. PMID: 34307944 Free PMC article.
-
On-Screen Texts in Audiovisual Input for L2 Vocabulary Learning: A Review.Front Psychol. 2022 May 13;13:904523. doi: 10.3389/fpsyg.2022.904523. eCollection 2022. Front Psychol. 2022. PMID: 35645916 Free PMC article. Review.
-
How to Shoot and Edit High-Quality Surgical Videos for Hand and Upper Extremity Surgery.J Hand Surg Am. 2022 May;47(5):471-474. doi: 10.1016/j.jhsa.2021.09.021. Epub 2021 Dec 10. J Hand Surg Am. 2022. PMID: 34903392 Review.
References
-
- Aggarwal CC, Zhai C. Mining text data. Springer; 2012. A survey of text classification algorithms; pp. 163–222.
-
- Alvarez A, Martínez-Hinarejos C-D, Arzelus H, Balenciaga M, del Pozo A. Improving the automatic segmentation of subtitles through conditional random field. Speech Communication. 2017;88:83–95. doi: 10.1016/j.specom.2017.01.010. - DOI
-
- Amershi S, Cakmak M, Knox WB, Kulesza T. Power to the people: The role of humans in interactive machine learning. Ai Magazine. 2014;35(4):105–120. doi: 10.1609/aimag.v35i4.2513. - DOI
-
- Amos, J. R. , Zhang, Z. , Angrave, L. , Liu, H., & Shen, Y. (2021). A udl-based large-scale study on the needs of students with disabilities in engineering courses. In 2021 asee virtual annual conference content access.
-
- Angrave, L., Jensen, K., Zhang, Z., Mahipal, C., Mussulman, D., Schmitz, C. D., & Kooper (2020a). Improving student accessibility, equity, course performance, and lab skills: How introduction of classtranscribe is changing engineering education at the university of illinois. In Asee annual conference & exposition
LinkOut - more resources
Full Text Sources