Video-based assessment of intraoperative surgical skill
- PMID: 35635639
- PMCID: PMC10323985
- DOI: 10.1007/s11548-022-02681-5
Video-based assessment of intraoperative surgical skill
Abstract
Purpose: Surgeons' skill in the operating room is a major determinant of patient outcomes. Assessment of surgeons' skill is necessary to improve patient outcomes and quality of care through surgical training and coaching. Methods for video-based assessment of surgical skill can provide objective and efficient tools for surgeons. Our work introduces a new method based on attention mechanisms and provides a comprehensive comparative analysis of state-of-the-art methods for video-based assessment of surgical skill in the operating room.
Methods: Using a dataset of 99 videos of capsulorhexis, a critical step in cataract surgery, we evaluated image feature-based methods and two deep learning methods to assess skill using RGB videos. In the first method, we predict instrument tips as keypoints and predict surgical skill using temporal convolutional neural networks. In the second method, we propose a frame-wise encoder (2D convolutional neural network) followed by a temporal model (recurrent neural network), both of which are augmented by visual attention mechanisms. We computed the area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and predictive values through fivefold cross-validation.
Results: To classify a binary skill label (expert vs. novice), the range of AUC estimates was 0.49 (95% confidence interval; CI = 0.37 to 0.60) to 0.76 (95% CI = 0.66 to 0.85) for image feature-based methods. The sensitivity and specificity were consistently high for none of the methods. For the deep learning methods, the AUC was 0.79 (95% CI = 0.70 to 0.88) using keypoints alone, 0.78 (95% CI = 0.69 to 0.88) and 0.75 (95% CI = 0.65 to 0.85) with and without attention mechanisms, respectively.
Conclusion: Deep learning methods are necessary for video-based assessment of surgical skill in the operating room. Attention mechanisms improved discrimination ability of the network. Our findings should be evaluated for external validity in other datasets.
Keywords: Cataract surgery; Deep learning; Surgical skill; Video-based assessment.
© 2022. CARS.
Conflict of interest statement
•
Figures




References
-
- Agresti Alan. Categorical data analysis, volume 482. John Wiley & Sons, 2003.
-
- [] Bahdanau Dzmitry, Cho Kyunghyun, and Bengio Yoshua. Neural machine translation by jointly learning to align and translate. In 3rd International Conference on Learning Representations, 2015.
-
- Bettadapura Vinay, Schindler Grant, Plötz Thomas, and Essa Irfan. Augmenting bag-of-words: Data-driven discovery of temporal and structural information for activity recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2619–2626, 2013.
-
- Birkmeyer John D, Finks Jonathan F, O’Reilly Amanda, Oerline Mary, Carlin Arthur M, Nunn Andre R, Dimick Justin, Banerjee Mousumi, and Birkmeyer Nancy JO. Surgical skill and complication rates after bariatric surgery. New England Journal of Medicine, 369(15):1434–1442, 2013. - PubMed
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources