Int J Comput Assist Radiol Surg. 2019 Dec;14(12):2155-2163.
doi: 10.1007/s11548-019-02025-w. Epub 2019 Jul 2.

Novel evaluation of surgical activity recognition models using task-based efficiency metrics

Aneeq Zia et al. Int J Comput Assist Radiol Surg. 2019 Dec.

Abstract

Purpose: Surgical task-based metrics (rather than entire procedure metrics) can be used to improve surgeon training and, ultimately, patient care through focused training interventions. Machine learning models to automatically recognize individual tasks or activities are needed to overcome the otherwise manual effort of video review. Traditionally, these models have been evaluated using frame-level accuracy. Here, we propose evaluating surgical activity recognition models by their effect on task-based efficiency metrics. In this way, we can determine when models have achieved adequate performance for providing surgeon feedback via metrics from individual tasks.

Methods: We propose a new CNN-LSTM model, RP-Net-V2, to recognize the 12 steps of robotic-assisted radical prostatectomies (RARP). We evaluated our model using both conventional measures (e.g., Jaccard Index, task boundary accuracy) and novel ones, such as the accuracy of efficiency metrics computed from instrument movements and system events.
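As an illustration of the conventional evaluation mentioned above, the following is a minimal sketch of a frame-wise Jaccard Index (intersection over union) computed per task and then averaged. The abstract does not state the exact definition used in the paper, so the per-task averaging, the function names, and the integer task labels here are assumptions for illustration only.

```python
from typing import Dict, Sequence


def per_task_jaccard(true_labels: Sequence[int],
                     pred_labels: Sequence[int]) -> Dict[int, float]:
    """Frame-wise Jaccard Index (intersection over union) for each task.

    Assumption: each video frame carries exactly one integer task label.
    """
    if len(true_labels) != len(pred_labels):
        raise ValueError("label sequences must have the same length")

    scores: Dict[int, float] = {}
    # Every task considered appears in at least one sequence,
    # so the union below is never zero.
    for task in sorted(set(true_labels) | set(pred_labels)):
        intersection = sum(
            1 for t, p in zip(true_labels, pred_labels)
            if t == task and p == task
        )
        union = sum(
            1 for t, p in zip(true_labels, pred_labels)
            if t == task or p == task
        )
        scores[task] = intersection / union
    return scores


def mean_jaccard(true_labels: Sequence[int],
                 pred_labels: Sequence[int]) -> float:
    """Unweighted mean of the per-task Jaccard scores (an assumed convention)."""
    scores = per_task_jaccard(true_labels, pred_labels)
    return sum(scores.values()) / len(scores)


# Hypothetical toy sequences: three tasks, one mislabeled frame.
expert = [0, 0, 0, 1, 1, 1, 2, 2]
model = [0, 0, 1, 1, 1, 1, 2, 2]
scores = per_task_jaccard(expert, model)
```

In this toy case a single mislabeled frame at a task boundary lowers the score of both neighboring tasks, which is why the measure is sensitive to boundary errors.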

Results: Our proposed model achieves a Jaccard Index of 0.85, thereby outperforming previous models on RARP. Additionally, we show that metrics computed from tasks automatically identified using RP-Net-V2 correlate well with metrics from tasks labeled by clinical experts.
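The metrics-based comparison described above can be sketched as follows. This is not the authors' pipeline: the abstract refers to efficiency metrics derived from instrument movements and system events and does not name the correlation measure, so this sketch substitutes task duration as a stand-in metric and uses Pearson correlation as an assumed choice. All function names and the `fps` parameter are hypothetical.

```python
import math
from typing import Sequence


def task_duration_seconds(labels: Sequence[int], task: int,
                          fps: float = 1.0) -> float:
    """Total time spent in `task`, from frame-level labels.

    Stand-in efficiency metric (assumption): the paper's metrics come from
    instrument movements and system events, which are not modeled here.
    """
    if fps <= 0:
        raise ValueError("fps must be positive")
    return sum(1 for label in labels if label == task) / fps


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient between two equal-length samples."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two samples of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sample")
    return cov / math.sqrt(var_x * var_y)


# Hypothetical per-case label sequences for the same three cases:
# one set from expert annotation, one from a recognition model.
expert_cases = [[0, 0, 1, 1, 1], [0, 1, 1, 1, 1], [0, 0, 0, 1, 1]]
model_cases = [[0, 0, 0, 1, 1], [0, 1, 1, 1, 1], [0, 0, 0, 0, 1]]

# Duration of task 1 in each case, under each labeling.
expert_metric = [task_duration_seconds(c, task=1) for c in expert_cases]
model_metric = [task_duration_seconds(c, task=1) for c in model_cases]
r = pearson(expert_metric, model_metric)
```

A correlation close to 1 across cases would suggest that the model's task boundaries are accurate enough for the downstream metric, even if individual frames are mislabeled.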

Conclusion: We demonstrate that metrics-based evaluation of surgical activity recognition models is a viable approach to determine when models can be used to quantify surgical efficiencies. We believe this approach and our results illustrate the potential for fully automated, postoperative efficiency reports.

Keywords: Machine learning; Robotic-assisted surgery; Surgeon training; Surgical activity recognition.
