350
Views
3
CrossRef citations to date
0
Altmetric
Research Article

Towards video-based surgical workflow understanding in open orthopaedic surgery

, , , , , , & show all
Pages 286-293 | Received 21 Sep 2020, Accepted 07 Oct 2020, Published online: 02 Dec 2020

References

  • Bayliss LE, Culliford D, Monk AP, Glyn-Jones S, Prieto-Alhambra D, Judge A, Cooper C, Carr AJ, Arden NK, Beard DJ, et al. 2017. The effect of patient age at intervention on risk of implant revision after total replacement of the hip or knee: a population-based cohort study. Lancet. 389:1424–1430. doi:10.1016/S0140-6736(17)30059-4.
  • Bodenstedt S, Wagner M, Katić D, Mietkowski P, Mayer B, Kenngott H, Müller-Stich B, Dillmann R, Speidel S. 2017. Unsupervised temporal context learning using convolutional neural networks for laparoscopic workflow analysis. arXiv Preprint arXiv. 1702:03684.
  • Cook R, Davidson P, Martin R. 2019. More than 80% of total knee replacements can last for 25 years. BMJ. 367. BMJ Publishing Group Ltd. doi:10.1136/bmj.l5680
  • Deng J, Dong W, Socher R, Li LJ, Li K, Fei-Fei L. 2009. Imagenet: A large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition, Miami, Florida. p. 248–255.
  • Deng J, Guo J, Xue N, Zafeiriou S. 2019. Arcface: additive angular margin loss for deep face recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Long Beach, CA. p. 4690–4699.
  • EndoVis Challenge. https://endovis.grand-challenge.org/. 2020, accessed Mar 2020.
  • Flouty E, Zisimopoulos O, Stoyanov D. 2018. FaceOff: anonymizing videos in the operating rooms. In: OR 2.0 Context-Aware Operating Theaters, Computer Assisted Robotic Endoscopy, Clinical Image-Based Procedures, and Skin Image Analysis, Granada, Spain: Springer. p. 30–38.
  • Funke I, Jenke A, Mees ST, Weitz J, Speidel S, Bodenstedt S. 2018. Temporal coherence-based self- supervised learning for laparoscopic workflow analysis. In: OR 2.0 Context-Aware Operating Theaters, Computer Assisted Robotic Endoscopy, Clinical Image-Based Procedures, and Skin Image Analysis, Granada, Spain: Springer. p. pp. 85–93.
  • Graves A, Fernández S, Schmidhuber J. 2005. Bidirectional LSTM networks for improved phoneme classification and recognition. Int Conf Artif Neural Networks. 3697:799–804.
  • Houghton CE, Casey D, Shaw D, Murphy K. 2010. Ethical challenges in qualitative research: examples from practice. Nurse Res. 18:15–25.
  • Hu J, Shen L, Sun G. 2018. Squeeze-and-excitation networks. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT. p. 7132–7141.
  • Jain V, Learned-Miller E. 2010. FDDB: A benchmark for face detection in unconstrained settings. university of massachusetts, Amherst. Tech. Rep. UM-CS-2010-009 2. p. 8.
  • Kadkhodamohammadi A, Gangi A, de Mathelin M, Padoy N. 2017. Articulated clinician detection using 3d pictorial structures on RGB-D data. Med Image Anal. 35:215–224. doi:10.1016/j.media.2016.07.001.
  • Kadkhodamohammadi A, Luengo I, Barbarisi S, Taleb H, Flouty E, Stoyanov D. 2019. Feature aggregation decoder for segmenting laparoscopic scenes. In: OR 2.0 Context-Aware Operating Theaters and Machine Learning in Clinical Neuroimaging; Cham: Springer International Publishing. p. 3–11.
  • Li J, Wang Y, Wang C, Tai Y, Qian J, Yang J, Wang C, Li J, Huang F. 2019. DSFD: dual shot face detector. In: The IEEE Conference on Computer Vision and Pattern Recognition (CVPR); June. Long Beach, CA. p. 5060–5069.
  • Maier-Hein L, Vedula SS, Speidel S, Navab N, Kikinis R, Park A, Eisenmann M, Feussner H, Forestier G, Giannarou S, et al. 2017. Surgical data science for next-generation interventions. Nat Biomed Eng. 1:691–696.
  • Padoy N. 2019. Machine and deep learning for workflow recognition during surgery. Minim Invasive Ther Allied Technol. 28:82–90.
  • Simonyan K, Zisserman A. 2015. Very deep convolutional networks for large-scale image recognition. In International Conference on Learning Representations. San Diego, CA.
  • Srivastav V, Gangi A, Padoy N, 2019. Human pose estimation on privacy-preserving low- resolution depth images. In: Medical Image Computing and Computer Assisted Intervention – MIC- CAI 2019; Shenzhen, China: Springer. p. 583–591.
  • Twinanda AP, Alkan EO, Gangi A, de Mathelin M, Padoy N. 2015. Data-driven spatio-temporal RGBD feature encoding for action recognition in operating rooms. Int J Comput Assisted Radiol Surg. 10:pp. 737–747. doi:10.1007/s11548-015-1186-1.
  • Twinanda AP, Shehata S, Mutter D, Marescaux J, De Mathelin M, Padoy N. 2016a. Endonet: A deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imag. 36:pp. 86–97. doi:10.1109/TMI.2016.2593957.
  • Twinanda AP, Winata P, Gangi A, Mathelin M, Padoy N. 2016b. Multi-stream deep architecture for surgical phase recognition on multi-view RGBD videos. In: Proc. M2CAI Workshop MICCAI. Athens, Greece. p. pp. 1–8.
  • Vercauteren T, Unberath M, Padoy N, Navab N. 2020. Cai4cai: the rise of contextual artificial intelligence in computer assisted interventions. Proc IEEE. 108:198–214.
  • Yang S, Luo P, Loy CC, Tang X. 2016. WIDER FACE: A face detection benchmark. In: Proceedings of the IEEE conference on computer vision and pattern recognition. Las Vegas, Nevada. p. 5525–5533.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.