Search in:

Computer Methods in Biomechanics and Biomedical Engineering: Imaging & Visualization Volume 9, 2021 - Issue 3: Special Issue: AE-CAI 2020

Submit an article Journal homepage

493

Views

CrossRef citations to date

Altmetric

Research Article

LapFormer: surgical tool detection in laparoscopic surgical video using transformer architecture

Satoshi KondoKonica Minolta, Inc., Osaka, JapanCorrespondence[email protected]
View further author information

Pages 302-307 | Received 16 Sep 2020, Accepted 07 Oct 2020, Published online: 21 Oct 2020

Cite this article
https://doi.org/10.1080/21681163.2020.1835550
CrossMark

Full Article
Figures & data
References
Citations
Metrics
Reprints & Permissions

References

Ba JL, Kiros JR, Hinton GE. 2016. Layer normalization. arXiv. 1607:06450.
Google Scholar
Cho K, van Merrienboer B, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. 2014. Learning phrase representations using RNN encoder-decoder for statistical machine translation. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing, EMNLP 2014, October 25–29, 2014, Doha, Qatar.
Google Scholar
Girdhar R, Carreira J, Doersch C, Zisserman A. 2019. Video action transformer network. 32nd IEEE conference on computer vision and pattern recognition, Long Beach, CA, United States, June 16–20, 2019.
Google Scholar
He K, Zhang X, Ren S, Sun J. 2016. Deep residual learning for image recognition. 29th IEEE conference on computer vision and pattern recognition, Las Vegas, Nevada, United States, June 27–30, 2016.
Google Scholar
Hochreiter S, Schmidhuber J. 1997. Long short-term memory. Neural Comput. 9(8):1735–1780. doi:10.1162/neco.1997.9.8.1735.
Google Scholar
Jin Y, Li H, Dou Q, Chen H, Qin J, Fu CW, Heng PA. 2020. Multi-task recurrent convolutional network with correlation loss for surgical video analysis. Med Image Anal. 59:101572. doi:10.1016/j.media.2019.101572.
Google Scholar
Kitaev N, Kaiser L, Levskaya A: Reformer: the efficient transformer. 2020. International Conference on Learning Representaitons (ICLR), Virtual Conference, Formerly Addis Ababa, Ethiopia.
Google Scholar
Krizhevsky A, Sutskever I, Hinton GE. 2012. Imagenet classification with deep convolutional neural networks. 26th Conference on Neural Information Processing Systems, NIPS 2012, Lake Tahoe, Nevada, United States, Dec. 3–8, 2012.
Google Scholar
Namazi B, Sankaranarayanan G, Devarajan V. 2019. LapTool-Net: a contextual detector of surgical tools in laparoscopic videos based on recurrent convolutional neural networks. arXiv. 1905:08983.
Google Scholar
Primus MJ, Schoeffmann K, Böszörmenyi L. 2015. Instrument classification in laparoscopic videos. 13th International Workshop on Content-Based Multimedia Indexing, CBMI 2015, Prague, Czech Republic, June 10–12, 2015.
Google Scholar
Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, Huang Z, Karpathy A, Khosla A, Bernstein M, et al. 2015. ImageNet large scale visual recognition challenge. Int J Comput Vision (IJCV). 115(3):211–252.
Google Scholar
Sokolova M, Lapalme G. 2009. A systematic analysis of performance measures for classification tasks. Inf Process Manag. 45(4):427–437.
Google Scholar
Szegedy C, Liu W, Jia Y, Sermanet P, Reed S, Anguelov D, Erhan D, Vanhoucke V, Rabinovich A. 2015. Going deeper with convolutions. 28th IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2015, Boston, MA, United States, June 7–12, 2015.
Google Scholar
Twinanda AP, Shehata S, Mutter D, Marescaux J, De Mathelin M, Padoy N. 2016. Endonet: A deep architecture for recognition tasks on laparoscopic videos. IEEE Trans Med Imaging. 36(1):86–97. doi:10.1109/TMI.2016.2593957.
Google Scholar
Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, Polosukhin I. 2017. Attention is all you need. 31st Conference on Neural Information Processing Systems, NeurIPS 2017, Long Beach, CA, United States, Dec. 5–7, 2017.
Google Scholar
Zhang M, Lucas J, Ba J, Hinton GE. 2019. Lookahead Optimizer: k steps forward, 1 step back. 33rd Conference on Neural Information Processing Systems, NeurIPS 2019, Vancouver Canada, United States, Dec. 10–12, 2019.
Google Scholar

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Order Reprints Request Corporate Permissions

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

Request Academic Permissions

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.

Share icon
Back to Top

Related research

People also read lists articles that other readers of this article have read.

Recommended articles lists articles that we recommend and is powered by our AI driven recommendation engine.

Cited by lists all citing articles based on Crossref citations.
Articles with the Crossref icon will open in a new tab.

People also read
Recommended articles
Cited by

To cite this article:

Reference style: APA Chicago Harvard

Citation copied to clipboard

Reference styles above use APA (6th edition), Chicago (16th edition) & Harvard (10th edition)

Download citation

Download a citation file in RIS format that can be imported by citation management software including EndNote, ProCite, RefWorks and Reference Manager.

Choose format: RIS BibTex RefWorks Direct Export

Choose options: Citation Citation & abstract Citation & references

LapFormer: surgical tool detection in laparoscopic surgical video using transformer architecture

References

Information for

Open access

Opportunities

Help and information

Your download is now in progress and you may close this window

Login or register to access this feature

LapFormer: surgical tool detection in laparoscopic surgical video using transformer architecture

References

Reprints and Corporate Permissions

Academic Permissions

Related research

To cite this article:

Download citation

Information for

Open access

Opportunities

Help and information

Keep up to date