390
Views
2
CrossRef citations to date
0
Altmetric
Original Articles

A Novel Word-Spotting Method for Handwritten Documents Using an Optimization-Based Classifier

&

References

  • Adamek, T. N. 2007. Word matching using single closed contours for indexing handwritten historical documents. International Journal of Document Analysis and Recognition (IJDAR) 9 (2–4):153–65. doi:10.1007/s10032-006-0024-y.
  • Aldavert, D. A. 2015. A study of Bag-of-Visual-Words representations for handwritten keyword spotting. International Journal on Document Analysis and Recognition (IJDAR) 18 (3):223–34. doi:10.1007/s10032-015-0245-z.
  • Al-Khayat, M. 2014. Learning-based arabic word spotting using a hierarchical classifier. PhD. dis., Computer Science and Software Engineering, Concordia University.
  • Antonacopoulos, A. A. 2007. Special issue on the analysis of historical documents. International Journal on Document Analysis and Recognition 9 (2):75–7. doi:10.1007/s10032-007-0045-1.
  • Ball, G. R. 2006. Segmentation-based and segmentation free approaches to Arabic word spotting. In Tenth International Workshop on Frontiers in Handwriting Recognition, La Baule, France, October 2006.
  • Belongie, S. J. 2002. Shape matching and object recognition using shape contexts. Pattern Analysis and Machine Intelligence, IEEE Transactions On 24 (4):509–22. doi:10.1109/34.993558.
  • Bhardwaj, A. D. 2008. Script independent word spotting in multilingual documents. In the Second International workshop on “Cross Lingual Information Access-Jan 2008”, 48–54.
  • Blessy Varghese, S. G. 2015. A survey on various word spotting techniques for content based document image retrieval. International Journal of Computer Science and Information Technologies 3 (6):2682–86.
  • Cheriet, M. A. 2012. A robust word spotting system for historical arabic manuscripts. In Guide to OCR for arabic scripts, ed. by H. E. Volker Märgner, 453–84. London: Springer.
  • Dash, M., and H. Liu. 1997. Feature selection for classification. Intelligent Data Analysis 1 (3):131–56. doi:10.1016/S1088-467X(97)00008-5.
  • David, J. E. 2005. Making latin manuscripts searchable using gHMM’s. Advances in Neural Information Processing Systems 17: Proceedings of the 2004 Conference, vol. 17, 385. Vancouver, British Columbia, Canada: MIT Press.
  • Doermann, D. 1998. The indexing and retrieval of document images: A survey. Computer Vision and Image Understanding 70 (3):287–98. doi:10.1006/cviu.1998.0692.
  • Fernández, D. J. 2011. Handwritten word spotting in old manuscript images using a pseudo-structural descriptor organized in a hash structure. In Pattern recognition and image analysis, ed. J. M. Jordi Vitrià, 628–35. Spain: Springer, Berlin, Heidelberg.
  • Fischer, A. A. 2012. Lexicon-free handwritten word spotting using character HMMs. Pattern Recognition Letters 33 (7):934–42. doi:10.1016/j.patrec.2011.09.009.
  • Fischer, A. A. 2010. HMM based word spotting in handwritten documents using subword models. 2010 20th international conference onPattern recognition (ICPR), 3416–19. Istanbul, Turkey: IEEE.
  • Fornés, A. V. 2011. A keyword spotting approach using blurred shape model-based descriptors. Proceedings of the 2011 Workshop on Historical Document Imaging and Processing, 83–90. Beijing, China: ACM, September 2011.
  • Haji, M. 2012. Arbitrary Keyword Spotting in Handwritten Documents. PhD thesis, Computer Science at Concordia University.
  • Hangarge, M. A. 2016. Gabor wavelets based word retrieval from kannada documents. Procedia Computer Science 79:441–48. doi:10.1016/j.procs.2016.03.057.
  • Liu, H. and H. Motoda. 1998. Feature selection for knowledge discovery and data mining, vol. 454, Norwell, MA, USA: Kluwer Academic Publishers.
  • Huang, L. F.-H.-L. 2011. Keyword spotting in offline Chinese handwritten documents using a statistical model. 2011 International Conference on Document Analysis and Recognition (ICDAR), 78–82. Beijing, China: IEEE, September 2011.
  • Jain, A. K. 2003. Indexing and retrieval of on-line handwritten documents. 2003 12th International Conference on Document Analysis and Recognition, 655–55. Edinburgh, UK: IEEE Computer Society, August 2003.
  • Lam, L. S.-W. 1992. Thinning Methodologies-A comprehensive survey. IEEE Transactions on Pattern Analysis and Machine Intelligence, 14(9): 869–885.
  • Manmatha, R. C. 1996. Word spotting: A new approach to indexing handwriting. Proceedings CVPR’96, 1996 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 1996, 631–37. San Francisco, CA, USA: IEEE.
  • Moghaddam, R. F. 2009. Application of multi-level classifiers and clustering for automatic word spotting in historical document images. 10th International Conference on Document Analysis and Recognition, 2009. ICDAR’09, 511–15. Barcelona, Spain: IEEE.
  • Mozaffari, S., H. El Abed, V. Margner, K. Faez, and A. Amirshahi. 2008. IfN/Farsi-Database: A database of Farsi handwritten city names. International Conference on Frontiers in Handwriting Recognition, Montreal, Quebec, Canada, August 2008.
  • Murugappan, A. A. 2011. A survey of keyword spotting techniques for printed document images. Artificial Intelligence Review 35 (2):119–36. doi:10.1007/s10462-010-9187-5.
  • Pechwitz, M., S. S. Maddouri, V. Margner, N. Ellouze, and H. Amiri. 2002. IFN/ENIT-database of handwritten Arabic words. Proc. of CIFED, vol. 2, 127–36, Hammamet, Tunisia, October 2002.
  • Rath, R. M. 2003a. Indexing of handwritten historical documents-recent progress. Proceedings 2003 Symposium on Document Image Understanding Technology, 77. Greenbelt, Maryland: UMD.
  • Rath, T. M. 2003b. Features for word spotting in historical manuscripts. Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on, 218–22. Edinburgh, UK: IEEE.
  • Rath, T. M. 2003c. Word image matching using dynamic time warping. Proceedings of 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, vol. 2, II–521. IEEE.
  • Riba, P. A. 2015. Handwritten word spotting by inexact matching of grapheme graphs. 2015 13th International Conference on Document Analysis and Recognition (ICDAR), 781–85. Tunis, Tunisia: IEEE.
  • Rodrıguez, J. A. 2008. Local gradient histogram features for word spotting in unconstrained handwritten documents. Int. Conf. on Frontiers in Handwriting Recognition Montreal, Quebec, Canada, August 2008.
  • Rodríguez-Serrano, J. A. 2009. Handwritten word-spotting using hidden Markov models and universal vocabularies. Pattern Recognition 42 (9):2106–16. doi:10.1016/j.patcog.2009.02.005.
  • Rodríguez-Serrano, J. A. 2012. A model-based sequence similarity with application to handwritten word spotting. Pattern Analysis and Machine Intelligence, IEEE Transactions On 34 (11):2108–20. doi:10.1109/TPAMI.2012.25.
  • Rose, R. C. 1990. A hidden Markov model based keyword recognition system. 1990 International Conference on Acoustics, Speech, and Signal Processing, 1990. ICASSP-90, 129–32. Albuquerque, NM, USA: IEEE.
  • Rothfeder, J. L. 2003. Using corner feature correspondences to rank word images by similarity. Computer Vision and Pattern Recognition Workshop, 2003. CVPRW’03. Conference on, vol. 3, 30–30. Madison, Wisconsin, USA: IEEE.
  • Rusiñol, M. A. 2014. Boosting the handwritten word spotting experience by including the user in the loop. Pattern Recognition 47 (3):1063–72. doi:10.1016/j.patcog.2013.07.008.
  • Rusinol, M. D. 2011. Browsing heterogeneous document collections by a segmentation-free word spotting method. Document Analysis and Recognition (ICDAR), 2011 International Conference on, 63–67. Beijing, China: IEEE.
  • Saabni, R. A. 2011a. Fast key-word searching via embedding and Active-DTW. 2011 International Conference on Document Analysis and Recognition (ICDAR), 68–72. Beijing, China: IEEE.
  • Saabni, R. A.-S. 2008. Keyword searching for Arabic handwritten documents. Proc. 11th International Conference on Frontiers in Handwriting Recognition, (ICFHR), vol. 11, 271–77. Montreal, Quebec, Canada: IEEE.
  • Sagheer, M. W. 2010. Novel Word Recognition and Word Spotting Systems for Offline Urdu Handwriting. PhD, Thesis, Concordia University, Computer Science Department, Montreal, Quebec, Canada.
  • Sari, T. L. 2002. Off-line handwritten Arabic character segmentation algorithm: ACSA. Proceedings of Eighth International Workshop onFrontiers in Handwriting Recognition, 2002, 452–57. Ontario, Canada: IEEE.
  • Saykol, E. A. 2004. Content-based retrieval of historical Ottoman documents stored as textual images. IEEE Transactions on Image Processing 13 (3):314–25. doi:10.1109/TIP.2003.821114.
  • Shahab, S. A.-K. 2006. Computer aided indexing of historical manuscripts. 2006 International Conference on Computer Graphics, Imaging and Visualisation, 287–95. Sydney, Qld, Australia: IEEE.
  • Shenoy, P. D., K. Srinivasa, K. Venugopal, and L. M. Patnaik. 2005. Dynamic association rule mining using genetic algorithms. Intelligent Data Analysis 9 (5):439–53.
  • Srihari, S. A. 2005. Handwritten arabic word spotting using the cedarabic document analysis system. Proc. Symposium on Document Image Understanding Technology (SDIUT-05), 123–32. Adelphi, Maryland, Novamber 2005.
  • Stejic, Z., Y. Takama, and K. Hirota. 2004. Modified hierarchical genetic algorithm for relevance feedback in image retrieval. Intelligent Data Analysis 8 (4):363–84.
  • Terasawa, K. A. 2009. Slit style HOG feature for document image word spotting. Document Analysis and Recognition, 2009. ICDAR’09. 10th International Conference on 116–20. Spain: IEEE, July 2009.
  • V. Frinken, A. R. 2012. A novel word spotting method based on recurrent neural networks, IEEE Transactions on Pattern Analysis and Machine Intelligence. Pattern Analysis and Machine Intelligence, IEEE Transactions On 34 (2):211–24. doi:10.1109/TPAMI.2011.113.
  • Wshah, S., G. Kumar, and V. Govindaraju. 2014. Statistical script independent word spotting in offline handwritten documents. Pattern Recognition 47 (3):1039–50. doi:10.1016/j.patcog.2013.09.019.
  • Wshah, S. G. 2012. Script independent word spotting in offline handwritten documents based on hidden Markov models. 2012 International Conference on Frontiers in Handwriting Recognition (ICFHR), 14–19. Bari, Italy: IEEE, September 2012.
  • Ye, Q. A. 2015. Text detection and recognition in imagery: A survey. IEEE Transactions on Pattern Analysis and Machine Intelligence 37 (7):1480–500. doi:10.1109/TPAMI.2014.2366765.
  • Zhang, B. S. 2003. Word image retrieval using binary features. Electronic Imaging 2004, 45–53. International Society for Optics and Photonics. San Jose, California, USA: International Society for Optics and Photonics, December 2003.

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.