ABSTRACT
Nowadays, a number of scripts are used for writing. Script identification finds many applications like sorting and preparing an online database of documents. Identifying these scripts, especially with different orientations and scales, is an important and challenging problem in document image analysis. This paper proposed a new scheme for script identification from word images using skew and scale robust log-polar curvelet features. These word images are first extracted in the form of text-patches from documents using Gaussian filtering. Thereafter, texture features are calculated using curvelet transform in log-polar domain. Log-polar domain is independent of rotation and scale variations, whereas curvelet transform exhibits directional and anisotropic properties. This helps in the extraction of significant features. For experiments, k-nearest neighbor classifier is employed to identify the scripts, as it has zero training time and is simple to implement. Further, statistical significance test is performed by using two more classifiers, namely random forest and support vector machine. Comprehensive experimentations are carried out on ALPH-REGIM, Pati and Ramakrishnan, PHDIndic_11, and proprietary databases containing printed as well as handwritten texts. Here, bi-script, tri-script, and multi-script identification results are reported. Benchmarking analysis illustrated the effectiveness of the proposed method, where a maximum recall rate of 98.76% has been achieved.
DISCLOSURE STATEMENT
No potential conflict of interest was reported by the authors.
Additional information
Notes on contributors
![](/cms/asset/748e1ffc-9800-44f1-ac26-26e50af2336e/tijr_a_1430516_uf0001_oc.jpg)
Parul Sahare
Parul Sahare received his post-graduate degree (MTech degree (VLSI DESIGN)) from Visvesvaraya National Institute of Technology, Nagpur, Maharastra, India in 2010. Currently, he is pursuing his PhD degree from Visvesvaraya National Institute of Technology, Nagpur, Maharashtra, India. He has a total of three years of academic experience. His area of interest includes signal processing, image processing, pattern recognition.
![](/cms/asset/07ca0d31-06a9-4302-aef5-fbc8b347f613/tijr_a_1430516_uf0002_oc.jpg)
Ravindra E. Chaudhari
Ravindra E. Chaudhari received his BE degree in electronics and telecommunication engineering from Government College of Engineering, Aurangabad, India, in 1999, and MTech degree, in electronics engineering from Visvesvaraya Regional College of Engineering, Nagpur in 2002. He is pursuing his PhD degree in electronics engineering from Visvesvaraya National Institute of Technology, Nagpur, and working as an assistant professor at St. Francis Institute of Technology, Mumbai, India. His research interests are in the areas of image/video and signal processing.
E-mail: [email protected]
![](/cms/asset/e97d1b4e-08d0-4a89-8698-0307b5d4a949/tijr_a_1430516_uf0003_oc.jpg)
Sanjay B. Dhok
Sanjay B. Dhok is Associate Professor in Centre for VLSI & Nanotechnology at Visvesvaraya National Institute of Technology, Nagpur (India). He received his PhD degree in electronics engineering from Visvesvaraya National Institute of Technology, Nagpur, India. He is a member of IEEE society. He has published many research papers in national and international journals and conferences. His area of interest includes signal processing, image processing, data compression, wireless sensor networks, and VLSI design.
E-mail: [email protected]