ABSTRACT
Text mining has become a major research topic, and text classification is an important task within it for finding relevant information in new documents. This paper presents a semantic word processing technique for text categorization that uses semantic keywords instead of treating the keywords in the documents as independent features, which reduces the dimensionality of the search space. The Back Propagation Lion algorithm (BPLion algorithm) is also proposed to overcome the problem of updating the neuron weights. The proposed text classification methodology is evaluated on two data sets, 20 Newsgroups and Reuters. The performance of the proposed BPLion algorithm is analysed in terms of sensitivity, specificity, and accuracy, and compared with that of existing works. The results show that the proposed BPLion algorithm and semantic processing methodology classify documents with less training time and higher classification accuracy, reaching 90.9%.
Disclosure statement
No potential conflict of interest was reported by the authors.
Notes on contributors
Nihar M. Ranjan obtained a BE in computer engineering from North Maharashtra University, Jalgaon, Maharashtra, and an ME in computer science and engineering from V.T.U., Belgaum, Karnataka, in 2000 and 2008, respectively. He is currently working as an assistant professor at Sinhgad Institute of Technology and Science, Narhe, Pune. His research interests are data mining, text mining, and text analytics. He has more than 10 publications in various international journals and conferences.
Rajesh S. Prasad obtained a BE and an ME in computer engineering from North Maharashtra University, Jalgaon, Maharashtra, and Pune University, Maharashtra, in 1996 and 2003, respectively. He obtained a PhD in computer engineering from SRTM University, Nanded, Maharashtra, in 2013. He is currently working as a professor at NBN Sinhgad School of Engineering, Pune. His research interests are data mining, text mining, and text analytics. He has more than 35 publications in various international journals and conferences.