472
Views
27
CrossRef citations to date
0
Altmetric
Original Articles

Semantic Feature Clustering for Sentiment Analysis of English Reviews

&
 

ABSTRACT

Sentiment analysis research has increased tremendously in recent times due to the wide range of business and social applications. Motivation behind sentiment analysis is that it provides companies’ methods to determine the product acceptance and ways to improve its quality. It also helps users to take purchasing decisions. Various parsing schemes/feature extraction methods have been proposed in the literature to process unstructured text to extract patterns that may help machine learning model to learn. The main limitation of the existing feature extraction techniques is the sparseness of the data and inability to incorporate semantic information. In this paper, a new feature extraction method is proposed, namely clustering features. Proposed feature extraction technique focuses on alleviating the data sparsity faced by supervised sentiment analysis by clustering of semantic features. Proposed clustering features are capable of including semantic information and alleviating data sparseness for machine learning algorithm. In all the experiments, support vector machine and Boolean Multinomial Naive Bayes (BMNB) machine learning algorithms are used for classification. Experimental results show that the proposed clustering features significantly outperform other features for document-level sentiment classification. All the experiments are performed on standard movie review data-set and product review data-sets, namely book, electronics, kitchen appliances.

Additional information

Notes on contributors

Basant Agarwal

Basant Agarwal is a doctoral candidate at Malaviya National Institute of Technology, Jaipur, India. He received his Master's degree in computer engineering from Malaviya National Institute of Technology, Jaipur, India. His research interest is in natural language processing, machine learning, sentiment analysis and opinion mining.

Email: [email protected]

Namita Mittal

Namita Mittal has been working as an assistant professor for the past 18 years at the Department of Computer Engineering, Malaviya National Institute of Technology, Jaipur, India. She has a PhD in computer engineering from Malaviya National Institute of Technology, Jaipur, India. She is involved in teaching undergraduate and graduate courses like database management, information retrieval, data mining, natural language processing and semantic web. She has published several research papers in international conferences and journals of repute.

Email: [email protected]

Reprints and Corporate Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

To request a reprint or corporate permissions for this article, please click on the relevant link below:

Academic Permissions

Please note: Selecting permissions does not provide access to the full text of the article, please see our help page How do I view content?

Obtain permissions instantly via Rightslink by clicking on the button below:

If you are unable to obtain permissions via Rightslink, please complete and submit this Permissions form. For more information, please visit our Permissions help page.