Computers & Computing

Deep Learning-based Hate Speech Detection in Code-mixed Tamil Text


Abstract

Social media is a major channel of communication. People use platforms such as Twitter, Facebook, and Instagram to share their ideas, opinions, and feelings, and users of different age groups, cultures, and educational backgrounds rely on these powerful media. Although social media supports knowledge sharing among users, it also has a dark side: despite the restrictions imposed by the platforms, many users post abusive language to damage the status and image of others. Governments and social media platforms therefore urgently need to filter out such hate text before it spreads. Hate text detection is an emerging research topic in Natural Language Processing in which a model predicts whether a given text is hate text or not. Automated hate text detection is difficult for Indian languages because of a lack of data. Moreover, people in India are often multilingual and use code-mixed patterns to express their thoughts. The unavailability of an annotated Tamil-English dataset and the lack of a standard model make the task more challenging. To handle such code-mixed data, we create a dataset of 10,000 Tamil-English code-mixed texts collected from Twitter, each annotated as hate text or non-hate text. We use a synonym-based Bi-LSTM model to classify tweets as hate or non-hate text.
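The abstract does not describe how the synonym-based step works, so the following is only a minimal sketch of one plausible reading: spelling variants of romanized Tamil words are mapped to a single canonical form before the text is converted to fixed-length integer sequences of the kind a Bi-LSTM classifier typically consumes. The synonym table, the example sentences, and all function names are hypothetical illustrations and are not taken from the paper or its dataset.

```python
import string

# Hypothetical synonym table (an assumption, not from the paper): maps
# spelling variants of romanized Tamil words to one canonical form.
SYNONYMS = {
    "nala": "nalla",
    "nallaa": "nalla",
    "rombha": "romba",
    "rmba": "romba",
}

PAD, UNK = "<pad>", "<unk>"


def tokenize(text):
    """Lowercase, split on whitespace, and strip ASCII punctuation from token edges."""
    tokens = []
    for raw in text.lower().split():
        token = raw.strip(string.punctuation)
        if token:
            tokens.append(token)
    return tokens


def normalize(tokens, synonyms=SYNONYMS):
    """Replace each token with its canonical form if the table lists one."""
    return [synonyms.get(token, token) for token in tokens]


def build_vocab(token_lists):
    """Assign an integer id to every distinct token; ids 0 and 1 are reserved."""
    vocab = {PAD: 0, UNK: 1}
    for tokens in token_lists:
        for token in tokens:
            vocab.setdefault(token, len(vocab))
    return vocab


def encode(tokens, vocab, max_len):
    """Map tokens to ids, truncating or right-padding to exactly max_len."""
    ids = [vocab.get(token, vocab[UNK]) for token in tokens[:max_len]]
    return ids + [vocab[PAD]] * (max_len - len(ids))


# Made-up example sentences that differ only in spelling and punctuation.
texts = ["Movie rombha nala irukku!", "movie romba nalla irukku"]
prepared = [normalize(tokenize(text)) for text in texts]
vocab = build_vocab(prepared)
encoded = [encode(tokens, vocab, max_len=6) for tokens in prepared]
```

Under these assumptions both sentences end up with the same id sequence, so a downstream classifier would see one vocabulary entry per word rather than one per spelling variant. The model itself is not shown here; the paper's actual architecture and hyperparameters are not given in the abstract.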

DISCLOSURE STATEMENT

No potential conflict of interest was reported by the author(s).

Additional information

Notes on contributors

S. Anbukkarasi

S. Anbukkarasi received the BTech degree in information technology and the ME degree in computer science and engineering. She is currently pursuing a PhD degree. Her research areas include natural language processing and deep learning.

S. Varadhaganapathy

S. Varadhaganapathy is a professor in the Department of Information Technology at Kongu Engineering College, Erode. His areas of interest include deep learning and image processing.
