Supervised Classifiers to Identify Hate Speech on English and Spanish Tweets

TítuloSupervised Classifiers to Identify Hate Speech on English and Spanish Tweets
AutoresSattam Almatarneh, Pablo Gamallo, Francisco J. Ribadas Pena, and Alexey Alexeev
TipoComunicación para congreso
Fonte 21st International Conference on Asia-Pacific Digital Libraries, Kuala Lumpur, Malaysia, 2019.
AbstractConsistently with social and political concern about hatred and harassment through social media, in recent years, automatic hate speech detection and offensive behavior in social media are gaining a lot of attention. In this paper, we examine the performance of several supervised classifiers in the process of identifying hate speech on Twitter. More precisely, we do an empirical study that analyzes the influence of two types of linguistic features (n-grams, word embeddings) when they are used to feed different supervised machine learning classifiers: Support Vector Machine (SVM), Gaussian Naive Bayes (GNB), Complement Naive Bayes (CNB), Decision Tree (DT), Nearest Neighbors (KN), Random Forest (RF) and Neural Network (NN). The experiments we have carried out show that CNB, SVM, and RF are better than the rest classifiers in English and Spanish languages by taking into account all features.
Palabras chaveHate speech, Sentiment Analysis, linguistic features, Classification, Supervised Machine Learning