A Comparative Study of Transformer-based Models for Hate-Speech Detection in English-Kiswahili Code-Switched Social Media Text
Journal: International Journal of Advanced Trends in Computer Science and Engineering (IJATCSE) (Vol.13, No. 5)Publication Date: 2024-10-15
Authors : Fredrick Ng'ang'a Njung'e Aaron Mogeni Oirere Rachael Njeri Ndung'u;
Page : 181-186
Keywords : Code-Switching; English-Kiswahili; Hate Speech; Multilingual Language Understanding; Text Classification; Transformers;
Abstract
The transformer architecture, first introduced in 2017 by researchers at Google, has revolutionized natural language processing in various tasks, including text classification. This architecture formed the basis of future models such as those used in hate speech detection in code-switched text. In this research, we conduct a comparative study of transformer-based models for hate speech detection in English-Kiswahili code-switched text. First, the models were compared as feature extractors using a traditional classifier and then as end-to-end classifiers. The three multilingual transformer-based models compared include mBERT, mDistilBERT and XLM-RoBERTa, using SVM as the traditional classifier for the extracted features. The HateSpeech_Kenya dataset, sourced from Kaggle, was utilized in this study. As a feature extractor, mBERT's hidden states trained the highest-performing SVM with an accuracy of 0.5461 and a macro f1 score of 0.40. Among the three models evaluated, XLM-RoBERTa achieved the highest accuracy of 0.6069 and a macro f1 score of 0.49 on a balanced dataset. In contrast, mBERT achieved the highest accuracy of 0.7820 and a macro f1 score of 0.53 on an imbalanced dataset. The comparative study establishes that using transformer-based models as end-to-end classifiers generally performs better than using them as feature extractors with traditional classifiers. This is because directly training the models allows them to learn more task-specific features. Furthermore, the varying performance across balanced and imbalanced datasets highlights the need for careful model selection based on the dataset characteristics and specific task requirements
Other Latest Articles
- UNVEILING THE INFLUENCES ON MORAL JUDGMENT: A STUDY OF GRADE 9 STUDENTS IN ODISHA CONSIDERING BOARD OF STUDIES, SOCIO-ECONOMIC STATUS, INTELLIGENCE, AND GENDER
- PROMOTION OF GEOGRAPHICAL INDICATIONS: A CATALYST FOR RURAL ECONOMIC GROWTH IN INDIA
- STEM CELL RESEARCH IN INDIA: A CRITICAL STUDY OF SOCIO-LEGAL ISSUES
- NATURE AS A NARRATIVE FORCE: THE EPHEMERAL DICHOTOMY OF NATURE IN KEKI DARUWALLA’S POEMS
- EMOTIONAL COMPETENCE AND SELF-EFFICACY OF PUPIL TEACHERS OF SAMASTIPUR DISTRICT (BIHAR)
Last modified: 2024-10-20 22:11:45