Fine-tuning AraGPT2 for Hierarchical Arabic Text Classification
Journal: Science, Engineering and Technology (Vol.5, No. 1)Publication Date: 2025-04-30
Authors : Djelloul BOUCHIHA; Abdelghani BOUZIANE; Noureddine DOUMI; Benamar HAMZAOUI;
Page : 55-65
Keywords : ;
Abstract
Text classification consists in attributing a text to its corresponding category. It is a crucial task in natural language processing (NLP), with applications spanning content recommendation, spam detection, sentiment analysis, and topic categorization. While significant advancements have been made in text classification for widely spoken languages, Arabic remains underrepresented despite its large and diverse speaker base. Another challenge is that, unlike flat classification, hierarchical text classification involves categorizing texts into a multi-level taxonomy. This adds layers of complexity, particularly in distinguishing between closely related categories within the same super-class. To tackle these challenges, we propose a novel approach using AraGPT2, a variant of the Generative Pre-trained Transformer 2 (GPT-2) model adapted specifically for Arabic. Fine-tuning AraGPT2 for hierarchical text classification leverages the model's pre-existing linguistic knowledge and adapts it to recognize and classify Arabic text according to hierarchical structures. Fine-tuning, in this context, refers to the process of training a pre-trained model on a specific task or dataset to improve its performance on that task. Our experiments and comparative study demonstrate the efficiency of our solution. The fine-tuned AraGPT2 classifier achieves a hierarchical HF score of 80.64%, outperforming the machine learning-based classifier, which scores 41.90%.
Other Latest Articles
- Investigation of the Properties of Waste Expanded Polystyrene (EPS) Modified Bitumen
- Towards Sustainability: Tracking Carbon Footprint Trends at Ezemvelo KZN Wildlife
- Uncertainty Quantification and Sensitivity Analysis of Concrete Structure Using Multi-Linear Regression Technique
- Performance Analysis of Fractionalized Order PID Controller-based on Metaheuristic Optimisation Algorithms for Vehicle Cruise Control Systems
- FOLKLORDA MAXSUS JANRLARNING BOLALAR TARBIYASIDAGI AMALIY O’RNI
Last modified: 2025-06-28 06:33:43