Constructing twitter corpus of Iraqi Arabic Dialect (CIAD) for sentiment analysis
Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol.22, No. 2)Publication Date: 2022-28-04
Authors : Hassoun Al-Jawad M.M. Alharbi H. Almukhtar A.F. Alnawas A.A.;
Page : 308-316
Keywords : sentiment analysis; data mining; support vector machine; user behaviors; social media mining;
Abstract
The number of Twitter users in Iraq has increased significantly in recent years. Major events, the political situation in the country, had a significant impact on the content of Twitter and affected the tweets of Iraqi users. Creating an Iraqi Arabic Dialect corpus is crucial for sentiment analysis to study such behaviors. Since no such corpus existed, this paper introduces the Corpus of Iraqi Arabic Dialect (CIAD). The corpus has been collected, annotated and made publicly accessible to other researchers for further investigation. Furthermore, the created corpus has been validated using eight different combinations of four feature-selections approaches and two versions of Support Vector Machine (SVM) algorithm. Various performance measures were calculated. The obtained accuracy, 78 %, indicates a promising potential application.
Other Latest Articles
- Auxiliary arbitrary waveform generator for fiber optic gyroscope
- Algorithm for energy-efficient interaction of wireless sensor network nodes
- Classification of short texts using a wave model
- Methods of local features extraction in person authentication task by face thermographic image
- Cloud computing simulation model with a sporadic mechanism of parallel task solving control
Last modified: 2022-04-28 17:56:23