Constructing twitter corpus of Iraqi Arabic Dialect (CIAD) for sentiment analysis

Journal: Scientific and Technical Journal of Information Technologies, Mechanics and Optics (Vol.22, No. 2)

Publication Date:

Authors : ;

Page : 308-316

Keywords : sentiment analysis; data mining; support vector machine; user behaviors; social media mining;

Abstract

The number of Twitter users in Iraq has increased significantly in recent years. Major events and the political situation in the country have had a significant impact on Twitter content and on the tweets of Iraqi users. Studying such behavior through sentiment analysis requires an Iraqi Arabic Dialect corpus. Since no such corpus existed, this paper introduces the Corpus of Iraqi Arabic Dialect (CIAD). The corpus has been collected, annotated, and made publicly accessible to other researchers for further investigation. The corpus has also been validated using eight combinations of four feature-selection approaches and two versions of the Support Vector Machine (SVM) algorithm, and various performance measures were calculated. The obtained accuracy of 78% indicates promising potential for application.

Last modified: 2022-04-28 17:56:23