Tunisian Arabic Chat Alphabet Transliteration Using Probabilistic Finite State Transducers
Journal: The International Arab Journal of Information Technology (Vol.16, No. 2)Publication Date: 2019-03-01
Authors : Nadia Karmani Hsan Soussou Adel Alimi;
Page : 295-303
Keywords : Tunisian arabic chat alphabet; tunisian arabic; transliteration; aebWordNet; tunisian arabic morphological analyzer; weighted finite state transducer;
Abstract
Internet is taking more and more scale in Tunisians life, especially after the revolution in 2011. Indeed, Tunisian Internet users are increasingly using social networks, blogs, etc. In this case, they favor Tunisian Arabic chat alphabet, which is a Latin-scripted Tunisian Arabic language. However, few tools were developed for Tunisian Arabic processing in this context. In this paper, we suggest developing a Tunisian Arabic chat alphabet-Tunisian Arabic transliteration machine based on weighted finite state transducers and using a Tunisian Arabic lexicon: aebWordNet (i.e., aeb is the ISO 639-3 code of Tunisian Arabic) and a Tunisian Arabic morphological analyzer. Weighted finite state transducers allow us to follow Tunisian Internet user's transcription behavior when writing Tunisian Arabic chat alphabet texts. This last has not a standard format but respects a regular relation. Moreover, it uses aebWordNet and a Tunisian Arabic morphological analyzer to validate the generated transliterations. Our approach attempts good results compared with existing Arabic chat alphabet-Arabic transliteration tools such as EiKtub.
Other Latest Articles
- Prediction of Future Vulnerability Discovery in Software Applications using Vulnerability Syntax Tree (PFVD-VST)
- Case Retrieval Algorithm Using Similarity Measure and Fractional Brain Storm Optimization for Health Informaticians
- An Efficient Algorithm for Extracting Infrequent Itemsets from Weblog
- Optimal Threshold Value Determination for Land Change Detection
- A Low-Power Self-service Bus Arrival Reminding Algorithm on Smart Phone
Last modified: 2019-04-28 19:25:08