A Proposed Model for Extracting Information from Arabic-Based Controlled Text Domains, Discussing the Initial Model Steps
Journal: International Journal of Applied and Natural Sciences (IJANS) (Vol.7, No. 2)Publication Date: 2018-03-16
Authors : Mohammad Fasha Nadim Obeid; Bassam Hammo;
Page : 65-86
Keywords : Narabic Natural Language Processing POS Tagging Ontology Based Information Extraction Description Logic;
Abstract
Information extraction from Arabic as well as other languages text is commonly implemented over restricted text domains. Approaching open text domains is challenging, because of the syntactic, semantic and pragmatics ambiguities and variations in text. For the purpose of approaching more relaxed versions of Arabic text domains, Fasha et al. (Fasha et al. 2017) presented a high-level description fora proposed work methodology that can establish a model for extracting information from controlled text domains. In that work, controlled text domains were defined as the text domains that are not restricted in their linguistic features or their knowledge types yet they are not very unanticipated in these respects. In this paper, we discuss that work methodology and its implementation in more detail. Our discussion includes the initial phases of the methodology which covers the corpus preparation processes including its selection, analysis and annotation using a custom morpho-syntactic Part-of-Speech tagging scheme, we also discuss the designing of the supporting knowledge-base model which will be used to represent and process the extracted information. The information extraction algorithm itself shall be presented in a future work.
Other Latest Articles
- Redescription of Five Species of the Genus Lycodon (Boie, 1826) (Serpents, Colubridae) on the Basis of Morphological Variation Collected From Birbhum, West Bengal, India
- : 2319-3999; ISSN(E): 2319-4006 VARIABLE SELECTION PROCEDURES FOR LOGISTIC REGRESSI
- DELIRIUM POST CARDIAC SURGERY: REVIEW ON EPIDEMIOLOGY AND ASSOCIATED RISK FACTORS
- ANALYSIS OF MULTIPLE LINEAR REGRESSION MODELS USING SYMBOLIC INTERVAL-VALUED VARIABLES
- ORE’S THEOREM, LABELLED GRAPHS, FACEBOOK
Last modified: 2018-03-28 21:06:51