ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

TEXT INFORMATION EXTRACTION USING RULE BASED METHOD

Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.4, No. 8)

Publication Date:

Authors : ;

Page : 457-467

Keywords : Information Extraction; Text routing; Text Mining; knowledge discovery; structured data; Semi;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Information is hidden in large volume of files thus it is necessary to find useful information and extract it from file contents. Information Extraction (IE) is the task of automatically extracting structured information from unstructured or semi - structured documents. The data in all available files out of total 80% falls in category of unstructured text or semi structured text, this data is typically heavy, but may contain facts as well as very useful information. When we search any useful infor mation from files is very tedious, since searching algorithms have high complexity and require time to search each word. Or in today’s Era everything is going to be store in form of files in computers and both online and offline sources generate large amou nt of text data on daily basis. So gathering or retrieval of information from large volume of data via searching algorithm is not preferred so we use concept of information extraction. Many methods have been proposed for automating the process of extractio n, but due to the heterogeneity and lack of structure of file contents automated discovery of information still faces many challenges in new researches. This research paper will going to presents a system which is a powerful toolkit for rule - based informat ion extraction. Developed system is based on top down approach of rule based method and provides versatile information processing and advanced extraction techniques. We thoroughly describe the system and its capabilities for extraction and performance calc ulation based n certain parameters.

Last modified: 2015-08-17 19:55:18