Extracting Person Name, Date and Place from Text Documents Using LEX Tool
Journal: International Journal of Advanced Computer Research (IJACR) (Vol.3, No. 9)
Publication Date: 2013-04-16
Authors : Roohi Sharma
Page : 26-29
Keywords : Regular Expressions; Finite State Automata; Information Extraction; Pattern Matching; Lexical Analyzer
Abstract
This paper details how person names, dates and places can be extracted from a text document using finite state automata and the LEX tool. Searching a text document manually for important information is slow, tedious and error-prone. Regular expressions are used to parse textual data, match patterns and extract variables. The lexical analyzer used in this research scans the input character by character and groups the characters into tokens. The paper describes a technique for identifying and extracting information with the LEX tool: it finds the names, dates and places that appear in a machine-readable text document. The regular expressions through which the required information is extracted are also discussed.
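The kind of pattern-driven extraction the abstract describes can be sketched as follows. This is a minimal illustration in Python rather than LEX, and every pattern in it is an assumption made for the example (a title such as "Dr." before a name, a day/month/year date, a preposition before a place); the paper's own regular expressions are not reproduced here. The names `TITLE_NAME`, `DATE`, `PLACE` and `extract` are hypothetical.

```python
import re

# Hypothetical patterns for illustration only; not the expressions from the paper.
# A person name: an honorific followed by one or more capitalized words.
TITLE_NAME = re.compile(r"\b(?:Mr|Mrs|Ms|Dr)\.\s+[A-Z][a-z]+(?:\s+[A-Z][a-z]+)*")
# A date: day and month (1-2 digits each) and a 4-digit year, separated by / or -.
DATE = re.compile(r"\b\d{1,2}[/-]\d{1,2}[/-]\d{4}\b")
# A place: capitalized words following "in", "at" or "from" (captured in group 1).
PLACE = re.compile(r"\b(?:in|at|from)\s+([A-Z][a-z]+(?:\s+[A-Z][a-z]+)*)")


def extract(text):
    """Return the names, dates and places matched by the patterns above."""
    return {
        "names": TITLE_NAME.findall(text),
        "dates": DATE.findall(text),
        "places": PLACE.findall(text),
    }


sample = "Dr. Anita Rao arrived in New Delhi on 16/04/2013."
result = extract(sample)
print(result)
```

In a LEX specification each pattern would presumably be a rule whose action records the matched token; the Python version only shows how far simple regular expressions go, and such heuristics will miss names without titles or places without a preceding preposition.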
Last modified: 2014-11-28 21:32:19