A FRAME WORK FOR WEB INFORMATION EXTRACTION AND ANALYSIS
Journal: INTERNATIONAL JOURNAL OF COMPUTERS & TECHNOLOGY (Vol.7, No. 2)Publication Date: 2013-01-01
Authors : Dr Sunitha Abburu; G. Suresh Babu;
Page : 574-579
Keywords : Web Crawling; Information Extraction; Data Mining; Data Analysis;
Abstract
Day by day the volume of information availability in the web is growing significantly. There are several data structures for information available in the web such as structured, semi-structured and unstructured. Majority of information in the web is presented in web pages. The information presented in web pages is semi-structured.? But the information required for a context are scattered in different web documents. It is difficult to analyze the large volumes of semi-structured information presented in the web pages and to make decisions based on the analysis. The current research work proposed a frame work for a system that extracts information from various sources and prepares reports based on the knowledge built from the analysis. This simplifies ?data extraction, data consolidation, data analysis and decision making based on the information presented in the web pages.The proposed frame work integrates web crawling, information extraction and data mining technologies for better information analysis that helps in effective decision making.?? It enables people and organizations to extract information from various sourses of web and to make an effective analysis on the extracted data for effective decision making.? The proposed frame work is applicable for any application domain. Manufacturing,sales,tourisum,e-learning are various application to menction few.The frame work is implemetnted and tested for the effectiveness of the proposed system and the results are promising.
Other Latest Articles
- MOBILE AGENT APPLICATION DEVELOPMENT IN A SIMPLE JAVA-BASED MOBILE AGENT SYSTEM (SIMMAS)
- Shape Matching and Recognition using Hybrid Features from Skeleton and Boundary
- Dynamic Clustering Protocol for Data Forwarding in Wireless Sensor Networks
- Efficient Detection of SPAM messages and SPAM zombies in the Internet using Naïve-Bayesian and Sequential Probability Ratio Test (SPRT)
- A PREDICTIVE CODING METHOD FOR LOSSLESS COMPRESSION OF IMAGES
Last modified: 2016-06-29 19:33:34