
Automatic Annotation Search from Web-Database

Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.4, No. 1)


Page : 254-261

Keywords : Data alignment; Data annotation; Wrapper-generation; Alignment;


Abstract

We address the problem of automatically extracting data objects from a given web site and assigning meaningful labels to the data. We focus mainly on web sites that provide a complex HTML search form, rather than simple keyword search, for users to query the back-end database. Solving this problem allows both the data and its schema to be captured from such sites, which makes further manipulation and integration of the data easier. The problem is difficult for three reasons. First, the system must deal with HTML search forms, which are designed for human use; this makes it difficult to identify all the form elements automatically and submit correct queries. Second, the wrapper generated for each HTML page needs to be efficient enough to extract not only plain data but also nested data structures. Third, the generated wrapper is usually based on the structure of the HTML tags, which may not reflect the real database structure, and the original database field names are generally not encoded in the web pages. In addition, for large-scale data, the solution needs to be automatic and fast. Online shopping today enjoys great popularity and rapid growth, and the Web has become the most important medium for many applications, such as e-commerce and digital libraries. Database-driven web sites have their own interfaces and access methods for creating HTML pages on the fly. Web database techniques define the various ways to connect to database servers and retrieve data from them. In this paper, we present an automatic annotation (label assignment) approach that first aligns the data units on a result page into groups such that the data in the same group have the same meaning, and then assigns a label to each group. An annotation wrapper for the search site is automatically constructed and can be used to assign labels to new result pages from the semantic web.
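
The align-then-label idea in the abstract can be illustrated with a small sketch. This is not the paper's algorithm: it assumes each search result record has already been split into an ordered list of text units, aligns units purely by position, and labels each group by majority vote over a few hand-written patterns. The names `align_units`, `label_group`, `build_wrapper`, `apply_wrapper`, and the `PATTERNS` table are all hypothetical, chosen only for illustration.

```python
import re
from collections import Counter

# Hypothetical label patterns; a real system would use richer features.
PATTERNS = [
    ("price", re.compile(r"^\$\d+(\.\d{2})?$")),
    ("year", re.compile(r"^(19|20)\d{2}$")),
]


def align_units(records):
    """Group data units by position (assumption: same position, same meaning)."""
    width = max(len(record) for record in records)
    groups = [[] for _ in range(width)]
    for record in records:
        for i, unit in enumerate(record):
            groups[i].append(unit)
    return groups


def label_group(group):
    """Pick the label that matches the most units; fall back to 'text'."""
    votes = Counter()
    for unit in group:
        for label, pattern in PATTERNS:
            if pattern.match(unit):
                votes[label] += 1
                break
        else:
            votes["text"] += 1
    return votes.most_common(1)[0][0]


def build_wrapper(records):
    """Return one label per aligned group, in positional order."""
    return [label_group(group) for group in align_units(records)]


def apply_wrapper(labels, record):
    """Reuse previously learned labels on a record from a new result page."""
    return dict(zip(labels, record))


sample_page = [
    ["Dune", "1965", "$9.99"],
    ["Neuromancer", "1984", "$7.50"],
]
labels = build_wrapper(sample_page)
annotated = apply_wrapper(labels, ["Foundation", "1951", "$8.25"])
```

Here `build_wrapper` plays the role of constructing the annotation wrapper from one result page, and `apply_wrapper` shows how the learned labels could be reused on a record from a later page without repeating the alignment step.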

Last modified: 2015-01-22 23:07:37