ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Network Pruning-Detecting Duplicate Efficiently in XML Data.

Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.3, No. 4)

Publication Date:

Authors : ; ;

Page : 2063-2065

Keywords : XML; Duplicate detection; Bayesian networks; Network pruning;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Duplicate detection is a non-trivial task in which duplicates are not exactly equal due to error in the data and objects. The existing system uses a method called XMLDup. It considers only the XML data files to detect duplicate and non duplicate files. This method uses Bayesian network model to determine the probability of two XML elements being duplicate. It also uses network pruning algorithm to increase the BN evaluation time. This algorithm achieve high precision and recall scores in terms of both efficiency and effectiveness. In the proposed work aimed to extend the BN evaluation time using machine learning algorithm.

Last modified: 2014-05-10 21:25:25