Network Pruning-Detecting Duplicate Efficiently in XML Data.Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.3, No. 4)
Publication Date: 2014-04-30
Authors : M.Lakshmipriya; G.Loganathan;
Page : 2063-2065
Keywords : XML; Duplicate detection; Bayesian networks; Network pruning;
Duplicate detection is a non-trivial task in which duplicates are not exactly equal due to error in the data and objects. The existing system uses a method called XMLDup. It considers only the XML data files to detect duplicate and non duplicate files. This method uses Bayesian network model to determine the probability of two XML elements being duplicate. It also uses network pruning algorithm to increase the BN evaluation time. This algorithm achieve high precision and recall scores in terms of both efficiency and effectiveness. In the proposed work aimed to extend the BN evaluation time using machine learning algorithm.
Other Latest Articles
Last modified: 2014-05-10 21:25:25