Techniques for Duplicate Detection in Hierarchical Data
Journal: International Journal of Science and Research (IJSR) (Vol.4, No. 7)Publication Date: 2015-07-05
Authors : Suvarna Kale; Basha Vankudothu;
Page : 721-723
Keywords : Duplicate detection; XML data; hierarchical structure; candidate pair;
Abstract
Duplicate detection is nothing but finding multiple representations of a same object and also object which are represented in a dataset. The duplicate detection is important to integration and data cleaning applications and it is studied for relational data in single table, but now data is stored in complex form. In this paper we improve the efficiency and effectiveness of duplicate detection by considering relationship between ancestors and descendants. We apply this strategy by implementing two algorithms RECONA and ADAMA. Recona re-examine an object if its induce neighbours is duplicates. This will reduce re-comparison of elements. Adama is efficient because it does not allow re-comparison
Other Latest Articles
- A Survey on Service Oriented Architecture in Remote Collaboration Systems
- Intelligent Energy Management System based on FPGA and GSM
- Design Mathematical Modeling and Analysis of Underwater Glider
- Improve Quality of Service (QOS) Using Equalization Techniques
- Computational Fluid Analysis for High Lift on Multi-Element Airfoil Used in Civilian Aircraft
Last modified: 2021-06-30 21:50:52