ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

EFFICIENT LOCAL RECODING ANONYMIZATION FOR DATASETS WITHOUT ATTRIBUTE HIERARCHICAL STRUCTURE

Proceeding: The Second International Conference on Cyber Security, Cyber Peacefare and Digital Forensic (CyberSec)

Publication Date:

Authors : ;

Page : 130-140

Keywords : Privacy Preserving; Data Mining; K-anonymity; Algorithm; Security;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Privacy is one of the main concerns in data publishing especially when releasing datasets involving human subjects contain sensitive information. Hence, to protect the privacy of individuals, a model that is widely used for privacy preservation in publishing micro-data, is k-anonymity. It reduces the linking confidence between sensitive information and specific individual by 1/k ratio. However, k-anonymous dataset loses its accuracy due to the information loss. Most of the existing k-anonymization approaches suffer from huge information loss. In this paper we study the information loss issue and we propose a new model based on distance calculation between tuples including numerical and categorical attributes which is independent of attributes hierarchical structures. Then based on the proposed model we present the SpatialDistance (SD) heuristic algorithm for k-anonymization. Our extensive study on real datasets shows that the proposed algorithm in comparison with existing well-known algorithms offers much higher data utility and reduces the information loss significantly. It also provides higher privacy protection for outliers.

Last modified: 2013-06-18 22:05:50