ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Correlation between Privacy Preserving Data Publishing and Feature Selection Stability

Journal: International Journal of Emerging Trends & Technology in Computer Science (IJETTCS) (Vol.4, No. 52)

Publication Date:

Authors : ; ;

Page : 1-3

Keywords : Keywords: data mining; feature selection stability; Jaccard index; privacy preserving data publishing; kanonymity; l-diversity; t-closeness; slicing;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Abstract Data mining is the technique of getting useful information from huge amount of available data stored in organizations. In these days, microdata published are high-dimensional. Feature selection is an important dimensionality reduction technique for data mining. Selection stability is the robustness of feature selection algorithms for the small perturbations of data sets. Selection stability is an important criterion for the feature selection in data mining. Earlier researches have been in the direction that selection stability is algorithmic dependent. But recently researches proved that selection stability is data dependent but not completely algorithmic independent. Privacy preserving data publishing is the publishing of public data to the private parties for research purposes after the perturbation of data to preserve privacy. In privacy preserving data publishing, we modify the data in some way in order to preserve the privacy of the data. This perturbation affects the selection stability and also the utility of the data. This paper finds the relationship between privacy preserving data publishing and feature selection stability.

Last modified: 2016-06-02 15:21:00