Privacy Preserving K-means Clustering: A Survey Research

Privacy Preserving K-means Clustering: A Survey Research

Fatima Meskine1 and Safia Nait Bahloul2
1Department of Computer Science, University of Es-Senia, Algeria
2Department of Computer Science, University Oran, Algeria
 
Abstract: The clustering is an exploratory task of data mining. This raised a real problem of privacy when the data are from different sources. Most of researches on privacy preservation in clustering are developed for k-means clustering algorithm, by applying the secure multi-party computation framework. . The distribution of data may be different (vertical, horizontal or arbitrary). Approaches allowing solving the problem on a vertical, horizontal and even arbitrary partitioned dataset were proposed.  The major interest is to reveal the minimum of information during the execution of the algorithm, especially in k-means iterations, which poses a real challenge for secure multi party computation.  This work consists to study and analyze all works of privacy preserving in the k-means algorithm, classify the various approaches according to the used data distribution while presenting the weaknesses and the strong points of each protocol regards to privacy. The interest is to arise the real needs of privacy  during the execution of the different steps of k-mean algorithm, thus to discover the best of approaches in case of preserving privacy in k-means algorithm.

Keywords:  k-means clustering algorithm, secure multi-party computation, distributed data, privacy preserving.

Received December 31, 2010, accepted March 1, 2011

Read 3736 times Last modified on Sunday, 19 February 2012 04:47
Share
Top
We use cookies to improve our website. By continuing to use this website, you are giving consent to cookies being used. More details…