In this paper, we propose a framework for measuring the impact of data privacy techniques, in information theoretic and in data mining terms. The need for data privacy and anonymization is often hampered by the fact that the privacy functions alter the data in non-measurable amounts and details. We propose here to use Mutual Information over non-Euclidean spaces as a means of measuring this distortion. In addition, and following the same principle, we also propose to use Machine Learning techniques in order to quantify the impact of the data obfuscation in terms of further data mining goals.
Yoan Miche (Nokia), Ian Olivier (Nokia), Silke Holtmanns (Nokia), Anton Akusok (Aalto University), Amaury Lendasse (Aalti University), Kaj-Mikael Björk (Åbo Akademi University): On Mutual Information over Non-Euclidean Spaces, Data Mining and Data Privacy Levels