Improving Anonymization Clustering

Microaggregation is a technique to preserve privacy when confidential information about individuals shall be used by third parties. A basic property to be established is called k-anonymity. It requires that identifying information about individuals should not be unique, instead there has to be a group of size at least k that looks identical. This is achieved by clustering individuals into appropriate groups and then averaging the identifying information. The question arises how to select these groups such that the information loss by averaging is minimal. This problem has been shown to be NP-hard. Thus, several heuristics called MDAV, V-MDAV,... have been proposed for finding at least a suboptimal clustering. This paper proposes a more sophisticated, but still efficient strategy called MDAV* to construct a good clustering. The question whether to extend a group locally by individuals close by or to start a new group with such individuals is investigated in more depth. This way, a noticeable lower information loss can be achieved which is shown by applying MDAV* to several established benchmarks of real data and also to specifically designed random data.

Thaeter, Florian; Reischuk, Rüdiger (2018): Improving Anonymization Clustering. SICHERHEIT 2018. DOI: 10.18420/sicherheit2018_05. Bonn: Gesellschaft für Informatik e.V.. PISSN: 1617-5468. ISBN: 978-3-88579-675-6. pp. 69-82. Wissenschaftliche Beiträge. Konstanz, Germany. 25.-27. April 2018

Schlagwörter

Microdata anonymization , k-Anonymity , Microaggregation , group clustering

DOI

10.18420/sicherheit2018_05

Sammlungen

P281 - Sicherheit 2018 - Sicherheit, Schutz und Zuverlässigkeit

Komplettanzeige

Improving Anonymization Clustering

Volltext URI

Dokumententyp

Dateien

Zusatzinformation

Datum

Autor:innen

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Quelle

Verlag

Zusammenfassung

Beschreibung

Schlagwörter

Zitierform

DOI

Tags

Sammlungen