Value-specific Weighting for Record-level Encodings in Privacy-Preserving Record Linkage

dc.contributor.author: Rohde, Florens
dc.contributor.author: Franke, Martin
dc.contributor.author: Christen, Victor
dc.contributor.author: Rahm, Erhard
dc.contributor.editor: König-Ries, Birgitta
dc.contributor.editor: Scherzinger, Stefanie
dc.contributor.editor: Lehner, Wolfgang
dc.contributor.editor: Vossen, Gottfried
dc.date.accessioned: 2023-02-23T13:59:50Z
dc.date.available: 2023-02-23T13:59:50Z
dc.date.issued: 2023
dc.description.abstract: Privacy-preserving record linkage (PPRL) determines records representing the same entity while guaranteeing the privacy of individuals. A common approach is to encode plaintext data of records into Bloom filters that enable efficient calculation of similarities. A crucial step of PPRL is the classification of Bloom filter pairs as match or non-match based on computed similarities. In the context of record linkage, several weighting schemes and classification methods are available. The majority of weighting methods determine and adapt weights by applying the Fellegi & Sunter model for each attribute. In the PPRL domain, the attributes of a record are encoded in a joint record-level Bloom filter to impede cryptanalysis attacks, so that the application of existing attribute-wise weighting approaches is not feasible. We study methods that use attribute-specific weights in record-level encodings and integrate weight adaptation approaches based on individual value frequencies. The experiments on real-world datasets show that frequency-dependent weighting schemes improve the linkage quality as well as the robustness with regard to the threshold selection.
dc.identifier.doi: 10.18420/BTW2023-21
dc.identifier.isbn: 978-3-88579-725-8
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/40326
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik e.V.
dc.relation.ispartof: BTW 2023
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-331
dc.subject: Privacy-preserving record linkage
dc.subject: Bloom filter
dc.subject: Weighting
dc.subject: Value-specific
dc.title: Value-specific Weighting for Record-level Encodings in Privacy-Preserving Record Linkage
dc.type: Text/Conference Paper
gi.citation.endPage: 460
gi.citation.publisherPlace: Bonn
gi.citation.startPage: 439
gi.conference.date: 6-10 March 2023
gi.conference.location: Dresden, Germany

Files

Name: B4-4.pdf
Size: 513.31 KB
Format: Adobe Portable Document Format