Logo des Repositoriums
 

ExPD: Semi-automated Web Extraction of Personal Data

dc.contributor.authorBöhner, Alexander
dc.contributor.authorHerrmann, Dominik
dc.date.accessioned2021-12-14T10:57:51Z
dc.date.available2021-12-14T10:57:51Z
dc.date.issued2021
dc.description.abstractAccording to the GDPR, data subjects can issue a Subject Access Request (SAR) to obtain all personal data concerning them from a data controller. Data controllers are interested to automate the handling of SARs, which is challenging for legacy applications and services that lack suitable export functionality. This challenge is addressed by the ExPD prototype, a tool that automates the task of collecting personal data from the pages of web applications. The ExPD operator specifies extraction rules interactively in the browser using a small set of pages. After extraction, a tool-assisted refinement stage allows to fine-tune the exported data. Besides automating the processing of SARs, ExPD may also be useful for citizens who want to obtain an overview of their public data footprint on sites that share user contributions, which is demonstrated in a case study.en
dc.identifier.doi10.18420/informatik2021-085
dc.identifier.isbn978-3-88579-708-1
dc.identifier.pissn1617-5468
dc.identifier.urihttps://dl.gi.de/handle/20.500.12116/37754
dc.language.isoen
dc.publisherGesellschaft für Informatik, Bonn
dc.relation.ispartofINFORMATIK 2021
dc.relation.ispartofseriesLecture Notes in Informatics (LNI) - Proceedings, Volume P-314
dc.subjectSubject Access Requests
dc.subjectCrawler
dc.subjectCase Study
dc.titleExPD: Semi-automated Web Extraction of Personal Dataen
gi.citation.endPage1046
gi.citation.startPage1035
gi.conference.date27. September - 1. Oktober 2021
gi.conference.locationBerlin
gi.conference.sessiontitleWorkshop: Datenschutz im Diskurs (RuT2021)

Dateien

Originalbündel
1 - 1 von 1
Vorschaubild nicht verfügbar
Name:
K1-6.pdf
Größe:
378.93 KB
Format:
Adobe Portable Document Format