Now showing items 1-3 of 3
Cluster Flow - an Advanced Concept for Ensemble-Enabling, Interactive Clustering
Even though most clustering algorithms serve knowledge discovery in fields other than computer science, most of them still require users to be familiar with programming or data mining to some extent. As that often prevents efficient research, we developed an easy to use, highly explainable clustering method accompanied ...
Extended Affinity Propagation Clustering for Multi-source Entity Resolution
Entity resolution is the data integration task of identifying matching entities (e.g. products, customers) in one or several data sources. Previous approaches for matching and clustering entities between multiple (>2) sources either treated the different sources as a single source or assumed that the individual sources ...
Multi-Party Privacy Preserving Record Linkage in Dynamic Metric Space
We propose and evaluate several approaches for multi-party privacy-preserving record linkage (MP-PPRL) for multiple data sources. To reduce the number of comparisons for scalability we propose a new pivot-based metric space approach that dynamically adapts the selection of pivots for additional sources and growing data ...