Spreeuwers,LuukBrömme,ArslanBusch,ChristophDantcheva,AntitzaRathgeb,ChristianUhl,Andreas2017-09-262017-09-262017978-3-88579-664-0https://dl.gi.de/handle/20.500.12116/4641De-duplication is defined as the technique to eliminate or link duplicate copies of repeating data. We consider a specific de-duplication application where a subject applies for a new passport and we want to check if he possesses a passport already under another name. To determine this, a facial photograph of the subject is compared to all photographs of the national database of passports.We investigate if state of the art facial recognition is up to this task and find that for a large database about 2 out of 3 duplicates can be found while few or no false duplicates are reported. This means that de-duplication using automated face recognition is feasible in practice.We also present a mathematical model to predict the performance of de-duplication and find that the probability that k false duplicates are returned can be described well by a Poisson distribution using a varying, subject specific false match rate. We present experimental results using a large database of actual passport photographs consisting of 224 000 images of about 100 000 subjects and find that the results are predicted well by our model.enDe-duplicationface recognitionlarge databasebinomial distributionDe-duplication using automated face recognition: a mathematical model and all babies are equally cute1617-5468