Nautsch,AndreasSteen,Søren TradsBusch,ChristophBrömme,ArslanBusch,ChristophDantcheva,AntitzaRathgeb,ChristianUhl,Andreas2017-09-262017-09-262017978-3-88579-664-0https://dl.gi.de/handle/20.500.12116/4655In scenarios that are ambitious to protect sensitive data in compliance with privacy regulations, conventional score normalization utilizing large proportions of speaker cohort data is not feasible for existing technology, since the entire cohort data would need to be stored on each mobile device. Hence, in this work we motivate score normalization utilizing deep neural networks. Considering unconstrained environments, a quality-informed scheme is proposed, normalizing scores depending on sample quality estimates in terms of completeness and signal degradation by noise. Utilizing the conventional PLDA score, comparison i-vectors, and corresponding quality vectors, we aim at mimicking cohort based score normalization optimizing the Cmin llr discrimination criterion. Examining the I4U data sets for the 2012 NIST SRE, an 8.7% relative gain is yielded in a pooled 55-condition scenario with a corresponding condition-averaged relative gain of 6.2% in terms of Cmin llr . Robustness analyses towards sensitivity regarding unseen conditions are conducted, i.e. when conditions comprising lower quality samples are not available during training.enspeaker recognitionscore normalizationunconstrained environmentsneural networksdeep learningDeep Quality-informed Score Normalization for Privacy-friendly Speaker Recognition in unconstrained Environments1617-5468