Text document

Deep Quality-informed Score Normalization for Privacy-friendly Speaker Recognition in unconstrained Environments

Date

2017

Publisher

Gesellschaft für Informatik, Bonn

Abstract

In scenarios that aim to protect sensitive data in compliance with privacy regulations, conventional score normalization that relies on large amounts of speaker cohort data is not feasible with existing technology, since the entire cohort data would need to be stored on each mobile device. Hence, in this work we motivate score normalization using deep neural networks. For unconstrained environments, a quality-informed scheme is proposed that normalizes scores depending on sample quality estimates in terms of completeness and signal degradation by noise. Using the conventional PLDA score, the comparison i-vectors, and the corresponding quality vectors, we aim to mimic cohort-based score normalization by optimizing the $C_{\mathrm{llr}}^{\min}$ discrimination criterion. On the I4U data sets for the 2012 NIST SRE, an 8.7% relative gain is obtained in a pooled 55-condition scenario, with a corresponding condition-averaged relative gain of 6.2% in terms of $C_{\mathrm{llr}}^{\min}$. Robustness analyses examine sensitivity to unseen conditions, i.e. when conditions comprising lower-quality samples are not available during training.
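For readers unfamiliar with the evaluation criterion, the following is a minimal sketch of the log-likelihood-ratio cost (Cllr) and of how a relative gain such as the reported 8.7% is computed. It is not the authors' code. The function names `cllr` and `relative_gain` are illustrative, the scores are assumed to be natural-log likelihood ratios, and the sketch computes plain Cllr only: the minimum variant named in the abstract would additionally require an optimal calibration step (commonly done with the pool-adjacent-violators algorithm), which is omitted here.

```python
import math


def softplus(x):
    """Numerically stable log(1 + exp(x))."""
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def cllr(target_llrs, nontarget_llrs):
    """Log-likelihood-ratio cost in bits.

    Assumes both inputs are non-empty lists of natural-log LLR scores.
    Target trials are penalised for low scores, non-target trials for
    high scores; the two averages are weighted equally.
    """
    tar = sum(softplus(-s) for s in target_llrs) / len(target_llrs)
    non = sum(softplus(s) for s in nontarget_llrs) / len(nontarget_llrs)
    return 0.5 * (tar + non) / math.log(2)


def relative_gain(baseline, improved):
    """Relative reduction of a cost value with respect to a baseline."""
    return (baseline - improved) / baseline
```

A system that always outputs an LLR of 0 carries no information and scores exactly 1 bit, so `cllr([0.0], [0.0])` evaluates to 1.0; well-separated target and non-target scores push the value towards 0.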

Description

Nautsch, Andreas; Steen, Søren Trads; Busch, Christoph (2017): Deep Quality-informed Score Normalization for Privacy-friendly Speaker Recognition in unconstrained Environments. BIOSIG 2017. Gesellschaft für Informatik, Bonn. PISSN: 1617-5468. ISBN: 978-3-88579-664-0. pp. 243-250. Further Conference Contributions. Darmstadt, Germany. 20-22 September 2017
