Enhanced low-latency speaker spotting using selective cluster enrichment

dc.contributor.author: Patino, Jose
dc.contributor.author: Delgado, Héctor
dc.contributor.author: Evans, Nicholas
dc.contributor.editor: Brömme, Arslan
dc.contributor.editor: Busch, Christoph
dc.contributor.editor: Dantcheva, Antitza
dc.contributor.editor: Rathgeb, Christian
dc.contributor.editor: Uhl, Andreas
dc.date.accessioned: 2019-06-17T10:00:17Z
dc.date.available: 2019-06-17T10:00:17Z
dc.date.issued: 2018
dc.description.abstract: Low-latency speaker spotting (LLSS) calls for the rapid detection of known speakers within multi-speaker audio streams. While previous work showed the potential to develop efficient LLSS solutions by combining speaker diarization and speaker detection within an online processing framework, it failed to move significantly beyond the traditional definition of diarization. This paper shows that the latter needs rethinking and that a diarization sub-system tailored to the end application, rather than to the minimisation of the diarization error rate, can improve LLSS performance. The proposed selective cluster enrichment algorithm is used to guide the diarization system to better model segments within a multi-speaker audio stream and hence detect more reliably a given target speaker. The LLSS solution reported in this paper shows that target speakers can be detected with a 16% equal error rate after having been active in multi-speaker audio streams for only 15 seconds. [en]
dc.identifier.isbn: 978-3-88579-676-4
dc.identifier.pissn: 1617-5468
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/23786
dc.language.iso: en
dc.publisher: Köllen Druck+Verlag GmbH
dc.relation.ispartof: BIOSIG 2018 - Proceedings of the 17th International Conference of the Biometrics Special Interest Group
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-283
dc.subject: low-latency speaker spotting
dc.subject: speaker detection
dc.subject: speaker diarization
dc.title: Enhanced low-latency speaker spotting using selective cluster enrichment [en]
dc.type: Text/Conference Paper
gi.citation.publisherPlace: Bonn
gi.conference.date: 26-28 September 2018
gi.conference.location: Darmstadt

Files

Original bundle
Name: BIOSIG_2018_paper_61.pdf
Size: 167.65 KB
Format: Adobe Portable Document Format