GI LogoGI Logo
  • Login
Digital Library
    • All of DSpace

      • Communities & Collections
      • Titles
      • Authors
      • By Issue Date
      • Subjects
    • This Collection

      • Titles
      • Authors
      • By Issue Date
      • Subjects
Digital Library Gesellschaft für Informatik e.V.
GI-DL
    • English
    • Deutsch
  • English 
    • English
    • Deutsch
View Item 
  •   DSpace Home
  • Lecture Notes in Informatics
  • Proceedings
  • BIOSIG - Biometrics and Electronic Signatures
  • P282 - BIOSIG 2018 - Proceedings of the 17th International Conference of the Biometrics Special Interest Group
  • View Item
JavaScript is disabled for your browser. Some features of this site may not work without it.
  •   DSpace Home
  • Lecture Notes in Informatics
  • Proceedings
  • BIOSIG - Biometrics and Electronic Signatures
  • P282 - BIOSIG 2018 - Proceedings of the 17th International Conference of the Biometrics Special Interest Group
  • View Item

Enhanced low-latency speaker spotting using selective cluster enrichment

Author:
Patino, Jose [DBLP] ;
Delgado, Héctor [DBLP] ;
Evans, Nicholas [DBLP]
Abstract
Low-latency speaker spotting (LLSS) calls for the rapid detection of known speakers within multi-speaker audio streams. While previous work showed the potential to develop efficient LLSS solutions by combining speaker diarization and speaker detection within an online processing framework, it failed to move significantly beyond the traditional definition of diarization. This paper shows that the latter needs rethinking and that a diarization sub-system tailored to the end application, rather than to the minimisation of the diarization error rate, can improve LLSS performance. The proposed selective cluster enrichment algorithm is used to guide the diarization system to better model segments within a multi-speaker audio stream and hence detect more reliably a given target speaker. The LLSS solution reported in this paper shows that target speakers can be detected with a 16% equal error rate after having been active in multi-speaker audio streams for only 15 seconds.
  • Citation
  • BibTeX
Patino, J., Delgado, H. & Evans, N., (2018). Enhanced low-latency speaker spotting using selective cluster enrichment. In: Brömme, A., Busch, C., Dantcheva, A., Rathgeb, C. & Uhl, A. (Hrsg.), BIOSIG 2018 - Proceedings of the 17th International Conference of the Biometrics Special Interest Group. Bonn: Köllen Druck+Verlag GmbH.
@inproceedings{mci/Patino2018,
author = {Patino, Jose AND Delgado, Héctor AND Evans, Nicholas},
title = {Enhanced low-latency speaker spotting using selective cluster enrichment},
booktitle = {BIOSIG 2018 - Proceedings of the 17th International Conference of the Biometrics Special Interest Group},
year = {2018},
editor = {Brömme, Arslan AND Busch, Christoph AND Dantcheva, Antitza AND Rathgeb, Christian AND Uhl, Andreas},
publisher = {Köllen Druck+Verlag GmbH},
address = {Bonn}
}
DateienGroesseFormatAnzeige
BIOSIG_2018_paper_61.pdf167.6Kb PDF View/Open

Haben Sie fehlerhafte Angaben entdeckt? Sagen Sie uns Bescheid: Send Feedback

More Info

ISBN: 978-3-88579-676-4
ISSN: 1617-5468
xmlui.MetaDataDisplay.field.date: 2018
Language: en (en)
Content Type: Text/Conference Paper

Keywords

  • low-latency speaker spotting
  • speaker detection
  • speaker diarization
Collections
  • P282 - BIOSIG 2018 - Proceedings of the 17th International Conference of the Biometrics Special Interest Group [32]

Show full item record


About uns | FAQ | Help | Imprint | Datenschutz

Gesellschaft für Informatik e.V. (GI), Kontakt: Geschäftsstelle der GI
Diese Digital Library basiert auf DSpace.

 

 


About uns | FAQ | Help | Imprint | Datenschutz

Gesellschaft für Informatik e.V. (GI), Kontakt: Geschäftsstelle der GI
Diese Digital Library basiert auf DSpace.