SCALA-Speech: An Interactive System for Finding and Analyzing Speech Content in Audio Data

Audio data does not contain as much static information as images and texts and thus analyses inherently require more time. Although in monitoring applications it is likely that large quantities of the captured audio files do not contain meaningful information, without prior knowledge investigators need to listen to all audio files in full length. In this work, a system for automatically finding and analyzing speech content in audio data is presented. The system provides different speech processing algorithms as well as a graphical interface (SCALA) for assisting investigators in audio analysis. The system consists of four components: speech detection, language recognition, speaker diarization/recognition and keyword spotting. SCALA-Speech structures audio data by recognizing speech regions, used languages and speaker changes, thus enabling investigators to listen to audio data more efficiently. Furthermore, specific speakers and keywords can be annotated and searched for. Usage of SCALA-Speech is demonstrated on audio tracks of videos linked in Twitter posts related to an exemplary topic.

Cornaggia-Urrigshardt,Alessia; Jarocky,Nikita; Kurth,Frank; Urrigshardt,Sebastian; Wilkinghoff,Kevin (2022): SCALA-Speech: An Interactive System for Finding and Analyzing Speech Content in Audio Data. INFORMATIK 2022. DOI: 10.18420/inf2022_06. Gesellschaft für Informatik, Bonn. PISSN: 1617-5468. ISBN: 978-3-88579-720-3. pp. 81-90. International Workshop On Digital Forensics (IWDF). Hamburg. 26.-30. September 2022

Schlagwörter

Audio Monitoring , Speech Detection , Language Recognition , Speaker Diarization , Keyword Spotting , Deep Learning

DOI

10.18420/inf2022_06

Sammlungen

P326 - INFORMATIK 2022 - Informatik in den Naturwissenschaften

Komplettanzeige

SCALA-Speech: An Interactive System for Finding and Analyzing Speech Content in Audio Data

Volltext URI

Dokumententyp

Dateien

Zusatzinformation

Datum

Autor:innen

Zeitschriftentitel

ISSN der Zeitschrift

Bandtitel

Quelle

Verlag

Zusammenfassung

Beschreibung

Schlagwörter

Zitierform

DOI

Tags

Sammlungen