SCALA-Speech: An Interactive System for Finding and Analyzing Speech Content in Audio Data

dc.contributor.author: Cornaggia-Urrigshardt, Alessia
dc.contributor.author: Jarocky, Nikita
dc.contributor.author: Kurth, Frank
dc.contributor.author: Urrigshardt, Sebastian
dc.contributor.author: Wilkinghoff, Kevin
dc.contributor.editor: Demmler, Daniel
dc.contributor.editor: Krupka, Daniel
dc.contributor.editor: Federrath, Hannes
dc.date.accessioned: 2022-09-28T17:10:39Z
dc.date.available: 2022-09-28T17:10:39Z
dc.date.issued: 2022
dc.description.abstract: Audio data does not contain as much static information as images and texts and thus analyses inherently require more time. Although in monitoring applications it is likely that large quantities of the captured audio files do not contain meaningful information, without prior knowledge investigators need to listen to all audio files in full length. In this work, a system for automatically finding and analyzing speech content in audio data is presented. The system provides different speech processing algorithms as well as a graphical interface (SCALA) for assisting investigators in audio analysis. The system consists of four components: speech detection, language recognition, speaker diarization/recognition and keyword spotting. SCALA-Speech structures audio data by recognizing speech regions, used languages and speaker changes, thus enabling investigators to listen to audio data more efficiently. Furthermore, specific speakers and keywords can be annotated and searched for. Usage of SCALA-Speech is demonstrated on audio tracks of videos linked in Twitter posts related to an exemplary topic. [en]
dc.identifier.doi: 10.18420/inf2022_06
dc.identifier.isbn: 978-3-88579-720-3
dc.identifier.pissn: 1617-5468
dc.identifier.uri: https://dl.gi.de/handle/20.500.12116/39561
dc.language.iso: en
dc.publisher: Gesellschaft für Informatik, Bonn
dc.relation.ispartof: INFORMATIK 2022
dc.relation.ispartofseries: Lecture Notes in Informatics (LNI) - Proceedings, Volume P-326
dc.subject: Audio Monitoring
dc.subject: Speech Detection
dc.subject: Language Recognition
dc.subject: Speaker Diarization
dc.subject: Keyword Spotting
dc.subject: Deep Learning
dc.title: SCALA-Speech: An Interactive System for Finding and Analyzing Speech Content in Audio Data [en]
gi.citation.endPage: 90
gi.citation.startPage: 81
gi.conference.date: 26-30 September 2022
gi.conference.location: Hamburg
gi.conference.sessiontitle: International Workshop On Digital Forensics (IWDF)

Files

Original bundle
Name: iwdf_06.pdf
Size: 835.97 KB
Format: Adobe Portable Document Format