Automatic parameter tuning and extended training material: recent advances in the Fraunhofer speech recognition system
dc.contributor.author | Schwenninger, Jochen | |
dc.contributor.author | Stein, Daniel | |
dc.contributor.author | Stadtschnitzer, Michael | |
dc.contributor.editor | Horbach, Matthias | |
dc.date.accessioned | 2019-03-07T09:32:29Z | |
dc.date.available | 2019-03-07T09:32:29Z | |
dc.date.issued | 2013 | |
dc.description.abstract | Building the acoustic and language models on a larger amount of training data is a well-known method for robustifying automatic speech recognition approaches. The adaption of the decoder settings afterwards, however, is often only marginally addressed (e.g. being manually set or using default values provided by a toolkit). Without proper adaption, these settings are most often sub-optimal and lead to degraded performance without unlocking the full potential of the speech recognizer. Ideally, the decoder settings should be optimized after each modification of the language model and/or the acoustic model of the speech recognition system, a task that is typically too tedious for manual work. In this paper, we employ an automatic optimization technique on the Fraunhofer IAIS speech recognition setup as a subsequent step to training data increase. We will present the improvements of the expanded training data for the acoustic models and the optimization of the decoder settings on the German Difficult Speech Corpus. Index Terms: simultaneous perturbation stochastic approximation, free decoding parameters, word error rate optimization | en |
dc.identifier.isbn | 978-3-88579-614-5 | |
dc.identifier.pissn | 1617-5468 | |
dc.identifier.uri | https://dl.gi.de/handle/20.500.12116/20714 | |
dc.language.iso | en | |
dc.publisher | Gesellschaft für Informatik e.V. | |
dc.relation.ispartof | INFORMATIK 2013 – Informatik angepasst an Mensch, Organisation und Umwelt | |
dc.relation.ispartofseries | Lecture Notes in Informatics (LNI) - Proceedings, Volume P-220 | |
dc.subject | simultaneous perturbation stochastic approximation | |
dc.subject | free decoding param- eters | |
dc.subject | word error rate optimization | |
dc.title | Automatic parameter tuning and extended training material: recent advances in the Fraunhofer speech recognition system | en |
dc.type | Text/Conference Paper | |
gi.citation.endPage | 3011 | |
gi.citation.publisherPlace | Bonn | |
gi.citation.startPage | 3002 | |
gi.conference.date | 16.-20. September 2013 | |
gi.conference.location | Koblenz | |
gi.conference.sessiontitle | Regular Research Papers |
Dateien
Originalbündel
1 - 1 von 1