DocumentCode
705132
Title
Comparison of noise robust methods in large vocabulary speech recognition
Author
Keronen, Sami ; Remes, Ulpu ; Palomaki, Kalle J. ; Virtanen, Tuomas ; Kurimo, Mikko
Author_Institution
Adaptive Inf. Res. Centre, Aalto Univ., Aalto, Finland
fYear
2010
fDate
23-27 Aug. 2010
Firstpage
1973
Lastpage
1977
Abstract
In this paper, a comparison of three fundamentally different noise robust approaches is carried out. The recognition performances of multicondition training, Data-driven Parallel Model Combination (DPMC), and cluster-based missing data reconstruction methods implemented in a large vocabulary continuous speech recognition system are evaluated with Finnish language speech data consisting of real recordings in noisy environments. All three methods improve the recognition accuracy substantially in poor signal-to-noise ratio (SNR) conditions when compared to a baseline system trained on clean speech. DPMC and missing data reconstruction systems give the best performance on high SNR conditions. On low SNR conditions, the performance of multicondition trained system is ranked the best, DPMC the second best and missing data reconstruction the third.
Keywords
signal denoising; signal reconstruction; speech recognition; cluster-based missing data reconstruction methods; data-driven parallel model combination; large vocabulary speech recognition; multicondition training; noise robust methods; noisy environments; recognition accuracy; Hidden Markov models; Noise measurement; Signal to noise ratio; Speech; Speech recognition; Training;
fLanguage
English
Publisher
ieee
Conference_Titel
Signal Processing Conference, 2010 18th European
Conference_Location
Aalborg
ISSN
2219-5491
Type
conf
Filename
7096405
Link To Document