مرکز منطقه ای اطلاع رساني علوم و فناوري - Robust speech recognition using multiple prior models for speech reconstruction

DocumentCode :

2175912

Title :

Robust speech recognition using multiple prior models for speech reconstruction

Author :

Narayanan, Arun ; Zhao, Xiaojia ; Wang, DeLiang ; Fosler-Lussier, Eric

Author_Institution :

Dept. of Comput. Sci. & Eng., Ohio State Univ., Columbus, OH, USA

fYear :

2011

fDate :

22-27 May 2011

Firstpage :

4800

Lastpage :

4803

Abstract :

Prior models of speech have been used in robust automatic speech recognition to enhance noisy speech. Typically, a single prior model is trained by pooling the entire training data. In this paper we propose to train multiple prior models of speech instead of a single prior model. The prior models can be trained based on distinct characteristics of speech. In this study, they are trained based on voicing characteristics. The trained prior models are then used to reconstruct noisy speech. Significant improvements are obtained on the Aurora-4 robust speech recognition task when multiple priors are used; in conjunction with an uncertainty transform technique, multiple priors yield a 13.7% absolute improvement in the average word error rate over directly recognizing noisy speech.

Keywords :

signal reconstruction; speech enhancement; speech recognition; Aurora-4 robust speech recognition; multiple prior models; noisy speech enhancement; speech reconstruction; voicing characteristics; Hidden Markov models; Noise; Noise measurement; Robustness; Speech; Speech recognition; Uncertainty; Aurora-4; CASA; Robust ASR; feature reconstruction; uncertainty transform;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing (ICASSP), 2011 IEEE International Conference on

Conference_Location :

Prague

ISSN :

1520-6149

Print_ISBN :

978-1-4577-0538-0

Electronic_ISBN :

1520-6149

Type :

conf

DOI :

10.1109/ICASSP.2011.5947429

Filename :

5947429

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2175912