مرکز منطقه ای اطلاع رساني علوم و فناوري - A Supervised Learning Approach to Uncertainty Decoding for Robust Speech Recognition

DocumentCode :

2308905

Title :

A Supervised Learning Approach to Uncertainty Decoding for Robust Speech Recognition

Author :

Srinivasan, Soundararajan ; Wang, DeLiang

Author_Institution :

Biomed. Eng. Center, Ohio State Univ., Columbus, OH

Volume :

fYear :

2006

fDate :

14-19 May 2006

Abstract :

Recently several algorithms have been proposed to enhance noisy speech by estimating a binary mask that can be used to select those time-frequency regions of a noisy speech signal that contain more speech energy than noise energy. This binary mask encodes the uncertainty associated with enhanced speech in the linear spectral domain. The use of the cepstral transformation leads to a smearing of this uncertainty. We propose a supervised approach to learn the non linear transformation of the uncertainty from the linear spectral domain to the cepstral domain. This uncertainty is used by a decoder that exploits the variance associated with the enhanced cepstral features to improve robust speech recognition. Systematic evaluations on a subset of the Aurora4 task using the estimated uncertainty shows substantial improvement over the baseline performance

Keywords :

decoding; learning (artificial intelligence); speech coding; speech recognition; time-frequency analysis; binary mask; cepstral domain; enhanced cepstral features; linear spectral domain; noisy speech signal; robust speech recognition; supervised learning approach; time-frequency regions; uncertainty decoding; Acoustic noise; Cepstral analysis; Decoding; Mel frequency cepstral coefficient; Robustness; Speech coding; Speech enhancement; Speech recognition; Supervised learning; Uncertainty;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Acoustics, Speech and Signal Processing, 2006. ICASSP 2006 Proceedings. 2006 IEEE International Conference on

Conference_Location :

Toulouse

ISSN :

1520-6149

Print_ISBN :

1-4244-0469-X

Type :

conf

DOI :

10.1109/ICASSP.2006.1660016

Filename :

1660016

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2308905