DocumentCode :
3378218
Title :
On the Use of Forward Temporal Masking and Cumulative Distribution Mapping for Noisy Speech Recognition
Author :
Choi, Eric H C ; Epps, Julien
Author_Institution :
Interfaces, Machines & Graphic Environments, Nat. ICT Australia, Sydney, NSW
fYear :
2005
fDate :
21-24 Nov. 2005
Firstpage :
1
Lastpage :
6
Abstract :
Robustness in the presence of various types and levels of environmental noise remains an important issue for automatic speech recognition (ASR) systems. This paper describes a new noise-robust ASR front-end that employs a functional model of forward temporal masking combined with cumulative distribution mapping based on MFCC´s with c0. Recognition experiments on the Aurora II connected digits database reveal that the proposed front-end achieves an average digit recognition accuracy of 83.24% for a model set trained from clean data and 90.32% for a model set trained from data with multiple noise conditions. Compared with the ETSI standard Mel-cepstral front-end, the proposed front-end obtains a relative error reduction of around 57% for the clean model set and 21% for the multi-condition model set.
Keywords :
noise (working environment); speech recognition; automatic speech recognition; cumulative distribution mapping; environmental noise; forward temporal masking; noisy speech recognition; Automatic speech recognition; Cepstral analysis; Databases; Noise robustness; Psychoacoustic models; Signal to noise ratio; Speech enhancement; Speech recognition; Telecommunication standards; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
TENCON 2005 2005 IEEE Region 10
Conference_Location :
Melbourne, Qld.
Print_ISBN :
0-7803-9311-2
Electronic_ISBN :
0-7803-9312-0
Type :
conf
DOI :
10.1109/TENCON.2005.301108
Filename :
4084996
Link To Document :
بازگشت