DocumentCode
2705229
Title
A Temporal Auditory Model with Adaptation for Automatic Speech Recognition
Author
Haque, Showera ; Togneri, Roberto ; Zaknich, A.
Author_Institution
Sch. of Electr., Electron. & Comput. Eng., Western Australia Univ., Nedlands, WA, Australia
Volume
4
fYear
2007
fDate
15-20 April 2007
Abstract
Rapid and short-term adaptation are dynamic mechanisms of human auditory system. An auditory model based on zero-crossings with peak amplitudes (ZCPA) was used as a front-end for automatic speech recognition (ASR) with the perceptual property of adaptation as determined by psychoacoustic observations. The model performance was evaluated on the isolated digits (TIDIGITS) database using continuous density HMM recognizer in additive noise environment. Experimental results indicate that the ASR performance of the ZCPA may be improved with adaptation over the static baseline performance in white Gaussian and factory noise. The perceptual front-end was also evaluated with dynamic (delta and delta-delta) features added to the adaptation. It was observed that adaptation with dynamic features performed better in factory, babble and car noise over a wide range of SNR values.
Keywords
AWGN; hidden Markov models; speech recognition; SNR; TIDIGITS; additive noise environment; automatic speech recognition; car noise; continuous density HMM recognizer; delta-delta features; factory noise; human auditory system; isolated digits database; peak amplitudes; psychoacoustic observations; temporal auditory model; white Gaussian noise; zero-crossings; Adaptation model; Additive noise; Auditory system; Automatic speech recognition; Hidden Markov models; Humans; Production facilities; Psychoacoustic models; Psychology; Spatial databases; Auditory system; adaptive system; feature extraction; hidden Markov model; speech recognition;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech and Signal Processing, 2007. ICASSP 2007. IEEE International Conference on
Conference_Location
Honolulu, HI
ISSN
1520-6149
Print_ISBN
1-4244-0727-3
Type
conf
DOI
10.1109/ICASSP.2007.367276
Filename
4218307
Link To Document