Title :
Signal conditioning techniques for robust speech recognition
Author :
Rahim, Mazin G. ; Juang, Biing-hwang ; Chou, Wu ; Buhrke, Eric
Author_Institution :
AT&T Bell Labs., Murray Hill, NJ, USA
fDate :
4/1/1996 12:00:00 AM
Abstract :
Acoustic mismatch encountered in various training and testing conditions of hidden Markov model (HMM) based systems often causes severe degradation in speech recognition performance. For telephone based speech recognition tasks, acoustic mismatch can arise from various sources, such as variations in telephone handsets, ambient noise, and channel distortions. This paper presents three techniques for blind channel equalization, namely, cepstral mean subtraction (CMS), signal bias removal (SBR) and hierarchical signal bias removal (HSBR). Experimental results on various connected digits databases show a reduction in the digit error rate by 16%, 21%, and 28% when employing CMS, SBR, and HSBR, respectively. Our results also demonstrate that the HSBR technique outperforms SBR and CMS on every sub-data collection and exhibits consistent improvements even for short utterances.
Keywords :
cepstral analysis; equalisers; hidden Markov models; noise; speech processing; speech recognition; telecommunication channels; telephony; HMM based systems; acoustic mismatch; ambient noise; blind channel equalization; cepstral mean subtraction; channel distortions; connected digits databases; digit error rate; experimental results; hidden Markov model; hierarchical signal bias removal; robust speech recognition; signal bias removal; signal conditioning techniques; telephone based speech recognition; telephone handsets; Acoustic noise; Acoustic testing; Collision mitigation; Degradation; Hidden Markov models; Noise robustness; Speech recognition; System testing; Telephone sets; Telephony;
Journal_Title :
Signal Processing Letters, IEEE