DocumentCode
2761595
Title
Robust recognition of noisy speech over H.323 networks
Author
Chen, Gang ; Tolba, Hesham ; O´Shaughnessy, Douglas
Author_Institution
INRS-EMT, Quebec Univ., Que.
fYear
2005
fDate
1-4 May 2005
Firstpage
1247
Lastpage
1250
Abstract
In this paper, we investigate the performance of a speech recognizer on noisy speech transmitted over an H.323 channel, where the minimum mean-square error log spectra amplitude (MMSE-LSA) method is used to reduce the mismatch between training and deployment condition in order to achieve robust speech recognition. In the IP communication environment, one of the sources of distortion to the speech is packet loss. Of course, when ASR systems are used in adverse conditions, their performance degrades. In our work, we not only evaluate the impact of packet losses on speech recognition performance, but also explore the effects of uncorrelated additive noise on the performance. For measuring the influence of missing speech packets on the ASR system performance, we use a Soekris net 4501 IP simulator made by the Engineering Soekris Engineering Company, in order to control packet loss rate. To explore how additive acoustic noise affects the speech recognition performance, six types of noise sources are selected for use in our experiments. The experimental results indicate that the MMSE-LSA enhancement method apparently increased robustness for some type of additive noise under certain packet loss rates over the IP
Keywords
IP networks; least mean squares methods; speech coding; speech recognition; H.323 networks; IP communication environment; MMSE; additive acoustic noise; additive noise; minimum mean-square error log spectra amplitude method; packet loss rate; performance degradation; speech recognition robustness; Acoustic noise; Acoustical engineering; Additive noise; Automatic speech recognition; Noise level; Noise reduction; Performance loss; Robustness; Speech recognition; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Electrical and Computer Engineering, 2005. Canadian Conference on
Conference_Location
Saskatoon, Sask.
ISSN
0840-7789
Print_ISBN
0-7803-8885-2
Type
conf
DOI
10.1109/CCECE.2005.1557203
Filename
1557203
Link To Document