DocumentCode :
1395783
Title :
Improving Speech Intelligibility in Noise Using Environment-Optimized Algorithms
Author :
Kim, Gibak ; Loizou, Philipos C.
Author_Institution :
Dept. of Electr. Eng., Univ. of Texas at Dallas, Richardson, TX, USA
Volume :
18
Issue :
8
fYear :
2010
Firstpage :
2080
Lastpage :
2090
Abstract :
While most speech enhancement algorithms improve speech quality, they may not improve speech intelligibility in noise. This paper focuses on the development of an algorithm that can be optimized for a specific acoustic environment and improve speech intelligibility. The proposed method decomposes the input signal into time-frequency (T-F) units and makes binary decisions, based on a Bayesian classifier, as to whether each T-F unit is dominated by the target signal or the noise masker. Target-dominated T-F units are retained while masker-dominated T-F units are discarded. The Bayesian classifier is trained for each acoustic environment using an incremental approach that continuously updates the model parameters as more data become available. Listening experiments were conducted to assess the intelligibility of speech synthesized using the incrementally adapted models as a function of the number of training sentences. Results indicated substantial improvements in intelligibility (over 60% in babble at -5 dB SNR) with as few as ten training sentences in babble and at least 80 sentences in other noisy conditions.
Keywords :
Bayes methods; acoustic noise; speech intelligibility; speech synthesis; Bayesian classifier; acoustic environment; binary decisions; environment optimized algorithms; speech enhancement algorithms; speech intelligibility; speech synthesizer; time-frequency units; Acoustic distortion; Acoustic noise; Background noise; Bayesian methods; Signal to noise ratio; Speech analysis; Speech enhancement; Speech synthesis; Time frequency analysis; Working environment noise; Environment-optimized algorithms; speech enhancement; speech intelligibility;
fLanguage :
English
Journal_Title :
Audio, Speech, and Language Processing, IEEE Transactions on
Publisher :
ieee
ISSN :
1558-7916
Type :
jour
DOI :
10.1109/TASL.2010.2041116
Filename :
5398891
Link To Document :
بازگشت