DocumentCode :
310599
Title :
HMM-based speech enhancement using harmonic modeling
Author :
Deisher, Michael E. ; Spanias, Andreas S.
Author_Institution :
Intel Corp., Hillsboro, OR, USA
Volume :
2
fYear :
1997
fDate :
21-24 Apr 1997
Firstpage :
1175
Abstract :
This paper describes a technique for reduction of non-stationary noise in electronic voice communication systems. Removal of noise is needed in many such systems, particularly those deployed in harsh mobile or otherwise dynamic acoustic environments. The proposed method employs state-based statistical models of both speech and noise, and is thus capable of tracking variations in noise during sustained speech. This work extends the hidden Markov model (HMM) based minimum mean square error (MMSE) estimator to incorporate a ternary voicing state, and applies it to a harmonic representation of voiced speech. Noise reduction during voiced sounds is thereby improved. Performance is evaluated using speech and noise from standard databases. The extended algorithm is demonstrated to improve speech quality as measured by informal preference tests and objective measures, to preserve speech intelligibility as measured by informal diagnostic rhyme tests, and to improve the performance of a low bit-rate speech coder and a speech recognition system when used as a pre-processor
Keywords :
acoustic noise; harmonic analysis; hidden Markov models; interference suppression; least mean squares methods; speech coding; speech enhancement; speech intelligibility; speech processing; speech recognition; voice communication; HMM; MMSE estimator; dynamic acoustic environments; extended algorithm; harmonic modeling; harmonic representation; hidden Markov model; informal diagnostic rhyme tests; informal preference tests; low bit-rate speech coder; minimum mean square error estimator; non-stationary noise reduction; objective measures; performance evaluation; speech enhancement; speech intelligibility; speech quality improvement; speech recognition system; state-based statistical models; ternary voicing state; voice communication systems; voiced speech; Acoustic noise; Databases; Hidden Markov models; Mean square error methods; Noise reduction; Speech analysis; Speech enhancement; State estimation; System testing; Working environment noise;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location :
Munich
ISSN :
1520-6149
Print_ISBN :
0-8186-7919-0
Type :
conf
DOI :
10.1109/ICASSP.1997.596152
Filename :
596152
Link To Document :
بازگشت