DocumentCode
310599
Title
HMM-based speech enhancement using harmonic modeling
Author
Deisher, Michael E. ; Spanias, Andreas S.
Author_Institution
Intel Corp., Hillsboro, OR, USA
Volume
2
fYear
1997
fDate
21-24 Apr 1997
Firstpage
1175
Abstract
This paper describes a technique for reduction of non-stationary noise in electronic voice communication systems. Removal of noise is needed in many such systems, particularly those deployed in harsh mobile or otherwise dynamic acoustic environments. The proposed method employs state-based statistical models of both speech and noise, and is thus capable of tracking variations in noise during sustained speech. This work extends the hidden Markov model (HMM) based minimum mean square error (MMSE) estimator to incorporate a ternary voicing state, and applies it to a harmonic representation of voiced speech. Noise reduction during voiced sounds is thereby improved. Performance is evaluated using speech and noise from standard databases. The extended algorithm is demonstrated to improve speech quality as measured by informal preference tests and objective measures, to preserve speech intelligibility as measured by informal diagnostic rhyme tests, and to improve the performance of a low bit-rate speech coder and a speech recognition system when used as a pre-processor
Keywords
acoustic noise; harmonic analysis; hidden Markov models; interference suppression; least mean squares methods; speech coding; speech enhancement; speech intelligibility; speech processing; speech recognition; voice communication; HMM; MMSE estimator; dynamic acoustic environments; extended algorithm; harmonic modeling; harmonic representation; hidden Markov model; informal diagnostic rhyme tests; informal preference tests; low bit-rate speech coder; minimum mean square error estimator; non-stationary noise reduction; objective measures; performance evaluation; speech enhancement; speech intelligibility; speech quality improvement; speech recognition system; state-based statistical models; ternary voicing state; voice communication systems; voiced speech; Acoustic noise; Databases; Hidden Markov models; Mean square error methods; Noise reduction; Speech analysis; Speech enhancement; State estimation; System testing; Working environment noise;
fLanguage
English
Publisher
ieee
Conference_Titel
Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
Conference_Location
Munich
ISSN
1520-6149
Print_ISBN
0-8186-7919-0
Type
conf
DOI
10.1109/ICASSP.1997.596152
Filename
596152
Link To Document