HMM-based speech enhancement using harmonic modeling

Author

Deisher, Michael E.; Spanias, Andreas S.

Author_Institution

Intel Corp., Hillsboro, OR, USA

Volume

2

fYear

1997

fDate

21-24 Apr 1997

Firstpage

1175

Abstract

This paper describes a technique for reduction of non-stationary noise in electronic voice communication systems. Removal of noise is needed in many such systems, particularly those deployed in harsh mobile or otherwise dynamic acoustic environments. The proposed method employs state-based statistical models of both speech and noise, and is thus capable of tracking variations in noise during sustained speech. This work extends the hidden Markov model (HMM) based minimum mean square error (MMSE) estimator to incorporate a ternary voicing state, and applies it to a harmonic representation of voiced speech. Noise reduction during voiced sounds is thereby improved. Performance is evaluated using speech and noise from standard databases. The extended algorithm is demonstrated to improve speech quality as measured by informal preference tests and objective measures, to preserve speech intelligibility as measured by informal diagnostic rhyme tests, and to improve the performance of a low bit-rate speech coder and a speech recognition system when used as a pre-processor.

Keywords

acoustic noise; harmonic analysis; hidden Markov models; interference suppression; least mean squares methods; speech coding; speech enhancement; speech intelligibility; speech processing; speech recognition; voice communication; HMM; MMSE estimator; dynamic acoustic environments; extended algorithm; harmonic modeling; harmonic representation; hidden Markov model; informal diagnostic rhyme tests; informal preference tests; low bit-rate speech coder; minimum mean square error estimator; non-stationary noise reduction; objective measures; performance evaluation; speech quality improvement; speech recognition system; state-based statistical models; ternary voicing state; voice communication systems; voiced speech; Databases; Mean square error methods; Noise reduction; Speech analysis; State estimation; System testing; Working environment noise

fLanguage

English

Publisher

IEEE

Conference_Titel

1997 IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP-97)

Conference_Location

Munich

ISSN

1520-6149

Print_ISBN

0-8186-7919-0

Type

conf

DOI

10.1109/ICASSP.1997.596152

Filename

596152