• DocumentCode
    310599
  • Title

    HMM-based speech enhancement using harmonic modeling

  • Author

    Deisher, Michael E. ; Spanias, Andreas S.

  • Author_Institution
    Intel Corp., Hillsboro, OR, USA
  • Volume
    2
  • fYear
    1997
  • fDate
    21-24 Apr 1997
  • Firstpage
    1175
  • Abstract
    This paper describes a technique for reduction of non-stationary noise in electronic voice communication systems. Removal of noise is needed in many such systems, particularly those deployed in harsh mobile or otherwise dynamic acoustic environments. The proposed method employs state-based statistical models of both speech and noise, and is thus capable of tracking variations in noise during sustained speech. This work extends the hidden Markov model (HMM) based minimum mean square error (MMSE) estimator to incorporate a ternary voicing state, and applies it to a harmonic representation of voiced speech. Noise reduction during voiced sounds is thereby improved. Performance is evaluated using speech and noise from standard databases. The extended algorithm is demonstrated to improve speech quality as measured by informal preference tests and objective measures, to preserve speech intelligibility as measured by informal diagnostic rhyme tests, and to improve the performance of a low bit-rate speech coder and a speech recognition system when used as a pre-processor
  • Keywords
    acoustic noise; harmonic analysis; hidden Markov models; interference suppression; least mean squares methods; speech coding; speech enhancement; speech intelligibility; speech processing; speech recognition; voice communication; HMM; MMSE estimator; dynamic acoustic environments; extended algorithm; harmonic modeling; harmonic representation; hidden Markov model; informal diagnostic rhyme tests; informal preference tests; low bit-rate speech coder; minimum mean square error estimator; non-stationary noise reduction; objective measures; performance evaluation; speech enhancement; speech intelligibility; speech quality improvement; speech recognition system; state-based statistical models; ternary voicing state; voice communication systems; voiced speech; Acoustic noise; Databases; Hidden Markov models; Mean square error methods; Noise reduction; Speech analysis; Speech enhancement; State estimation; System testing; Working environment noise;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1997. ICASSP-97., 1997 IEEE International Conference on
  • Conference_Location
    Munich
  • ISSN
    1520-6149
  • Print_ISBN
    0-8186-7919-0
  • Type

    conf

  • DOI
    10.1109/ICASSP.1997.596152
  • Filename
    596152