• DocumentCode
    839052
  • Title

    Speech enhancement using a mixture-maximum model

  • Author

    Burshtein, David ; Gannot, Sharon

  • Author_Institution
    Dept. of Electr. Eng.-Syst., Tel Aviv Univ., Israel
  • Volume
    10
  • Issue
    6
  • fYear
    2002
  • fDate
    9/1/2002 12:00:00 AM
  • Firstpage
    341
  • Lastpage
    351
  • Abstract
    We present a spectral domain, speech enhancement algorithm. The new algorithm is based on a mixture model for the short time spectrum of the clean speech signal, and on a maximum assumption in the production of the noisy speech spectrum. In the past this model was used in the context of noise robust speech recognition. In this paper we show that this model is also effective for improving the quality of speech signals corrupted by additive noise. The computational requirements of the algorithm can be significantly reduced, essentially without paying performance penalties, by incorporating a dual codebook scheme with tied variances. Experiments, using recorded speech signals and actual noise sources, show that in spite of its low computational requirements, the algorithm shows improved performance compared to alternative speech enhancement algorithms.
  • Keywords
    Gaussian processes; computational complexity; noise; spectral analysis; speech coding; speech enhancement; speech intelligibility; speech recognition; Gaussian mixture model; MIXMAX model; additive noise; clean speech signal; dual codebook; low computational requirements; mixture model; mixture-maximum model; noise robust speech recognition; noise sources; noisy speech spectrum; performance penalties; recorded speech signals; short time spectrum; spectral domain; speech enhancement algorithm; speech intelligibility; speech signal quality; tied variances; Additive noise; Amplitude estimation; Background noise; Context modeling; Filtering algorithms; Hidden Markov models; Noise robustness; Speech coding; Speech enhancement; Speech recognition;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/TSA.2002.803420
  • Filename
    1040258