• DocumentCode
    1395763
  • Title

    Psychoacoustically Constrained and Distortion Minimized Speech Enhancement

  • Author

    Jo, Seokhwan ; Yoo, Chang D.

  • Author_Institution
    Dept. of Electr. Eng., Korea Adv. Inst. of Sci. & Technol., Daejeon, South Korea
  • Volume
    18
  • Issue
    8
  • fYear
    2010
  • Firstpage
    2099
  • Lastpage
    2110
  • Abstract
    This paper considers a psychoacoustically constrained and distortion minimized speech enhancement algorithm. Noise reduction, in general, leads to speech distortion, and a balanced tradeoff between noise reduction and speech distortion must be attained. A constrained optimization problem is set to reduce noise so that speech distortion is minimized while the sum of speech distortion and residual noise is kept below the masking threshold of the clean speech. Obtaining a solution to the optimization problem may be infeasible under certain conditions, and a slack variable is introduced to allow certain deviation from the constraint conditions. To estimate the power spectral density and also the masking threshold of clean speech, a speech model that assumes coexisting deterministic and stochastic components in speech is used. Experimental results show that the considered algorithm outperforms some of the more popular algorithms in terms of improvement in segmental signal-to-noise ratio (SegSNR), spectral distance (SD), modified Bark spectral distortion (MBSD), and mean opinion score (MOS).
  • Keywords
    acoustic noise; speech enhancement; clean speech; constrained optimization problem; distortion minimized speech enhancement; masking threshold; mean opinion score; modified Bark spectral distortion; noise reduction; power spectral density; psychoacoustically constrained speech enhancement; segmental signal-to-noise ratio; spectral distance; speech distortion; Constraint optimization; Distortion; Masking threshold; Noise reduction; Psychoacoustic models; Psychology; Signal to noise ratio; Speech coding; Speech enhancement; Stochastic processes; Constrained optimization; deterministic component; masking threshold; speech enhancement; stochastic component;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2010.2041119
  • Filename
    5398888