• DocumentCode
    1798507
  • Title
    Suitability of speech quality evaluation measures in speech enhancement
  • Author
    Zhang Jie ; Xiaoqun Zhao ; Jingyun Xu ; Zhang Yang
  • Author_Institution
    Coll. of Electron. & Inf. Eng., Tongji Univ., Shanghai, China
  • fYear
    2014
  • fDate
    7-9 July 2014
  • Firstpage
    22
  • Lastpage
    26
  • Abstract
    In this paper, we discuss the suitability of speech quality evaluation measures under various noise environments when applied to speech enhancement by spectral subtraction. We take three typical kinds of noise and evaluate speech quality comprehensively with respect to the global signal-to-noise ratio (SNR) of the noisy speech. We consider six quality measures: mean opinion score, perceptual evaluation of speech quality, segmental signal-to-noise ratio, weighted spectral slope, log-likelihood ratio, and log spectral distance. Appropriate evaluation algorithms are then chosen for speech enhancement based on spectral subtraction. The simulation results show that, in speech enhancement, the suitability of a speech quality evaluation algorithm is limited by the SNR of the noisy speech, the speaker, the recorded content, and the background noise environment. For the three kinds of practical background noise, taken from the TIMIT database, the segmental signal-to-noise ratio is the least suitable measure and log spectral distance is the most suitable.
  • Keywords
    signal denoising; speech enhancement; SNR; TIMIT database; background noise environment; log spectral distance; log-likelihood ratio; noise environments; noisy speech; segmental signal-to-noise ratio; signal-to-noise ratio; spectral subtraction; spectral subtraction speech enhancement; speech quality evaluation algorithms; weighted spectral slope; Noise measurement; Production facilities; Speech; perceptual evaluation of speech quality; speech evaluation quality
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Audio, Language and Image Processing (ICALIP), 2014 International Conference on
  • Conference_Location
    Shanghai
  • Print_ISBN
    978-1-4799-3902-2
  • Type
    conf
  • DOI
    10.1109/ICALIP.2014.7009749
  • Filename
    7009749
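
The abstract ranks segmental signal-to-noise ratio as the least suitable measure and log spectral distance as the most suitable. As a rough illustration of what these two measures compute, here is a minimal sketch using common textbook definitions. It is not the authors' implementation: the function names, the frame length and hop, the Hann window, and the clipping of per-frame SNR to [-10, 35] dB are all assumptions made for this example.

```python
import numpy as np

EPS = 1e-12  # guards against log(0) and division by zero


def frame_signal(x, frame_len=256, hop=128):
    """Split a 1-D signal into overlapping frames (rows of the result)."""
    x = np.asarray(x, dtype=float)
    n_frames = 1 + (len(x) - frame_len) // hop
    if n_frames < 1:
        raise ValueError("signal is shorter than one frame")
    idx = np.arange(frame_len)[None, :] + hop * np.arange(n_frames)[:, None]
    return x[idx]


def segmental_snr(clean, processed, frame_len=256, hop=128, lo=-10.0, hi=35.0):
    """Mean per-frame SNR in dB; the [lo, hi] clipping range is an assumed convention."""
    if len(clean) != len(processed):
        raise ValueError("signals must have the same length")
    c = frame_signal(clean, frame_len, hop)
    p = frame_signal(processed, frame_len, hop)
    signal_energy = np.sum(c ** 2, axis=1)
    error_energy = np.sum((c - p) ** 2, axis=1)
    snr_db = 10.0 * np.log10((signal_energy + EPS) / (error_energy + EPS))
    return float(np.mean(np.clip(snr_db, lo, hi)))


def log_spectral_distance(clean, processed, frame_len=256, hop=128):
    """Mean over frames of the RMS difference between log power spectra, in dB."""
    if len(clean) != len(processed):
        raise ValueError("signals must have the same length")
    window = np.hanning(frame_len)  # assumed analysis window
    c_pow = np.abs(np.fft.rfft(frame_signal(clean, frame_len, hop) * window, axis=1)) ** 2
    p_pow = np.abs(np.fft.rfft(frame_signal(processed, frame_len, hop) * window, axis=1)) ** 2
    diff_db = 10.0 * np.log10((c_pow + EPS) / (p_pow + EPS))
    return float(np.mean(np.sqrt(np.mean(diff_db ** 2, axis=1))))


if __name__ == "__main__":
    # Synthetic stand-in for speech: a 440 Hz tone at an assumed 8 kHz sampling rate.
    t = np.arange(2048) / 8000.0
    clean = np.sin(2 * np.pi * 440 * t)
    noisy = clean + 0.1 * np.random.default_rng(0).standard_normal(clean.size)
    print("segSNR (dB):", segmental_snr(clean, noisy))
    print("LSD (dB):", log_spectral_distance(clean, noisy))
```

Higher segmental SNR and lower log spectral distance both indicate that the processed signal is closer to the clean reference. Both measures need the clean signal, so they apply only to simulated or paired recordings.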