• DocumentCode
    1522751
  • Title

    Objective estimation of perceived speech quality. I. Development of the measuring normalizing block technique

  • Author

    Voran, Stephen

  • Author_Institution
    Inst. for Telecommun. Sci., NTIA/ITS, Boulder, CO, USA
  • Volume
    7
  • Issue
    4
  • fYear
    1999
  • fDate
    7/1/1999 12:00:00 AM
  • Firstpage
    371
  • Lastpage
    382
  • Abstract
    Perceived speech quality is most directly measured by subjective listening tests. These tests are often slow and expensive, and numerous attempts have been made to supplement them with objective estimators of perceived speech quality. These attempts have found limited success, primarily in analog and higher-rate, error-free digital environments where speech waveforms are preserved or nearly preserved. The objective estimation of the perceived quality of highly compressed digital speech, possibly with bit errors or frame erasures, has remained an open question. We report our findings regarding two essential components of objective estimators of perceived speech quality: perceptual transformations and distance measures. A perceptual transformation modifies a representation of an audio signal in a way that is approximately equivalent to the human hearing process. A distance measure reflects the magnitude of a perceived distance between two perceptually transformed signals. We then describe a new objective estimation approach that uses a simple but effective perceptual transformation and a distance measure that consists of a hierarchy of measuring normalizing blocks. Each measuring normalizing block integrates two perceptually transformed signals over some time or frequency interval to determine the average difference across that interval. This difference is then normalized out of one signal, and is further processed to generate one or more measurements
  • Keywords
    estimation theory; speech intelligibility; audio signal; bit errors; distance measure; distance measures; frame erasures; highly compressed digital speech; human hearing process; measuring normalizing block technique; objective estimation; objective estimators; perceived speech quality; perceptual transformation; perceptual transformations; representation; subjective listening; Area measurement; Auditory system; Bit rate; Costs; Delay; Signal processing; Speech codecs; Speech coding; Speech enhancement; System testing
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.771259
  • Filename
    771259