Title :
Objective estimation of perceived speech quality. I. Development of the measuring normalizing block technique
Author_Institution :
Inst. for Telecommun. Sci., NTIA/ITS, Boulder, CO, USA
fDate :
7/1/1999 12:00:00 AM
Abstract :
Perceived speech quality is most directly measured by subjective listening tests. These tests are often slow and expensive, and numerous attempts have been made to supplement them with objective estimators of perceived speech quality. These attempts have found limited success, primarily in analog and higher-rate, error-free digital environments where speech waveforms are preserved or nearly preserved. The objective estimation of the perceived quality of highly compressed digital speech, possibly with bit errors or frame erasures has remained an open question. We report our findings regarding two essential components of objective estimators of perceived speech quality: perceptual transformations and distance measures. A perceptual transformation modifies a representation of an audio signal in a way that is approximately equivalent to the human hearing process. A distance measure reflects the magnitude of a perceived distance between two perceptually transformed signals. We then describe a new objective estimation approach that uses a simple but effective perceptual transformation and a distance measure that consists of a hierarchy of measuring normalizing blocks. Each measuring normalizing block integrates two perceptually transformed signals over some time or frequency interval to determine the average difference across that interval. This difference is then normalized out of one signal, and is further processed to generate one or more measurements
Keywords :
estimation theory; speech intelligibility; audio signal; bit errors; distance measure; distance measures; frame erasures; highly compressed digital speech; human hearing process; measuring normalizing block technique; objective estimation; objective estimators; perceived speech quality; perceptual transformation; perceptual transformations; representation; subjective listening; Area measurement; Auditory system; Bit rate; Costs; Delay; Signal processing; Speech codecs; Speech coding; Speech enhancement; System testing;
Journal_Title :
Speech and Audio Processing, IEEE Transactions on