DocumentCode
1522751
Title
Objective estimation of perceived speech quality. I. Development of the measuring normalizing block technique
Author
Voran, Stephen
Author_Institution
Inst. for Telecommun. Sci., NTIA/ITS, Boulder, CO, USA
Volume
7
Issue
4
fYear
1999
fDate
7/1/1999
Firstpage
371
Lastpage
382
Abstract
Perceived speech quality is most directly measured by subjective listening tests. These tests are often slow and expensive, and numerous attempts have been made to supplement them with objective estimators of perceived speech quality. These attempts have found limited success, primarily in analog and higher-rate, error-free digital environments where speech waveforms are preserved or nearly preserved. The objective estimation of the perceived quality of highly compressed digital speech, possibly with bit errors or frame erasures, has remained an open question. We report our findings regarding two essential components of objective estimators of perceived speech quality: perceptual transformations and distance measures. A perceptual transformation modifies a representation of an audio signal in a way that is approximately equivalent to the human hearing process. A distance measure reflects the magnitude of a perceived distance between two perceptually transformed signals. We then describe a new objective estimation approach that uses a simple but effective perceptual transformation and a distance measure that consists of a hierarchy of measuring normalizing blocks. Each measuring normalizing block integrates two perceptually transformed signals over some time or frequency interval to determine the average difference across that interval. This difference is then normalized out of one signal and is further processed to generate one or more measurements.
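The measuring normalizing block described in the abstract can be sketched roughly as follows. This is a minimal illustration based only on the wording above, not the paper's implementation: the function name, the list-of-lists signal layout (rows are time frames, columns are frequency bins), and the choice of mean positive and mean negative differences as the measurements are all assumptions made for the example.

```python
def measuring_normalizing_block(ref, test, axis):
    """Hypothetical sketch of one measuring normalizing block.

    ref, test: perceptually transformed signals as lists of time frames,
               each frame a list of values per frequency bin (assumed layout).
    axis: "time" integrates over time (one difference per frequency bin);
          "frequency" integrates over frequency (one difference per frame).
    Returns (normalized_test, (pos, neg)).
    """
    n_frames, n_bins = len(ref), len(ref[0])

    if axis == "time":
        # Average difference across all frames, per frequency bin.
        diff = [sum(test[t][f] - ref[t][f] for t in range(n_frames)) / n_frames
                for f in range(n_bins)]
        # Normalize that difference out of the test signal.
        normalized = [[test[t][f] - diff[f] for f in range(n_bins)]
                      for t in range(n_frames)]
    elif axis == "frequency":
        # Average difference across all bins, per time frame.
        diff = [sum(test[t][f] - ref[t][f] for f in range(n_bins)) / n_bins
                for t in range(n_frames)]
        normalized = [[test[t][f] - diff[t] for f in range(n_bins)]
                      for t in range(n_frames)]
    else:
        raise ValueError("axis must be 'time' or 'frequency'")

    # Assumed measurements: mean positive and mean negative part of the difference.
    pos = sum(max(d, 0.0) for d in diff) / len(diff)
    neg = sum(max(-d, 0.0) for d in diff) / len(diff)
    return normalized, (pos, neg)


# Toy signals: 2 time frames x 2 frequency bins.
ref = [[1.0, 2.0], [3.0, 4.0]]
test = [[2.0, 2.0], [4.0, 3.0]]

# A two-level hierarchy: the normalized output of one block feeds the next.
stage1, m1 = measuring_normalizing_block(ref, test, "time")
stage2, m2 = measuring_normalizing_block(ref, stage1, "frequency")
```

Chaining the blocks this way is one reading of the "hierarchy" mentioned in the abstract: each stage removes the difference it has measured, so later stages measure only what remains.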
Keywords
estimation theory; speech intelligibility; audio signal; bit errors; distance measure; distance measures; frame erasures; highly compressed digital speech; human hearing process; measuring normalizing block technique; objective estimation; objective estimators; perceived speech quality; perceptual transformation; perceptual transformations; representation; subjective listening; Area measurement; Auditory system; Bit rate; Costs; Delay; Signal processing; Speech codecs; Speech coding; Speech enhancement; System testing
fLanguage
English
Journal_Title
Speech and Audio Processing, IEEE Transactions on
Publisher
IEEE
ISSN
1063-6676
Type
jour
DOI
10.1109/89.771259
Filename
771259