DocumentCode
3443812
Title
Vector quantization techniques for output-based objective speech quality
Author
Jin, Chiyi ; Kubichek, Robert
Author_Institution
Dept. of Electr. Eng., Wyoming Univ., Laramie, WY, USA
Volume
1
fYear
1996
fDate
7-10 May 1996
Firstpage
491
Abstract
Output-based speech quality (OBQ) refers to objective speech quality assessment using only received speech without utilizing the input speech record. This paper proposes three new OBQ measures and evaluates their performance. Parameters derived from perceptual linear prediction (PLP) coefficients are used to provide speaker independence required by the objective measures. PLP, PLP cepstrum, and PLP delta-cepstrum parameters are computed for output speech records from an undistorted source speech database and vector quantized. The resulting codebook provides a reference for computing objective distance measures for distorted speech. The proposed objective measures are the transition probability distance, the median minimum distance, and the chi-squared distance. The OBQ parameters are tested on four different speech datasets, and correlation is computed between subjective scores and objective distances under a variety of conditions. The results indicate that the proposed algorithms are robust against speaker, text, and distortion variation.
Keywords
cepstral analysis; correlation methods; linear predictive coding; parameter estimation; probability; speech coding; speech intelligibility; vector quantisation; PLP cepstrum; PLP coefficients; PLP delta-cepstrum parameters; VQ; chi-squared distance; codebook; correlation; distorted speech; median minimum distance; objective distances; output speech records; output-based objective speech quality; perceptual linear prediction coefficients; performance evaluation; received speech; robust algorithm; speaker independence; subjective scores; transition probability distance; undistorted source speech database; vector quantization; Cepstrum; Databases; Distortion measurement; Oral communication; Robustness; Speech coding; Speech enhancement; Speech processing; Testing; Vector quantization
fLanguage
English
Publisher
IEEE
Conference_Titel
Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
Conference_Location
Atlanta, GA
ISSN
1520-6149
Print_ISBN
0-7803-3192-3
Type
conf
DOI
10.1109/ICASSP.1996.541140
Filename
541140