• DocumentCode
    3443812
  • Title

    Vector quantization techniques for output-based objective speech quality

  • Author

    Jin, Chiyi ; Kubichek, Robert

  • Author_Institution
    Dept. of Electr. Eng., Wyoming Univ., Laramie, WY, USA
  • Volume
    1
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    491
  • Abstract
    Output-based speech quality (OBQ) refers to objective speech quality assessment using only received speech without utilizing the input speech record. This paper proposes three new OBQ measures and evaluates their performance. Parameters derived from perceptual linear prediction (PLP) coefficients are used to provide speaker independence required by the objective measures. PLP, PLP cepstrum, and PLP delta-cepstrum parameters are computed for output speech records from an undistorted source speech database and vector quantized. The resulting codebook provides a reference for computing objective distance measures for distorted speech. The proposed objective measures are the transition probability distance, the median minimum distance, and the chi-squared distance. The OBQ parameters are tested on four different speech datasets, and correlation is computed between subjective scores and objective distances under a variety of conditions. The results indicate that the proposed algorithms are robust against speaker, text, and distortion variation
  • Keywords
    cepstral analysis; correlation methods; linear predictive coding; parameter estimation; probability; speech coding; speech intelligibility; vector quantisation; PLP cepstrum; PLP coefficients; PLP delta-cepstrum parameters; VQ; chisquared distance; codebook; correlation; distorted speech; median minimum distance; objective distances; output speech records; output-based objective speech quality; perceptual linear prediction coefficients; performance evaluation; received speech; robust algorithm; speaker independence; subjective scores; transition probability distance; undistorted source speech database; vector quantization; Cepstrum; Databases; Distortion measurement; Oral communication; Robustness; Speech coding; Speech enhancement; Speech processing; Testing; Vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.541140
  • Filename
    541140