• DocumentCode
    1350236
  • Title

    Vector quantization based on Gaussian mixture models

  • Author

    Hedelin, Per ; Skoglund, Jan

  • Author_Institution
    Dept. of Signal & Syst. Eng., Chalmers Univ. of Technol., Goteborg, Sweden
  • Volume
    8
  • Issue
    4
  • fYear
    2000
  • fDate
    7/1/2000 12:00:00 AM
  • Firstpage
    385
  • Lastpage
    401
  • Abstract
    We model the underlying probability density function of vectors in a database as a Gaussian mixture (GM) model. The model is employed for high rate vector quantization analysis and for design of vector quantizers. It is shown that the high rate formulas accurately predict the performance of model-based quantizers. We propose a novel method for optimizing GM model parameters for high rate performance, and an extension to the EM algorithm for densities having bounded support is also presented. The methods are applied to quantization of LPC parameters in speech coding and we present new high rate analysis results for band-limited spectral distortion and outlier statistics. In practical terms, we find that an optimal single-stage VQ can operate at approximately 3 bits less than a state-of-the-art LSF-based 2-split VQ
  • Keywords
    Gaussian processes; bandlimited communication; linear predictive coding; optimisation; parameter estimation; probability; rate distortion theory; spectral analysis; speech coding; statistical analysis; vector quantisation; EM algorithm; Gaussian mixture models; LPC parameters; LSF-based 2-split VQ; band-limited spectral distortion; bounded support; database; high rate VQ analysis; high rate formulas; high rate performance; model parameters optimisation; model-based quantizers; optimal single-stage VQ; outlier statistics; probability density function; speech coding; vector quantization; Databases; Gaussian distribution; Linear predictive coding; Optimization methods; Predictive models; Probability density function; Speech analysis; Speech coding; Statistical analysis; Vector quantization;
  • fLanguage
    English
  • Journal_Title
    Speech and Audio Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1063-6676
  • Type

    jour

  • DOI
    10.1109/89.848220
  • Filename
    848220