• DocumentCode
    2803781
  • Title

    Gaussian-mixture modeling of lattice-based spherical vector quantization performance in transform audio coding

  • Author

    Patchoo, Wisarn ; Fischer, Thomas R.

  • Author_Institution
    Sch. of EECS, Washington State Univ., Pullman, WA, USA
  • fYear
    2010
  • fDate
    14-19 March 2010
  • Firstpage
    373
  • Lastpage
    376
  • Abstract
    A block-based Gaussian mixture model (GMM) is used to model the distribution of transform audio data to be encoded using lattice-based spherical vector quantization (LSVQ). The expectation-maximization algorithm is used to design the GMM to model the marginal density of the transform coefficients and the vector energy density. A GMM-based rate-distortion function is derived and shown to closely match the observed spherical VQ performance. The LSVQ transform audio coding performance is characterized for the best lattices known in 4, 8, 16, and 32 dimensions.
  • Keywords
    Gaussian processes; audio coding; expectation-maximisation algorithm; quantisation (signal); transform coding; block based Gaussian mixture model; expectation-maximization algorithm; lattice based spherical vector quantization; transform audio coding; Audio coding; Gaussian processes; Vector quantization; Gaussian distributions; audio coding; rate distortion theory; vector quantization;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics Speech and Signal Processing (ICASSP), 2010 IEEE International Conference on
  • Conference_Location
    Dallas, TX
  • ISSN
    1520-6149
  • Print_ISBN
    978-1-4244-4295-9
  • Electronic_ISBN
    1520-6149
  • Type

    conf

  • DOI
    10.1109/ICASSP.2010.5495823
  • Filename
    5495823