• DocumentCode
    1118057
  • Title

    High-Rate Optimized Recursive Vector Quantization Structures Using Hidden Markov Models

  • Author

    Duni, Ethan R. ; Rao, Bhaskar D.

  • Author_Institution
    Dept. of Electr. & Comput. Eng., California Univ., San Diego, La Jolla, CA
  • Volume
    15
  • Issue
    3
  • fYear
    2007
  • fDate
    3/1/2007 12:00:00 AM
  • Firstpage
    756
  • Lastpage
    769
  • Abstract
    This paper examines the design of recursive vector quantization systems built around Gaussian mixture vector quantizers. The problem of designing such systems for minimum high-rate distortion, under input-weighted squared error, is discussed. It is shown that, in high dimensions, the design problem becomes equivalent to a weighted maximum likelihood problem. A variety of recursive coding schemes, based on hidden Markov models are presented. The proposed systems are applied to the problem of wideband speech line spectral frequency (LSF) quantization under the log spectral distortion (LSD) measure. By combining recursive quantization and random coding techniques, the systems are able to attain transparent quality at rates as low as 36 bits per frame
  • Keywords
    Gaussian processes; hidden Markov models; maximum likelihood estimation; random codes; speech coding; vector quantisation; Gaussian mixture vector quantizers; hidden Markov models; high-rate optimized recursive vector quantization structures; input-weighted squared error; log spectral distortion measure; minimum high-rate distortion; random coding technique; recursive coding schemes; weighted maximum likelihood problem; wideband speech line spectral frequency quantization; Distortion measurement; Frequency measurement; Hidden Markov models; Image coding; Maximum likelihood estimation; Nonlinear distortion; Parameter estimation; Speech; Vector quantization; Wideband; Hidden Markov model (HMM); high-rate quantization; log spectral distortion (LSD); parameter estimation; wideband speech;
  • fLanguage
    English
  • Journal_Title
    Audio, Speech, and Language Processing, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1558-7916
  • Type

    jour

  • DOI
    10.1109/TASL.2006.885903
  • Filename
    4100675