• DocumentCode
    1043255
  • Title

    Theory and practice of vector quantizers trained on small training sets

  • Author

    Cohn, David ; Riskin, Eve A. ; Ladner, Richard

  • Author_Institution
    Dept. of Comput. Sci. & Eng., Washington Univ., Seattle, WA, USA
  • Volume
    16
  • Issue
    1
  • fYear
    1994
  • fDate
    1/1/1994 12:00:00 AM
  • Firstpage
    54
  • Lastpage
    65
  • Abstract
    Examines how the performance of a memoryless vector quantizer changes as a function of its training set size. Specifically, the authors study how well the training set distortion predicts test distortion when the training set is a randomly drawn subset of blocks from the test or training image(s). Using the Vapnik-Chervonenkis (VC) dimension, the authors derive formal bounds for the difference of test and training distortion of vector quantizer codebooks. The authors then describe extensive empirical simulations that test these bounds for a variety of codebook sizes and vector dimensions, and give practical suggestions for determining the training set size necessary to achieve good generalization from a codebook. The authors conclude that, by using training sets comprising only a small fraction of the available data, one can produce results that are close to the results obtainable when all available data are used
  • Keywords
    image coding; learning systems; statistics; vector quantisation; Vapnik-Chervonenkis dimension; empirical simulations; formal bounds; memoryless vector quantizer; small training sets; test distortion; training image; training set distortion; vector quantizer codebooks; Computational efficiency; Computer science; Data compression; Gray-scale; Image coding; Image storage; Pixel; Testing; Vector quantization; Virtual colonoscopy;
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0162-8828
  • Type

    jour

  • DOI
    10.1109/34.273717
  • Filename
    273717