• DocumentCode
    302884
  • Title

    Text segmentation in mixed-mode images using classification trees and transform tree-structured vector quantization

  • Author

    Perlmutter, Keren O. ; Chaddha, N. ; Buckheit, Jonnthan B. ; Gray, Robert M. ; Olshen, Richard A.

  • Author_Institution
    Inf. Syst. Lab., Stanford Univ., CA, USA
  • Volume
    4
  • fYear
    1996
  • fDate
    7-10 May 1996
  • Firstpage
    2231
  • Abstract
    Multimedia applications such as educational videos and color facsimile contain images that are rich in both textual and continuous tone data. Because these two types of data have different properties, segmentation of the images into text and continuous tone data can improve compression by allowing different compression parameters or even algorithms to be employed on the different types. We propose and compare algorithms that use classification trees (CLTR) or tree-structured vector quantization (TSVQ) for block-based classification in mixed-mode images. We also examine different types of features that can be used in these classifiers. The results show that using linear transform features with either the CLTR or TSVQ can be effective for accurate text classification. In addition, the results indicate that combining these classifiers with another TSVQ that is designed simultaneously to minimize both compression and classification error can provide better classification than does either system alone
  • Keywords
    coding errors; feature extraction; image classification; image coding; image segmentation; multimedia communication; transform coding; trees (mathematics); vector quantisation; TSVQ; block based classification; classification error; classification trees; color facsimile; compression error; compression parameters; continuous tone data; educational videos; image segmentation; linear transform features; mixed mode images; multimedia applications; text classification; text segmentation; textual data; transform tree structured vector quantization; Classification algorithms; Classification tree analysis; Costs; Facsimile; Image coding; Image edge detection; Image segmentation; Information systems; Vector quantization; Videos;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Acoustics, Speech, and Signal Processing, 1996. ICASSP-96. Conference Proceedings., 1996 IEEE International Conference on
  • Conference_Location
    Atlanta, GA
  • ISSN
    1520-6149
  • Print_ISBN
    0-7803-3192-3
  • Type

    conf

  • DOI
    10.1109/ICASSP.1996.545865
  • Filename
    545865