• DocumentCode
    384087
  • Title

    Character pattern extraction from colorful documents with complex backgrounds

  • Author

    Goto, Hideaki ; Aso, Hirotomo

  • Author_Institution
    Inf. Synergy Center, Tohoku Univ., Sendai, Japan
  • Volume
    3
  • fYear
    2002
  • fDate
    2002
  • Firstpage
    180
  • Abstract
    Today there are lots of documents in which text characters are printed on colored and/or complex backgrounds. We previously proposed a character pattern extraction method by which character patterns can be extracted from grayscale document images with complex background. The method has unique, advantageous properties; it is capable of extracting very small characters and is tolerant of shadings of images. However, the method did not work well for some color documents since it lacks the ability of discriminating color difference. This paper proposes an enhanced version of the method, which utilizes the local color segmentation and the region growing. The experimental results have shown that the new method yields much better results for color magazine covers without sacrificing the performance of extracting small character patterns. The method is tolerant of shadings of images as well.
  • Keywords
    document image processing; image colour analysis; image segmentation; optical character recognition; character pattern extraction; color difference; colorful documents; complex backgrounds; experimental results; grayscale document images; image shadings; local color segmentation; region growing; Brightness; Cameras; Character recognition; Color; Computer graphics; Data mining; Gray-scale; Image segmentation; Text analysis; Text recognition;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2002. Proceedings. 16th International Conference on
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-1695-X
  • Type

    conf

  • DOI
    10.1109/ICPR.2002.1047824
  • Filename
    1047824