• DocumentCode
    2975386
  • Title

    Gabor Filter Based Text Extraction from Digital Document Images

  • Author

    Qiao, Yu-Long ; Li, Meng ; Lu, Zhe-Ming ; Sun, Sheng-he

  • Author_Institution
    Harbin Institute of Technology, China; Harbin Engineering University, China
  • fYear
    2006
  • fDate
    Dec. 2006
  • Firstpage
    297
  • Lastpage
    300
  • Abstract
    The automatic text detection in document images is useful for many applications. This paper presents an algorithm that can automatically detect and extract text in digital document images. Firstly, we process and fuse Gabor filtered images at different orientations and scales and obtain an image that reflects the layout of the document image. Then, potential text regions are directly extracted from the resulting image. Finally, two criteria based on the geometrical property and high frequency content are adopted to kick-out those non-text regions. The experiments are performed on some representative images with different styles and with texts in different languages and fonts. Experimental results show that the algorithm works well on document images from a wide variety of source.
  • Keywords
    Algorithm design and analysis; Automatic control; Automatic testing; Content based retrieval; Data mining; Frequency; Gabor filters; Image retrieval; Image segmentation; Indexing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Information Hiding and Multimedia Signal Processing, 2006. IIH-MSP '06. International Conference on
  • Conference_Location
    Pasadena, CA, USA
  • Print_ISBN
    0-7695-2745-0
  • Type

    conf

  • DOI
    10.1109/IIH-MSP.2006.265002
  • Filename
    4041722