• DocumentCode
    3381341
  • Title

    Extraction of text in images

  • Author

    Malik, Rohit ; SeongAh, Chin

  • Author_Institution
    Dept. of Electr. Eng. & Comput. Eng., New Jersey Inst. of Technol., Newark, NJ, USA
  • fYear
    1999
  • fDate
    1999
  • Firstpage
    534
  • Lastpage
    537
  • Abstract
    In this paper we present a text segmentation technique that is useful in locating and extracting text blocks in images. The algorithm works without prior knowledge of the text orientation, size or font. It is designed to eliminate background image information and to highlight or identify the regions of the image that contain text. The algorithm uses the fact that text regions in an image may be identified by searching for several repeated instances of uniform gray intensity of approximately the same width. Combining this with the fact that the ratio of type-face stroke width to height is often fixed provides a useful technique for extracting text from images. Results of the application of this algorithm are presented
  • Keywords
    feature extraction; image segmentation; background image information elimination; image; text block extraction; text block location; text segmentation technique; type-face stroke width/height ratio; uniform gray intensity; Clustering algorithms; Computer science; Computer vision; Data mining; Humans; Image segmentation; Layout; Read only memory; Shape; Smoothing methods;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Intelligence and Systems, 1999. Proceedings. 1999 International Conference on
  • Conference_Location
    Bethesda, MD
  • Print_ISBN
    0-7695-0446-9
  • Type

    conf

  • DOI
    10.1109/ICIIS.1999.810343
  • Filename
    810343