• DocumentCode
    2144330
  • Title

    Detection and Segmentation of Antialiased Text in Screen Images

  • Author

    Gleichman, Sivan ; Ophir, Boaz ; Geva, Amir ; Marder, Mattias ; Barkan, Ella ; Packer, Eli

  • Author_Institution
    IBM Res., Haifa, Israel
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    424
  • Lastpage
    428
  • Abstract
    Various software applications deal with analyzing the textual content of screen captures. Interpreting these images as text poses several challenges, relative to images traditionally handled by optical character recognition (OCR) engines. One such challenge is caused by text antialiasing, a technique which blurs the edges of characters, to reduce jagged appearance. This blurring changes the character images according to context, and can sometimes fuse them together. In this paper, we offer a low-cost method that can be used as a preprocessing stage, prior to OCR. Our method locates antialiased text in a screen image and segments it into separate character images. Our proposed algorithm significantly improves OCR results, particularly in images with colored text of small font size, such as in graphic user interface (GUI) screens.
  • Keywords
    image segmentation; optical character recognition; text analysis; OCR engines; antialiased text detection; antialiased text segmentation; character edge blurring; character images; optical character recognition; screen captures; screen images; software applications; text antialiasing; textual content; Engines; Gray-scale; Histograms; Image color analysis; Image segmentation; Optical character recognition software; User interfaces; antialiasing; character segmentation; text detection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type

    conf

  • DOI
    10.1109/ICDAR.2011.92
  • Filename
    6065347