• DocumentCode
    1580582
  • Title

    Detection of word groups based on irregular pyramid

  • Author

    Loo, Poh Kok ; Tan, Chew Lim

  • Author_Institution
    School of the Built Environment & Design, Singapore Polytechnic, Singapore
  • fYear
    2001
  • fDate
    2001
  • Firstpage
    200
  • Lastpage
    204
  • Abstract
    This paper proposes a new algorithm for detecting word groups in imaged documents using an irregular pyramid. The algorithm is distinctive in that it includes strategic background information in the analysis, which most techniques discard. Both the foreground (i.e., text-area) regions and portions of the background (i.e., white-area) regions are examined. The algorithm rests on the concept of "closeness": text within a group is closer, in terms of spatial distance, to other text in the same group than to text areas outside it. The results are encouraging: the algorithm correctly groups words of different sizes, fonts, arrangements and orientations.
  • Keywords
    document image processing; image segmentation; background regions; fonts; foreground regions; imaged documents; irregular pyramid; spatial distance; strategic background information; text area; text information closeness; white area; word arrangements; word group detection algorithm; word orientations; word size; Algorithm design and analysis; Computational efficiency; Cost benefit analysis; Data mining; Image analysis; Image processing; Information analysis; Labeling; Merging; Performance analysis
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
  • Conference_Location
    Seattle, WA
  • Print_ISBN
    0-7695-1263-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2001.953783
  • Filename
    953783