• DocumentCode
    2630831
  • Title

    Language-free layout analysis

  • Author

    Ittner, D.J. ; Baird, H.S.

  • Author_Institution
    AT&T Bell Lab., Murray Hill, NJ, USA
  • fYear
    1993
  • fDate
    20-22 Oct 1993
  • Firstpage
    336
  • Lastpage
    340
  • Abstract
    A system for isolating blocks, lines, words, and symbols within images of machine-printed textual documents that is, to a large existent, independent of language and writing system is described. This is achieved by exploiting a small number of nearly universal typesetting and layout conventions. The system does not require prior knowledge of page orientation (module 90°), and copes well with nonzero skew and shear angles (within 10°). Also it locates blocks of text without reliance on detailed a priori layout models, and in spite of unknown or mixed horizontal and vertical text-line orientations. Within blocks, it infers text-line orientation and isolates lines, without knowledge of the language, symbol set, text sizes, or the number of text lines. Segmentation into words and symbols, and determination of reading order, normally require some knowledge of the language: this is held to minimum by relying on shape-driven algorithms. The underlying algorithms are based on Fourier theory, digital signal processing, computational geometry, and statistical decision theory. Most of the computation occurs within algorithms that possess unambiguous semantics (that is, heuristics are kept to a minimum). The effectiveness of the method on English, Japanese, Hebrew, Thai, and Korean documents is discussed
  • Keywords
    computational geometry; document handling; image segmentation; Fourier theory; computational geometry; digital signal processing; language free layout analysis; layout conventions; machine-printed textual documents; nearly universal typesetting; page orientation; reading order; shape-driven algorithms; statistical decision theory; symbol set; text lines; text sizes; unambiguous semantics; vertical text-line orientations; Asia; Decision theory; Digital signal processing; Graphics; Natural languages; Runtime; Signal processing algorithms; Testing; Typesetting; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1993., Proceedings of the Second International Conference on
  • Conference_Location
    Tsukuba Science City
  • Print_ISBN
    0-8186-4960-7
  • Type

    conf

  • DOI
    10.1109/ICDAR.1993.395720
  • Filename
    395720