• DocumentCode
    3330756
  • Title

    A language for document generic layout description and its use for segmentation into regions

  • Author

    Azokly, Antoine ; Ingold, Rolf

  • Author_Institution
    Inst. of Inf., Fribourg Univ., Switzerland
  • Volume
    2
  • fYear
    1995
  • fDate
    14-16 Aug 1995
  • Firstpage
    1123
  • Abstract
    We present a segmentation method guided by a generic layout description expressed in a new language. The proposed language allows to describe a page as superposed layers that may be used to separate the main text body from other components, for example figures. The language´s novelty resides in the fact that, instead of describing directly the global topology of generic pages according to their regions, generic separators are described and used as region boundary delimiters. Separators may be declared as white spaces or threads. By doing this, the problem of document segmentation into regions has become a problem of separator determination, solved by analyzing lines and white spaces contained in documents
  • Keywords
    document image processing; image segmentation; page description languages; document generic layout description language; generic layout description; region boundary delimiters; segmentation method; separator determination; Image analysis; Image recognition; Image segmentation; Layout; Page description languages; Particle separators; Strontium; Text analysis; White spaces; Writing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 1995., Proceedings of the Third International Conference on
  • Conference_Location
    Montreal, Que.
  • Print_ISBN
    0-8186-7128-9
  • Type

    conf

  • DOI
    10.1109/ICDAR.1995.602117
  • Filename
    602117