• DocumentCode
    2870540
  • Title

    Automatic generation of structured hyperdocuments from multi-column document images

  • Author

    Lee, Ji-Yeon ; Choi, Song-Ha ; Lee, Seong-Whan

  • Author_Institution
    Center for Artificial Vision Res., Korea Univ., Seoul, South Korea
  • Volume
    4
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    422
  • Abstract
    We propose two methods for converting complex multi-column document images into HTML documents, and a method for generating a structured table of contents (ToC) page based on the logical structure analysis of the document image. Experiments with various kinds of multi-column document images show that HTML documents corresponding to the paper documents can be generated in a visual layout, and that their structured table of contents page, with the hierarchically ordered section titles hyperlinked to the contents, can be also produced by the proposed methods
  • Keywords
    document image processing; hypermedia markup languages; merging; HTML documents; multi-column document images; structured hyperdocuments; structured table of contents; HTML; Image analysis; Image converters; Image segmentation; Internet; Merging; Research initiatives; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Pattern Recognition, 2000. Proceedings. 15th International Conference on
  • Conference_Location
    Barcelona
  • ISSN
    1051-4651
  • Print_ISBN
    0-7695-0750-6
  • Type

    conf

  • DOI
    10.1109/ICPR.2000.902948
  • Filename
    902948