DocumentCode
2870540
Title
Automatic generation of structured hyperdocuments from multi-column document images
Author
Lee, Ji-Yeon ; Choi, Song-Ha ; Lee, Seong-Whan
Author_Institution
Center for Artificial Vision Res., Korea Univ., Seoul, South Korea
Volume
4
fYear
2000
fDate
2000
Firstpage
422
Abstract
We propose two methods for converting complex multi-column document images into HTML documents, and a method for generating a structured table of contents (ToC) page based on the logical structure analysis of the document image. Experiments with various kinds of multi-column document images show that HTML documents corresponding to the paper documents can be generated in a visual layout, and that their structured table of contents page, with the hierarchically ordered section titles hyperlinked to the contents, can be also produced by the proposed methods
Keywords
document image processing; hypermedia markup languages; merging; HTML documents; multi-column document images; structured hyperdocuments; structured table of contents; HTML; Image analysis; Image converters; Image segmentation; Internet; Merging; Research initiatives; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location
Barcelona
ISSN
1051-4651
Print_ISBN
0-7695-0750-6
Type
conf
DOI
10.1109/ICPR.2000.902948
Filename
902948
Link To Document