DocumentCode :
2870540
Title :
Automatic generation of structured hyperdocuments from multi-column document images
Author :
Lee, Ji-Yeon ; Choi, Song-Ha ; Lee, Seong-Whan
Author_Institution :
Center for Artificial Vision Res., Korea Univ., Seoul, South Korea
Volume :
4
fYear :
2000
fDate :
2000
Firstpage :
422
Abstract :
We propose two methods for converting complex multi-column document images into HTML documents, and a method for generating a structured table of contents (ToC) page based on the logical structure analysis of the document image. Experiments with various kinds of multi-column document images show that HTML documents corresponding to the paper documents can be generated in a visual layout, and that their structured table of contents page, with the hierarchically ordered section titles hyperlinked to the contents, can be also produced by the proposed methods
Keywords :
document image processing; hypermedia markup languages; merging; HTML documents; multi-column document images; structured hyperdocuments; structured table of contents; HTML; Image analysis; Image converters; Image segmentation; Internet; Merging; Research initiatives; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Pattern Recognition, 2000. Proceedings. 15th International Conference on
Conference_Location :
Barcelona
ISSN :
1051-4651
Print_ISBN :
0-7695-0750-6
Type :
conf
DOI :
10.1109/ICPR.2000.902948
Filename :
902948
Link To Document :
بازگشت