DocumentCode
2816217
Title
Recognition and identification of form document layouts
Author
Luo, Kai ; Latifi, Shahram ; Taghva, Kazem ; Regentova, Emma
Author_Institution
Nevada Univ., Las Vegas, NV, USA
Volume
2
fYear
2004
fDate
5-7 April 2004
Firstpage
352
Abstract
We introduce a hierarchical tree representation to represent the logical structure of a form document. Different forms might have the same logical structure, so the representation will be ambiguous. We propose an improvement to solve the ambiguity problem by using the physical information of the blocks. A pixel tracing approach is used to extract form layout structures from form documents. Compared with Hough transform, it requires less computation. This algorithm has been tested on 50 different table forms. The algorithm applies to table form documents.
Keywords
Hough transforms; document handling; frame based representation; pattern recognition; Hough transform; ambiguity problem; form document; form layout extraction; hierarchical tree representation; logical structure; pixel tracing; table form document; Character recognition; Data mining; Face detection; Government; Image databases; Information technology; Optical character recognition software; Strips; Testing; Tree graphs;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN
0-7695-2108-8
Type
conf
DOI
10.1109/ITCC.2004.1286662
Filename
1286662
Link To Document