DocumentCode
1580582
Title
Detection of word groups based on irregular pyramid
Author
Loo, Poh Kok ; Tan, Chew Lim
Author_Institution
Sch. of the Built Environ. & Design, Singapore Polytech., Singapore
fYear
2001
fDate
6/23/1905 12:00:00 AM
Firstpage
200
Lastpage
204
Abstract
This paper proposes a new algorithm to detect word groups in imaged documents, using an irregular pyramid. The uniqueness of this algorithm is its inclusion of strategic background information in the analysis, which most techniques have discarded. Both the foreground (i.e. text-area) and portions of the background (i.e. white-area) regions are examined. The fundamental aspect of the algorithm is based on the concept of "closeness", where text information within a group is closer to other text information within the group, in terms of spatial distance, compared to other text areas. The result produced by the algorithm is encouraging, with the ability to correctly group words of different sizes, fonts, arrangements and orientations
Keywords
document image processing; image segmentation; background regions; fonts; foreground regions; imaged documents; irregular pyramid; spatial distance; strategic background information; text area; text information closeness; white area; word arrangements; word group detection algorithm; word orientations; word size; Algorithm design and analysis; Computational efficiency; Cost benefit analysis; Data mining; Image analysis; Image processing; Information analysis; Labeling; Merging; Performance analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location
Seattle, WA
Print_ISBN
0-7695-1263-1
Type
conf
DOI
10.1109/ICDAR.2001.953783
Filename
953783
Link To Document