DocumentCode :
1580582
Title :
Detection of word groups based on irregular pyramid
Author :
Loo, Poh Kok ; Tan, Chew Lim
Author_Institution :
Sch. of the Built Environ. & Design, Singapore Polytech., Singapore
fYear :
2001
fDate :
2001
Firstpage :
200
Lastpage :
204
Abstract :
This paper proposes a new algorithm to detect word groups in imaged documents using an irregular pyramid. The algorithm is distinctive in including strategic background information in the analysis, which most techniques discard. Both the foreground (i.e., text-area) regions and portions of the background (i.e., white-area) regions are examined. The algorithm rests on the concept of "closeness": text within a group is closer, in terms of spatial distance, to other text in the same group than to text outside it. The results are encouraging: the algorithm correctly groups words of different sizes, fonts, arrangements and orientations.
Keywords :
document image processing; image segmentation; background regions; fonts; foreground regions; imaged documents; irregular pyramid; spatial distance; strategic background information; text area; text information closeness; white area; word arrangements; word group detection algorithm; word orientations; word size; Algorithm design and analysis; Computational efficiency; Cost benefit analysis; Data mining; Image analysis; Image processing; Information analysis; Labeling; Merging; Performance analysis;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Document Analysis and Recognition, 2001. Proceedings. Sixth International Conference on
Conference_Location :
Seattle, WA
Print_ISBN :
0-7695-1263-1
Type :
conf
DOI :
10.1109/ICDAR.2001.953783
Filename :
953783