DocumentCode :
514940
Title :
Run-Based Approach to Labeling Connected Components in Document Images
Author :
Tu, Xiao ; Lu, Yue
Author_Institution :
Dept. of Comput. Sci. & Technol., East China Normal Univ., Shanghai, China
Volume :
2
fYear :
2010
fDate :
6-7 March 2010
Firstpage :
206
Lastpage :
209
Abstract :
This paper proposes a fast algorithm for labeling connected components in binary document images. Runs are extracted from the image row by row. The positional relations between the runs of each current row and the runs of its preceding row are represented as trees, where each tree corresponds to one connected component. The approach requires only a single scan of the image to obtain the characteristics of the connected components, such as bounding rectangle, area, and number of pixels. Experimental results show that the proposed algorithm is faster than conventional labeling algorithms.
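The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the tree structure or the connectivity rule, so this sketch assumes a union-find forest (one tree per component) and 8-connectivity by default. The names `extract_runs`, `find`, and `label_components` are hypothetical.

```python
def extract_runs(row):
    """Return inclusive (start, end) column pairs for each run of foreground pixels."""
    runs, start = [], None
    for x, value in enumerate(row):
        if value and start is None:
            start = x
        elif not value and start is not None:
            runs.append((start, x - 1))
            start = None
    if start is not None:
        runs.append((start, len(row) - 1))
    return runs


def find(parent, i):
    """Return the root of the tree containing run i (with path halving)."""
    while parent[i] != i:
        parent[i] = parent[parent[i]]
        i = parent[i]
    return i


def label_components(image, connectivity=8):
    """Single pass over the rows; returns (min_x, min_y, max_x, max_y, pixels) per component."""
    parent = []  # parent[i] is the tree parent of run i (assumed union-find forest)
    stats = []   # stats[root] = [min_x, min_y, max_x, max_y, pixel_count]
    prev = []    # runs of the preceding row as (start, end, run_id)
    slack = 1 if connectivity == 8 else 0  # diagonal contact counts under 8-connectivity
    for y, row in enumerate(image):
        cur = []
        for s, e in extract_runs(row):
            rid = len(parent)
            parent.append(rid)
            stats.append([s, y, e, y, e - s + 1])
            for ps, pe, pid in prev:
                if ps <= e + slack and pe >= s - slack:
                    a, b = find(parent, pid), find(parent, rid)
                    if a != b:
                        # Attach the newer tree to the older one and merge statistics.
                        parent[b] = a
                        sa, sb = stats[a], stats[b]
                        stats[a] = [min(sa[0], sb[0]), min(sa[1], sb[1]),
                                    max(sa[2], sb[2]), max(sa[3], sb[3]),
                                    sa[4] + sb[4]]
            cur.append((s, e, rid))
        prev = cur
    return [tuple(stats[i]) for i in range(len(parent)) if parent[i] == i]
```

Because each run is compared only with the runs of the row directly above it, the bounding rectangle and pixel count of every component are available as soon as the last row has been scanned, with no second pass over the pixels.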
Keywords :
document image processing; image recognition; binary document images; document image recognition systems; labeling connected components; run-based approach; Computer science; Computer science education; Data mining; Educational technology; Flowcharts; Image analysis; Image storage; Labeling; Pixel; Text analysis; connected component; document image analysis; run-based; tree;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Education Technology and Computer Science (ETCS), 2010 Second International Workshop on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-6388-6
Electronic_ISBN :
978-1-4244-6389-3
Type :
conf
DOI :
10.1109/ETCS.2010.424
Filename :
5459935