DocumentCode :
2681314
Title :
Complex document image segmentation using localized histogram analysis with multi-layer matching and clustering
Author :
Chen, Yen-Lin ; Chiu, Chung-Cheng ; Wu, Bing-Fei
Author_Institution :
Dept. of Electr. & Control Eng., Nat. Chiao Tung Univ., Hsinchu, Taiwan
Volume :
4
fYear :
2004
fDate :
10-13 Oct. 2004
Firstpage :
3063
Abstract :
This paper proposes a new segmentation method to separate the text from various complex document images. An automatic multilevel thresholding method, based on discriminant analysis, is utilized to recursively segment a specified block region into several layered image sub-blocks. Then the multi-layer region based clustering method is performed to process the layered image sub-blocks to form several object layers. Hence character strings with different illuminations, nontext objects and background components are segmented into separate object layers. After performed text extraction process, the text objects with different sizes, styles and illuminations are properly extracted. Experimental results on the extraction of text strings from complex document images demonstrate the effectiveness of the proposed region-based segmentation method.
Keywords :
document image processing; feature extraction; image matching; image segmentation; statistical analysis; text analysis; automatic multilevel thresholding; complex document image segmentation; discriminant analysis; localized histogram analysis; multi-layer clustering; multi-layer matching; text extraction process; text strings; Clustering methods; Control engineering; Data mining; Histograms; Image analysis; Image color analysis; Image segmentation; Image texture analysis; Lighting; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Systems, Man and Cybernetics, 2004 IEEE International Conference on
ISSN :
1062-922X
Print_ISBN :
0-7803-8566-7
Type :
conf
DOI :
10.1109/ICSMC.2004.1400809
Filename :
1400809
Link To Document :
بازگشت