Title :
Boundary feature extraction from gray-scale document images
Author :
Nishida, Hirobumi
Author_Institution :
Res. & Dev. Center, Ricoh Co. Ltd., Yokohama, Japan
Abstract :
A novel method is presented for extracting closed boundaries of document components such as characters and symbols directly from gray-scale document images based on the surface data structures along with structural features. The method is based on the simple model assuming that a closed boundary of document components can be approximated as a series of horizontal line segments and can be extracted by linking surface components with steep gradients which share commonly intersecting horizontal planes. The proposed algorithm is compared with some binarization algorithms, shown to be effective for improving recognition accuracy for very poor quality data
Keywords :
data structures; document image processing; edge detection; feature extraction; binarization algorithms; boundary feature extraction; characters; closed boundary extraction; commonly intersecting horizontal planes; document components; gray-scale document images; horizontal line segments; recognition accuracy; steep gradients; structural features; surface data structures; symbols; very poor quality data; Data mining; Data structures; Digital images; Feature extraction; Gray-scale; Image analysis; Image recognition; Optical character recognition software; Optical devices; Optical distortion;
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
DOI :
10.1109/ICDAR.1997.619828