Title :
Use of the Hough transform to separate merged text/graphics in forms
Author_Institution :
Inst. for Inf. Technol., Daimler Benz AG Res. Center, Ulm, Germany
fDate :
30 Aug-3 Sep 1992
Abstract :
Presents a new method for the separation of merged text/form-structure components in forms. The technique described uses a modified version of the Hough transform to detect the structure of the form. The closed contours of the connected components are approximated by piecewise linear line segments. The parameters of the Hesse normal form of each line segment serve as input for the Hough transform. Compared to the vectorized boundary of characters, the lines of the form structure consist of appreciable more line segments with the same orientation and distance. So, the problem of the form structure detection in the database of line segments can be reduced to the detection of local peaks in the Hough space. Subsequent processing steps reconstruct the remaining contour fragments to characters
Keywords :
Hough transforms; character recognition; document image processing; image segmentation; Hesse normal form; Hough transform; character recognition; contour fragments; document processing; form readers; form structure detection; image segmentation; piecewise linear line segments; text-graphics separation; Computational efficiency; Databases; Ear; Graphics; Image analysis; Image segmentation; Information technology; Iterative algorithms; Optical filters; Search methods;
Conference_Titel :
Pattern Recognition, 1992. Vol.II. Conference B: Pattern Recognition Methodology and Systems, Proceedings., 11th IAPR International Conference on
Conference_Location :
The Hague
Print_ISBN :
0-8186-2915-0
DOI :
10.1109/ICPR.1992.201770