Title :
Reading newspaper text
Author :
Lam, Stephen W. ; Wang, Dacheng ; Srihari, Sargur N.
Author_Institution :
Dept. of Comput. Sci., State Univ. of New York, Buffalo, NY, USA
Abstract :
The authors describe a method for segmenting a newspaper page image into labeled macro components (blocks) and recognizing the content. Connected component analysis is used to segment a newspaper image into several rectangular blocks and to filter connected components into character and noncharacter components. Textural analysis is then used to classify the remaining noncharacter components into graphics and photographs. Experimental results indicate that these techniques work very well
Keywords :
computerised pattern recognition; computerised picture processing; document image processing; connected component analysis; graphics; newspaper page image; newspaper text; photographs; segmentation; textural analysis; Character recognition; Computer science; Filtering; Graphics; Gray-scale; Image analysis; Image recognition; Image segmentation; Labeling; Merging;
Conference_Titel :
Pattern Recognition, 1990. Proceedings., 10th International Conference on
Conference_Location :
Atlantic City, NJ
Print_ISBN :
0-8186-2062-5
DOI :
10.1109/ICPR.1990.118197