DocumentCode
3381341
Title
Extraction of text in images
Author
Malik, Rohit ; SeongAh, Chin
Author_Institution
Dept. of Electr. Eng. & Comput. Eng., New Jersey Inst. of Technol., Newark, NJ, USA
fYear
1999
fDate
1999
Firstpage
534
Lastpage
537
Abstract
In this paper we present a text segmentation technique that is useful in locating and extracting text blocks in images. The algorithm works without prior knowledge of the text orientation, size or font. It is designed to eliminate background image information and to highlight or identify the regions of the image that contain text. The algorithm uses the fact that text regions in an image may be identified by searching for several repeated instances of uniform gray intensity of approximately the same width. Combining this with the fact that the ratio of type-face stroke width to height is often fixed provides a useful technique for extracting text from images. Results of the application of this algorithm are presented
Keywords
feature extraction; image segmentation; background image information elimination; image; text block extraction; text block location; text segmentation technique; type-face stroke width/height ratio; uniform gray intensity; Clustering algorithms; Computer science; Computer vision; Data mining; Humans; Image segmentation; Layout; Read only memory; Shape; Smoothing methods;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Intelligence and Systems, 1999. Proceedings. 1999 International Conference on
Conference_Location
Bethesda, MD
Print_ISBN
0-7695-0446-9
Type
conf
DOI
10.1109/ICIIS.1999.810343
Filename
810343
Link To Document