DocumentCode :
2353113
Title :
Korean Text Detection and Binarization in Color Signboards
Author :
Toan Nguyen Dinh ; Park, Jonghyun ; Lee, Gueesang
Author_Institution :
Dept. of Electron. & Comput. Eng., Chonnam Nat. Univ., Gwangju
fYear :
2008
fDate :
23-25 July 2008
Firstpage :
235
Lastpage :
240
Abstract :
Text detection and binarization are very important steps in text understanding. Natural scene images bring new challenges to correctly extract interested text regions. In the paper, efficient text detection and binarization methods are used to extract Korean text from color signboards of shops. First, we detect the main Korean text in color signboards by combining the horizontal edge profile in the image and some knowledge of the Korean text specified in our application. Then the detected Korean text is segmented to Korean words by using the vertical edge profile. Finally, each Korean word is fed into binarization module which uses 3-means clustering method in L*a*b* color space to extract text from the background. The experimental results show that our method successfully extracts text with low complexity and therefore can be used in the mobile devices which have limited capability.
Keywords :
character recognition; image colour analysis; natural language processing; text analysis; 3-means clustering method; Korean text detection; Korean text extraction; Korean words; binarization module; color space; image horizontal edge profile; natural scene images; shop color signboards; text understanding; vertical edge profile; Clustering methods; Data mining; Gray-scale; Image color analysis; Image edge detection; Image segmentation; Information technology; Layout; Lighting; Natural languages; color signboard; text binarization; text detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Advanced Language Processing and Web Information Technology, 2008. ALPIT '08. International Conference on
Conference_Location :
Dalian Liaoning
Print_ISBN :
978-0-7695-3273-8
Type :
conf
DOI :
10.1109/ALPIT.2008.41
Filename :
4584373
Link To Document :
بازگشت