DocumentCode
2853595
Title
Hybrid Chinese/English text identification in Web images
Author
Jiaying He ; Shaofa Li
fYear
2004
fDate
18-20 Dec. 2004
Firstpage
361
Lastpage
364
Abstract
This paper presents a novel algorithm for hybrid Chinese/English text location, segmentation, and Chinese character reconstruction in Web images. Because Web images have characteristics that distinguish them from conventional complex-background images, most text segmentation algorithms that perform well in other fields fail to recognize text in Web images. The proposed algorithm locates and segments hybrid text in Web images and retrieves complete Chinese characters using a novel character reconstruction algorithm. Experimental results show that the approach achieves a high text detection rate and fast processing speed when identifying Web image text, and gives promising results in segmenting oriental text symbols such as Chinese, Japanese, and Korean characters.
Keywords
Internet; character recognition; image reconstruction; image segmentation; text analysis; Chinese character reconstruction; Web image text; complex background image; hybrid Chinese-English text identification; text detection; text segmentation algorithm; Graphics; Helium; Image analysis; Image color analysis; Image retrieval; World Wide Web
fLanguage
English
Publisher
IEEE
Conference_Titel
Image and Graphics (ICIG'04), Third International Conference on
Conference_Location
Hong Kong, China
Print_ISBN
0-7695-2244-0
Type
conf
DOI
10.1109/ICIG.2004.78
Filename
1410459