DocumentCode :
2013682
Title :
Integrated Segmentation and Recognition of Mixed Chinese/English Document
Author :
Xia, Yong ; Xiao, Bai-Hua ; Wang, Chun-Heng ; Dai, Ru-Wei
Author_Institution :
Chinese Acad. of Sci., Beijing
Volume :
2
fYear :
2007
fDate :
23-26 Sept. 2007
Firstpage :
704
Lastpage :
708
Abstract :
This paper presents a general frame to integrate segmentation and recognition and gives a novel method to identify lingual attribute of mixed Chinese/English characters. The outstanding performance of this method is as follows. First, a text- line rather than a character segment is regarded as a process unit. Second, multi-feature is adopted based on multi-phase segmentation. Third, two types of feedbacks, including from character recognition and from character feature statistic within a text-line, are adopted throughout the whole segmentation and recognition. Fourth, it is adaptive to the quality and genre of documents.
Keywords :
character recognition; document image processing; image recognition; image segmentation; character feature statistics; character recognition; integrated recognition; integrated segmentation; mixed Chinese/English characters; mixed Chinese/English document; multiphase segmentation; Automation; Character recognition; Engines; Feature extraction; Feedback; Intelligent systems; Laboratories; Natural languages; Optical character recognition software; Statistics;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
ISSN :
1520-5363
Print_ISBN :
978-0-7695-2822-9
Type :
conf
DOI :
10.1109/ICDAR.2007.4377006
Filename :
4377006
Link To Document :
بازگشت