Title :
Integrated Segmentation and Recognition of Mixed Chinese/English Document
Author :
Xia, Yong ; Xiao, Bai-Hua ; Wang, Chun-Heng ; Dai, Ru-Wei
Author_Institution :
Chinese Acad. of Sci., Beijing
Abstract :
This paper presents a general framework for integrating segmentation and recognition, and gives a novel method for identifying the language attribute of mixed Chinese/English characters. The method has four distinguishing features. First, a text-line rather than a character segment is treated as the processing unit. Second, multiple features are adopted, based on multi-phase segmentation. Third, two types of feedback, one from character recognition and one from character feature statistics within a text-line, are used throughout segmentation and recognition. Fourth, the method adapts to the quality and genre of documents.
Keywords :
character recognition; document image processing; image recognition; image segmentation; character feature statistics; integrated recognition; integrated segmentation; mixed Chinese/English characters; mixed Chinese/English document; multiphase segmentation; Automation; Character recognition; Engines; Feature extraction; Feedback; Intelligent systems; Laboratories; Natural languages; Optical character recognition software; Statistics
Conference_Title :
Document Analysis and Recognition, 2007. ICDAR 2007. Ninth International Conference on
Conference_Location :
Parana
Print_ISBN :
978-0-7695-2822-9
DOI :
10.1109/ICDAR.2007.4377006