DocumentCode :
153404
Title :
Graph Model Optimization Based Historical Chinese Character Segmentation Method
Author :
Jingning Ji ; Liangrui Peng ; Bohan Li
Author_Institution :
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear :
2014
fDate :
7-10 April 2014
Firstpage :
282
Lastpage :
286
Abstract :
Historical Chinese document recognition technology is important for digital library. However, historical Chinese character segmentation remains a difficult problem due to the complex structure of Chinese characters and various writing styles. This paper presents a novel method for historical Chinese character segmentation based on graph model. After a preliminary over-segmentation stage, the system applies a merging process. The candidate segmentation positions are denoted by the nodes of a graph, and the merging process is regarded as selecting an optimal path of the graph. The weight of edge in the graph is calculated by the cost function which considers geometric features and recognition confidence. Experimental results show that the proposed method is effective with a detection rate of 94.6% and an accuracy rate of 96.1% on a test set of practical historical Chinese document samples.
Keywords :
character recognition; character sets; digital libraries; document image processing; graph theory; image segmentation; merging; cost function; digital library; geometric features; graph model optimization; historical Chinese character segmentation method; historical Chinese document recognition technology; merging process; Accuracy; Character recognition; Cost function; Image edge detection; Image segmentation; Merging; Text analysis; character segmentation; graph model; historical Chinese document;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis Systems (DAS), 2014 11th IAPR International Workshop on
Conference_Location :
Tours
Print_ISBN :
978-1-4799-3243-6
Type :
conf
DOI :
10.1109/DAS.2014.57
Filename :
6831014
Link To Document :
بازگشت