DocumentCode
2145565
Title
A Novel Short Merged Off-line Handwritten Chinese Character String Segmentation Algorithm Using Hidden Markov Model
Author
Jiang, Zhiwei ; Ding, Xiaoqing ; Liu, Changsong ; Wang, Yanwei
Author_Institution
Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
fYear
2011
fDate
18-21 Sept. 2011
Firstpage
668
Lastpage
672
Abstract
Hidden Markov model (called "HMM" for short) has been a widespread method to segment sequential data in speech recognition and DNA sequence analysis. According to the same principle, it can be also used in segmenting short merged off-line handwritten Chinese character strings, which is a tough issue but often met in practice. Because HMM is still not a common method in this field nowadays, in this paper, we will introduce a novel algorithm using HMM for the segmentation issue above. Eventually, this segmentation algorithm can achieve an applicable performance even when 3755 character classes are compressed into similar characters classes with only 1% amount of original ones, and it also shows an enormous potential of segmenting long text lines.
Keywords
handwritten character recognition; hidden Markov models; image segmentation; optical character recognition; DNA sequence analysis; hidden Markov model; merged offline handwritten Chinese character string segmentation; sequential data; speech recognition; Algorithm design and analysis; Character recognition; Decoding; Handwriting recognition; Hidden Markov models; Merging; Training; HMM; merged handwritten Chinese characters; merging similar characters; string segmentation;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition (ICDAR), 2011 International Conference on
Conference_Location
Beijing
ISSN
1520-5363
Print_ISBN
978-1-4577-1350-7
Electronic_ISBN
1520-5363
Type
conf
DOI
10.1109/ICDAR.2011.140
Filename
6065395
Link To Document