• DocumentCode
    2145565
  • Title
    A Novel Short Merged Off-line Handwritten Chinese Character String Segmentation Algorithm Using Hidden Markov Model
  • Author
    Jiang, Zhiwei ; Ding, Xiaoqing ; Liu, Changsong ; Wang, Yanwei
  • Author_Institution
    Dept. of Electron. Eng., Tsinghua Univ., Beijing, China
  • fYear
    2011
  • fDate
    18-21 Sept. 2011
  • Firstpage
    668
  • Lastpage
    672
  • Abstract
    The hidden Markov model (HMM) is widely used to segment sequential data in speech recognition and DNA sequence analysis. By the same principle, it can also be used to segment short merged off-line handwritten Chinese character strings, a difficult problem that arises often in practice. Because HMMs are still uncommon in this field, this paper introduces a novel HMM-based algorithm for this segmentation problem. The algorithm achieves practically usable performance even when the 3755 character classes are merged into similar-character classes numbering only 1% of the original classes, and it also shows great potential for segmenting long text lines.
  • Keywords
    handwritten character recognition; hidden Markov models; image segmentation; optical character recognition; DNA sequence analysis; hidden Markov model; merged offline handwritten Chinese character string segmentation; sequential data; speech recognition; Algorithm design and analysis; Character recognition; Decoding; Handwriting recognition; Hidden Markov models; Merging; Training; HMM; merged handwritten Chinese characters; merging similar characters; string segmentation
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Document Analysis and Recognition (ICDAR), 2011 International Conference on
  • Conference_Location
    Beijing
  • ISSN
    1520-5363
  • Print_ISBN
    978-1-4577-1350-7
  • Electronic_ISBN
    1520-5363
  • Type
    conf
  • DOI
    10.1109/ICDAR.2011.140
  • Filename
    6065395