• DocumentCode
    1994497
  • Title

    Generation of hierarchical dictionary for stroke-order free Kanji handwriting recognition based on substroke HMM

  • Author

    Nakai, Mitsuru ; Shimodaira, Hiroshi ; Sagayama, Shigeki

  • Author_Institution
    Graduate Sch. of Inf. Sci., Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
  • fYear
    2003
  • fDate
    3-6 Aug. 2003
  • Firstpage
    514
  • Abstract
    This paper describes a method of generating a Kanji hierarchical structured dictionary for stroke-number and stroke-order free handwriting recognition based on sub-stroke HMM. In stroke-based methods, a large number of stroke-order variations can be easily expressed by just adding different stroke sequences to the dictionary and it is not necessary to train new reference patterns. The hierarchical structured dictionary has an advantage that thousands of stroke-order variations of Kanji characters can be produced using a small number of stroke-order rules defining Kanji parts. Moreover, the recognition speed is fast since common sequences are shared in a substroke network, even if the total number of stroke-order combinations becomes enormous practically. In experiments, 300 different stroke-order rules of Kanji parts were statistically chosen by using 60 writers´ handwritings of 1016 educational Kanji characters. By adding these new stroke-order rules to the dictionary, about 9000 variations of different stroke-orders were generated for 2965 JIS 1st level Kanji characters. As a result, we successfully improved the recognition accuracy from 82.6% to 90.2% for stroke-order free handwritings.
  • Keywords
    dictionaries; feature extraction; handwriting recognition; hidden Markov models; natural languages; Kanji character; Kanji structured dictionary; hidden Markov model; hierarchical dictionary; recognition speed; reference pattern; stroke sequence; stroke-number free handwriting recognition; stroke-order free Kanji handwriting recognition; substroke HMM; Character generation; Character recognition; Data mining; Decoding; Dictionaries; Handwriting recognition; Hidden Markov models; Information science; Statistical analysis; Text analysis;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
  • Print_ISBN
    0-7695-1960-1
  • Type

    conf

  • DOI
    10.1109/ICDAR.2003.1227718
  • Filename
    1227718