DocumentCode
1994497
Title
Generation of hierarchical dictionary for stroke-order free Kanji handwriting recognition based on substroke HMM
Author
Nakai, Mitsuru ; Shimodaira, Hiroshi ; Sagayama, Shigeki
Author_Institution
Graduate Sch. of Inf. Sci., Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
fYear
2003
fDate
3-6 Aug. 2003
Firstpage
514
Abstract
This paper describes a method of generating a Kanji hierarchical structured dictionary for stroke-number and stroke-order free handwriting recognition based on sub-stroke HMM. In stroke-based methods, a large number of stroke-order variations can be easily expressed by just adding different stroke sequences to the dictionary and it is not necessary to train new reference patterns. The hierarchical structured dictionary has an advantage that thousands of stroke-order variations of Kanji characters can be produced using a small number of stroke-order rules defining Kanji parts. Moreover, the recognition speed is fast since common sequences are shared in a substroke network, even if the total number of stroke-order combinations becomes enormous practically. In experiments, 300 different stroke-order rules of Kanji parts were statistically chosen by using 60 writers´ handwritings of 1016 educational Kanji characters. By adding these new stroke-order rules to the dictionary, about 9000 variations of different stroke-orders were generated for 2965 JIS 1st level Kanji characters. As a result, we successfully improved the recognition accuracy from 82.6% to 90.2% for stroke-order free handwritings.
Keywords
dictionaries; feature extraction; handwriting recognition; hidden Markov models; natural languages; Kanji character; Kanji structured dictionary; hidden Markov model; hierarchical dictionary; recognition speed; reference pattern; stroke sequence; stroke-number free handwriting recognition; stroke-order free Kanji handwriting recognition; substroke HMM; Character generation; Character recognition; Data mining; Decoding; Dictionaries; Handwriting recognition; Hidden Markov models; Information science; Statistical analysis; Text analysis;
fLanguage
English
Publisher
ieee
Conference_Titel
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN
0-7695-1960-1
Type
conf
DOI
10.1109/ICDAR.2003.1227718
Filename
1227718
Link To Document