DocumentCode :
1994497
Title :
Generation of hierarchical dictionary for stroke-order free Kanji handwriting recognition based on substroke HMM
Author :
Nakai, Mitsuru ; Shimodaira, Hiroshi ; Sagayama, Shigeki
Author_Institution :
Graduate Sch. of Inf. Sci., Japan Adv. Inst. of Sci. & Technol., Ishikawa, Japan
fYear :
2003
fDate :
3-6 Aug. 2003
Firstpage :
514
Abstract :
This paper describes a method of generating a Kanji hierarchical structured dictionary for stroke-number and stroke-order free handwriting recognition based on sub-stroke HMM. In stroke-based methods, a large number of stroke-order variations can be easily expressed by just adding different stroke sequences to the dictionary and it is not necessary to train new reference patterns. The hierarchical structured dictionary has an advantage that thousands of stroke-order variations of Kanji characters can be produced using a small number of stroke-order rules defining Kanji parts. Moreover, the recognition speed is fast since common sequences are shared in a substroke network, even if the total number of stroke-order combinations becomes enormous practically. In experiments, 300 different stroke-order rules of Kanji parts were statistically chosen by using 60 writers´ handwritings of 1016 educational Kanji characters. By adding these new stroke-order rules to the dictionary, about 9000 variations of different stroke-orders were generated for 2965 JIS 1st level Kanji characters. As a result, we successfully improved the recognition accuracy from 82.6% to 90.2% for stroke-order free handwritings.
Keywords :
dictionaries; feature extraction; handwriting recognition; hidden Markov models; natural languages; Kanji character; Kanji structured dictionary; hidden Markov model; hierarchical dictionary; recognition speed; reference pattern; stroke sequence; stroke-number free handwriting recognition; stroke-order free Kanji handwriting recognition; substroke HMM; Character generation; Character recognition; Data mining; Decoding; Dictionaries; Handwriting recognition; Hidden Markov models; Information science; Statistical analysis; Text analysis;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 2003. Proceedings. Seventh International Conference on
Print_ISBN :
0-7695-1960-1
Type :
conf
DOI :
10.1109/ICDAR.2003.1227718
Filename :
1227718
Link To Document :
بازگشت