DocumentCode :
2060632
Title :
Hand-printed Chinese character recognition via machine learning
Author :
Amin, Adnan ; Kim, Seung-Gwon ; Sammut, Claude
Author_Institution :
Sch. of Comput. Sci. & Eng., New South Wales Univ., Kensington, NSW, Australia
Volume :
1
fYear :
1997
fDate :
18-20 Aug 1997
Firstpage :
190
Abstract :
Recognition of Chinese characters has been an area of great interest for many years, and a large number of research papers and reports have already been published in this area. There are several major problems with Chinese character recognition: Chinese characters are distinct and ideographic, the character size is very large and a lot of structurally similar characters exist in the character set. Thus, classification criteria are difficult to generate. This paper presents a new technique for the recognition of hand-printed Chinese characters using machine learning C4.5. Conventional methods have relied on hand-constructed dictionaries which are tedious to construct and difficult to make tolerant to variation in writing styles. The paper also discusses Chinese character recognition using dominant point feature extraction and C4.5. The system was tested with 900 characters (each character has 40 samples) and the rate of recognition obtained was 84%
Keywords :
character recognition; feature extraction; learning (artificial intelligence); C4.5; character set; character size; classification criteria; dominant point feature extraction; hand-printed Chinese character recognition; machine learning; structurally similar characters; writing styles; Algorithm design and analysis; Character recognition; Computer science; Decision trees; Dictionaries; Feature extraction; Machine learning; Machine learning algorithms; System testing; Writing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
Type :
conf
DOI :
10.1109/ICDAR.1997.619839
Filename :
619839
Link To Document :
بازگشت