Title :
Hand-printed Chinese character recognition via machine learning
Author :
Amin, Adnan ; Kim, Seung-Gwon ; Sammut, Claude
Author_Institution :
Sch. of Comput. Sci. & Eng., New South Wales Univ., Kensington, NSW, Australia
Abstract :
Recognition of Chinese characters has been an area of great interest for many years, and a large number of research papers and reports have already been published in this area. There are several major problems with Chinese character recognition: Chinese characters are distinct and ideographic, the character size is very large and a lot of structurally similar characters exist in the character set. Thus, classification criteria are difficult to generate. This paper presents a new technique for the recognition of hand-printed Chinese characters using machine learning C4.5. Conventional methods have relied on hand-constructed dictionaries which are tedious to construct and difficult to make tolerant to variation in writing styles. The paper also discusses Chinese character recognition using dominant point feature extraction and C4.5. The system was tested with 900 characters (each character has 40 samples) and the rate of recognition obtained was 84%
Keywords :
character recognition; feature extraction; learning (artificial intelligence); C4.5; character set; character size; classification criteria; dominant point feature extraction; hand-printed Chinese character recognition; machine learning; structurally similar characters; writing styles; Algorithm design and analysis; Character recognition; Computer science; Decision trees; Dictionaries; Feature extraction; Machine learning; Machine learning algorithms; System testing; Writing;
Conference_Titel :
Document Analysis and Recognition, 1997., Proceedings of the Fourth International Conference on
Conference_Location :
Ulm
Print_ISBN :
0-8186-7898-4
DOI :
10.1109/ICDAR.1997.619839