DocumentCode :
387538
Title :
An approach to machine learning of Chinese Pinyin-to-character conversion for small-memory application
Author :
Liu, Bing-quan ; Wang, Xiao-long
Author_Institution :
Sch. of Comput. Sci. & Technol., Harbin Inst. of Technol., China
Volume :
3
fYear :
2002
fDate :
2002
Firstpage :
1287
Abstract :
Chinese Pinyin-to-character conversion is used in Chinese character input through keyboard and Chinese speech recognition. The key of this kind of system is machine learning that fits system for specific user. In this paper, an effective approach of machine learning of Chinese Pinyin-to-character conversion for small-memory application is presented. The approach is based on iterative new word identification and word frequency increasing that results in more accurate segmentation of Chinese character gradually and satisfy the need of user finally. Applying proposed machine learning to Chinese character input system through keyboard improves accuracy of Pinyin-to-character conversion from 90% up to 98%. Such a system can run in very small memory (limited in 120 K) and satisfy the need of small-memory platform. With rapid development of digital appliances such as PDA, mobile telephone, intelligent refrigerator and etc., and with development of embedded operating system, Pinyin-to-character conversion presented in this paper has found its new application.
Keywords :
character recognition; learning (artificial intelligence); speech recognition; Chinese character; Chinese character input through keyboard; Chinese speech recognition; iterative new word identification; machine learning; pinyin-to-character conversion; small-memory application; Frequency; Home appliances; Iterative methods; Keyboards; Learning systems; Machine learning; Operating systems; Refrigeration; Speech recognition; Telephony;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Cybernetics, 2002. Proceedings. 2002 International Conference on
Print_ISBN :
0-7803-7508-4
Type :
conf
DOI :
10.1109/ICMLC.2002.1167411
Filename :
1167411
Link To Document :
بازگشت