Title :
New Words Recognition Algorithm and Application Based on Micro-Blog Hot
Author :
Qing, Zhou ; Yewang, Chen
Abstract :
New word identification is one of the difficult problems of Chinese information processing. In order to improve the efficiency of new word recognition, this paper proposed a new method to identify new word based on micro-blog message´s characteristic. First of all, the micro-blog message is segmented by using N-Gram, then we filter the candidate strings to obtain the candidate words, finally we construct an objective function based on characteristics of micro-blog message to identify new word. Compared with other new word identification methods, the experimental results show that the method proposed in this paper can significantly improve the effect of Chinese new word identification.
Keywords :
Blogs; Computational linguistics; Dictionaries; Feature extraction; Information processing; Linear programming; Vocabulary; Chinese information processing; new word identification;
Conference_Titel :
Measuring Technology and Mechatronics Automation (ICMTMA), 2015 Seventh International Conference on
Conference_Location :
Nanchang, China
Print_ISBN :
978-1-4673-7142-1
DOI :
10.1109/ICMTMA.2015.173