Title :
Chinese chunking based on Naive Bayes model
Author :
Sun, Guang-Lu ; Lang, Fei ; Liu, Changxing ; Chen, Zhifeng
Author_Institution :
Harbin Univ. of Sci. & Technol., Harbin, China
Abstract :
A new Chinese chunking algorithm is proposed based on Naive Bayes model and semantic features. Through the analysis of Chinese chunking task, Naive Bayes model that combines different types of features were applied for its rapid performance of training and test. Semantic features were utilized to further improve the accuracy. Experimental results on the Chinese chunking corpus of Chinese Penn Treebank show that the algorithm achieves impressive accuracy of 92.8% in terms of the F-score.
Keywords :
Bayes methods; natural language processing; text analysis; Chinese Penn Treebank; Chinese chunking algorithm; Naive Bayes model; natural language processing; semantic features; text analysis; text chunking; Computational modeling; Hidden Markov models; IP networks; Silicon; Training; Chinese chunking; Naïve bayes; Semantic features;
Conference_Titel :
Strategic Technology (IFOST), 2010 International Forum on
Conference_Location :
Ulsan
Print_ISBN :
978-1-4244-9038-7
Electronic_ISBN :
978-1-4244-9036-3
DOI :
10.1109/IFOST.2010.5667958