Title :
A classification method of Vietnamese news events based on maximum entropy model
Author :
Li-juan, Zhu ; Feng, Zhou ; Qing-qing, Pan ; Xin, Yan ; Zheng-tao, Yu
Author_Institution :
Institute of Information Engineering and Automation, Kunming University of Science and Technology, Kunming 650500, China
Abstract :
Based on the characteristics of Vietnamese news texts and selection of Vietnamese words and phrases, part of speech, named entities, news titles, keywords and so on, this paper proposes a classification method of Vietnamese news events based on the maximum entropy model. This method selects the named entities and news keywords in Vietnamese news titles and the event trigger words in the key sentences corresponding to the keywords as the news classification features, and adopts the maximum entropy model to achieve classification. Furthermore, this paper collects more than 6000 Vietnamese news texts, which are marked into seven kinds of news event corpora such as politics, economy and culture, etc. and subject to training, and then obtains classification model of Vietnamese news texts and achieves type classification of Vietnamese news events. The experimental results show that the accuracy of the classification method of Vietnamese news events proposed in this paper reaches 96.97%.
Keywords :
Entropy; Feature extraction; Speech; Tagging; Text categorization; Training; Vietnamese; Vietnamese news classification; feature selection; machine learning; maximum entropy;
Conference_Titel :
Control Conference (CCC), 2015 34th Chinese
Conference_Location :
Hangzhou, China
DOI :
10.1109/ChiCC.2015.7260253