DocumentCode :
1564280
Title :
A Combined Weight method in Automatic Classiflcation of Chinese Text
Author :
Liao, Shasha ; Jiang, Minghu
Author_Institution :
Dept. of Chinese Language, Tsinghua Univ., Beijing
Volume :
2
fYear :
2005
Firstpage :
625
Lastpage :
630
Abstract :
In this paper, we set a shielded level in a concept tree to use both the concept attributes from a semantic dictionary and the Chinese words to make the feature set. After comparing the weight theories and classification precise, of the eight methods, we give a new selection method, the CHI-MCOR weight method, which is derived from two normal methods which present well in our experiments. Our former experiment result shows that if we can set a proper shielded level, we can not only reduce the feature dimension but also improve the classification precise. The later result shows that the combined weight method makes a good balance between the fuzzy words which have a high occurrence and the dividing words which have a middle or low occurrence, and the classification precise is higher than any one of the weight methods
Keywords :
dictionaries; natural languages; word processing; Chinese text; automatic classification; combined weight method; semantic dictionary; Classification tree analysis; Cognitive science; Computational linguistics; Data mining; Dictionaries; Frequency; Functional analysis; Natural languages; Organizing; Reflection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks and Brain, 2005. ICNN&B '05. International Conference on
Conference_Location :
Beijing
Print_ISBN :
0-7803-9422-4
Type :
conf
DOI :
10.1109/ICNNB.2005.1614711
Filename :
1614711
Link To Document :
بازگشت