DocumentCode
2832717
Title
A Multilayer Method of Text Feature Extraction Based on CILIN
Author
Li, Xin-fu ; Zhao, Lei-lei
Author_Institution
Fac. of Math. & Comput., Hebei Univ., Baoding
fYear
2008
fDate
Aug. 29 2008-Sept. 2 2008
Firstpage
48
Lastpage
52
Abstract
The feature extraction is the most critical technology of text categorization. The method of feature extraction from Chinese text based on CILIN is different from the conventional feature extraction, which uses two feature extraction methods. This method is good at dealing with synonyms and polysemes, and reducing the dimension. Firstly, it uses the method of feature extraction from Chinese text based on CILIN to analyze the meaning of key words. Secondly, use the mutual information to extract the feature, it can give the relation between class and lemma. The experiment results proposed that comprehend to the meaning of key words can distinctively improve the text classification precision.
Keywords
feature extraction; text analysis; CILIN; Chinese text; multilayer method; text categorization; text classification precision; text feature extraction; Feature extraction; Frequency; Mathematics; Mutual information; Niobium; Nonhomogeneous media; Statistics; Support vector machine classification; Support vector machines; Text categorization; CILIN; feature extraction; text categorization;
fLanguage
English
Publisher
ieee
Conference_Titel
Computer Science and Information Technology, 2008. ICCSIT '08. International Conference on
Conference_Location
Singapore
Print_ISBN
978-0-7695-3308-7
Type
conf
DOI
10.1109/ICCSIT.2008.57
Filename
4624831
Link To Document