Title :
A kernel-based feature weighting for text classification
Author :
Wittek, Peter ; Tan, Chew Lim
Author_Institution :
Dept. of Comput. Sci., Nat. Univ. of Singapore, Singapore, Singapore
Abstract :
Text classification by support vector machines can benefit from semantic smoothing kernels that regard semantic relations among index terms while computing similarity. Adding expansion terms to the vector representation can also improve effectiveness. However, existing semantic smoothing kernels do not employ term expansion. This paper proposes a new non-linear kernel for text classification to exploit semantic relations between terms to add weighted expansion terms.
Keywords :
classification; computational linguistics; feature extraction; support vector machines; text analysis; vocabulary; index term; nonlinear kernel-based feature weighting; semantic smoothing; support vector machine; text classification; vector representation; Casting; Computer networks; Kernel; Neural networks; Smoothing methods; Support vector machine classification; Support vector machines; Text categorization; Thesauri; Training data;
Conference_Titel :
Neural Networks, 2009. IJCNN 2009. International Joint Conference on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-3548-7
Electronic_ISBN :
1098-7576
DOI :
10.1109/IJCNN.2009.5179022