Title :
A novel higher-order semantic kernel for text classification
Author :
Altlnel, Berna ; Ganiz, Murat Can ; Diri, B.
Author_Institution :
Dept. of Comput. Eng., Marmara Univ., Istanbul, Turkey
Abstract :
In conventional text categorization algorithms, documents are symbolized as “bag of words” (BOW) with the fact that documents are supposed to be independent from each other. While this approach simplifies the models, it ignores the semantic information between terms of each document. In this study, we develop a novel method to measure semantic similarity based on higher-order dependencies between documents. We propose a kernel for Support Vector Machines (SVM) algorithm using these dependencies which is called Higher-Order Semantic Kernel. With the aim of presenting comparative performance of Higher-Order Semantic Kernel we performed many experiments not only with our algorithm but also with existing traditional first-order kernels such as Polynomial Kernel, Radial Basis Function Kernel, and Linear Kernel. The experiments using Higher-Order Semantic Kernel on several well-known datasets show that classification performance improves significantly over the first-order methods.
Keywords :
pattern classification; support vector machines; text analysis; BOW; SVM algorithm; bag of words; higher-order dependencies; higher-order semantic kernel; semantic information; semantic similarity measurement; support vector machines algorithm; text categorization algorithms; text classification; Accuracy; Kernel; Polynomials; Semantics; Support vector machines; Text categorization; Training; Machine learning; higher order paths; semantic kernel; support vector machine; text classification;
Conference_Titel :
Electronics, Computer and Computation (ICECCO), 2013 International Conference on
Conference_Location :
Ankara
DOI :
10.1109/ICECCO.2013.6718267