DocumentCode
442102
Title
Weighted kernel functions for SVM learning in string domains: a distance function viewpoint
Author
Vanschoenwinkel ; Liu, Feng ; Manderick, Bernard
Author_Institution
Dept. of Informatics, Vrije Univ. Brussel, Belgium
Volume
7
fYear
2005
fDate
18-21 Aug. 2005
Firstpage
4226
Abstract
This paper extends the idea of weighted distance functions to kernels and support vector machines. Here, we focus on applications that rely on sliding a window over a sequence of string data. For this type of problems it is argued that a symbolic, context-based representation of the data should be preferred over a continuous, real format as this is a much more intuitive setting for working with (weighted) distance functions. It is shown how a weighted string distance can be decomposed and subsequently used in different kernel functions and how these kernel functions correspond to inner products between real vectors. As a case-study named entity recognition is used with information gain ratio as a weighting scheme.
Keywords
data structures; learning (artificial intelligence); sequences; string matching; support vector machines; SVM learning; data representation; information gain ratio; named entity recognition; string data sequence; string domain; symbolic context-based representation; weighted distance function; weighted kernel functions; Computational modeling; Cybernetics; Discrete transforms; Informatics; Kernel; Machine learning; Position measurement; Space technology; Support vector machines; In formation Gain Ratio; Kernel Functions; Metrics; Named Entity Recognition; Support Vector Machines;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location
Guangzhou, China
Print_ISBN
0-7803-9091-1
Type
conf
DOI
10.1109/ICMLC.2005.1527679
Filename
1527679
Link To Document