DocumentCode :
126870
Title :
Kernel learning method for distance-based classification of categorical data
Author :
Lifei Chen ; Gongde Guo ; Shengrui Wang ; Xiangzeng Kong
Author_Institution :
Sch. of Math. & Comput. Sci., Fujian Normal Univ., Fuzhou, China
fYear :
2014
fDate :
8-10 Sept. 2014
Firstpage :
1
Lastpage :
7
Abstract :
Kernel-based methods have become popular in machine learning; however, they are typically designed for numeric data. These methods are established in vector spaces, which are undefined for categorical data. In this paper, we propose a new kind of kernel trick, showing that mapping of categorical samples into kernel spaces can be alternatively described as assigning a kernel-based weight to each categorical attribute of the input space, so that common distance measures can be employed. A data-driven approach is then proposed to kernel bandwidth selection by optimizing feature weights. We also make use of the kernel-based distance measure to effectively extend nearest-neighbor classification to classify categorical data. Experimental results on real-world data sets show the outstanding performance of this approach compared to that obtained in the original input space.
Keywords :
learning (artificial intelligence); pattern classification; categorical attribute; categorical data classification; categorical samples; common distance measures; data-driven approach; distance-based classification; feature weight optimization; input space; kernel bandwidth selection; kernel learning method; kernel spaces; kernel-based distance measure; kernel-based weight; machine learning; nearest-neighbor classification; numeric data; vector spaces; Bandwidth; Educational institutions; Kernel; Optimization; Training; Vectors; Weight measurement;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Intelligence (UKCI), 2014 14th UK Workshop on
Conference_Location :
Bradford
Type :
conf
DOI :
10.1109/UKCI.2014.6930159
Filename :
6930159
Link To Document :
بازگشت