DocumentCode
2984486
Title
Handling Ambiguity via Input-Output Kernel Learning
Author
Xinxing Xu ; Tsang, Ivor W. ; Dong Xu
Author_Institution
Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
fYear
2012
fDate
10-13 Dec. 2012
Firstpage
725
Lastpage
734
Abstract
Data ambiguities exist in many data mining and machine learning applications such as text categorization and image retrieval. For instance, it is generally beneficial to utilize the ambiguous unlabeled documents to learn a more robust classifier for text categorization under the semi-supervised learning setting. To handle general data ambiguities, we present a unified kernel learning framework named Input-Output Kernel Learning (IOKL). Based on our framework, we further propose a novel soft margin group sparse Multiple Kernel Learning (MKL) formulation by introducing a group kernel slack variable to each group of base input-output kernels. Moreover, an efficient block-wise coordinate descent algorithm with an analytical solution for the kernel combination coefficients is developed to solve the proposed formulation. We conduct comprehensive experiments on benchmark datasets for both semi-supervised learning and multiple instance learning tasks, and also apply our IOKL framework to a computer vision application called text-based image retrieval on the NUS-WIDE dataset. Promising results demonstrate the effectiveness of our proposed IOKL framework.
Keywords
data mining; learning (artificial intelligence); operating system kernels; text analysis; NUS-WIDE dataset; ambiguous unlabeled documents; benchmark datasets; block wise coordinate descent algorithm; computer vision application; data mining; general data ambiguity; group kernel slack; input output kernel learning; kernel combination coefficients; kernel learning framework; machine learning application; multiple instance learning task; multiple kernel learning formulation; robust classifier; semisupervised learning setting; text based image retrieval; text categorization; Kernel; Linear programming; Semisupervised learning; Support vector machines; Training; Uncertainty; Vectors; Group Multiple Kernel Learning; Input-Output Kernel Learning; Multi-Instance Learning; Semi-supervised Learning; Text-based Image Retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining (ICDM), 2012 IEEE 12th International Conference on
Conference_Location
Brussels
ISSN
1550-4786
Print_ISBN
978-1-4673-4649-8
Type
conf
DOI
10.1109/ICDM.2012.105
Filename
6413856
Link To Document