• DocumentCode
    2984486
  • Title

    Handling Ambiguity via Input-Output Kernel Learning

  • Author

    Xinxing Xu ; Tsang, Ivor W. ; Dong Xu

  • Author_Institution
    Sch. of Comput. Eng., Nanyang Technol. Univ., Singapore, Singapore
  • fYear
    2012
  • fDate
    10-13 Dec. 2012
  • Firstpage
    725
  • Lastpage
    734
  • Abstract
    Data ambiguities exist in many data mining and machine learning applications such as text categorization and image retrieval. For instance, it is generally beneficial to utilize the ambiguous unlabeled documents to learn a more robust classifier for text categorization under the semi-supervised learning setting. To handle general data ambiguities, we present a unified kernel learning framework named Input-Output Kernel Learning (IOKL). Based on our framework, we further propose a novel soft margin group sparse Multiple Kernel Learning (MKL) formulation by introducing a group kernel slack variable to each group of base input-output kernels. Moreover, an efficient block-wise coordinate descent algorithm with an analytical solution for the kernel combination coefficients is developed to solve the proposed formulation. We conduct comprehensive experiments on benchmark datasets for both semi-supervised learning and multiple instance learning tasks, and also apply our IOKL framework to a computer vision application called text-based image retrieval on the NUS-WIDE dataset. Promising results demonstrate the effectiveness of our proposed IOKL framework.
  • Keywords
    data mining; learning (artificial intelligence); operating system kernels; text analysis; NUS-WIDE dataset; ambiguous unlabeled documents; benchmark datasets; block wise coordinate descent algorithm; computer vision application; data mining; general data ambiguity; group kernel slack; input output kernel learning; kernel combination coefficients; kernel learning framework; machine learning application; multiple instance learning task; multiple kernel learning formulation; robust classifier; semisupervised learning setting; text based image retrieval; text categorization; Kernel; Linear programming; Semisupervised learning; Support vector machines; Training; Uncertainty; Vectors; Group Multiple Kernel Learning; Input-Output Kernel Learning; Multi-Instance Learning; Semi-supervised Learning; Text-based Image Retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4673-4649-8
  • Type

    conf

  • DOI
    10.1109/ICDM.2012.105
  • Filename
    6413856