• DocumentCode
    110742
  • Title
    Feature Space Independent Semi-Supervised Domain Adaptation via Kernel Matching
  • Author
    Min Xiao; Yuhong Guo
  • Author_Institution
    Dept. of Comput. & Inf. Sci., Temple Univ., Philadelphia, PA, USA
  • Volume
    37
  • Issue
    1
  • fYear
    2015
  • fDate
    Jan. 1 2015
  • Firstpage
    54
  • Lastpage
    66
  • Abstract
    Domain adaptation methods aim to learn a good prediction model in a label-scarce target domain by leveraging labeled patterns from a related source domain where there is a large amount of labeled data. However, in many practical domain adaptation learning scenarios, the feature distribution in the source domain is different from that in the target domain. In the extreme, the two distributions could differ completely when the feature representation of the source domain is totally different from that of the target domain. To address the problems of substantial feature distribution divergence across domains and heterogeneous feature representations of different domains, we propose a novel feature space independent semi-supervised kernel matching method for domain adaptation in this work. Our approach learns a prediction function on the labeled source data while mapping the target data points to similar source data points by matching the target kernel matrix to a submatrix of the source kernel matrix based on a Hilbert-Schmidt Independence Criterion. We formulate this simultaneous learning and mapping process as a non-convex integer optimization problem and present a local minimization procedure for its relaxed continuous form. We evaluate the proposed kernel matching method using both cross-domain sentiment classification tasks of Amazon product reviews and cross-language text classification tasks of Reuters multilingual newswire stories. Our empirical results demonstrate that the proposed kernel matching method consistently and significantly outperforms comparison methods on both cross-domain classification problems with homogeneous feature spaces and cross-domain classification problems with heterogeneous feature spaces.
  • Keywords
    integer programming; learning (artificial intelligence); matrix algebra; minimisation; pattern classification; Hilbert Schmidt independence criterion; Reuters multilingual newswire stories; cross domain classification problems; cross language text classification tasks; feature space independent semisupervised domain adaptation; homogeneous feature spaces; kernel matching method; label-scarce target domain; local minimization procedure; nonconvex integer optimization problem; simultaneous learning and mapping process; source kernel matrix; Adaptation models; Kernel; Laplace equations; Manifolds; Minimization; Optimization; Training; Domain adaptation; heterogeneous feature spaces; kernel matching
  • fLanguage
    English
  • Journal_Title
    Pattern Analysis and Machine Intelligence, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    0162-8828
  • Type
    jour
  • DOI
    10.1109/TPAMI.2014.2343216
  • Filename
    6866177
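
The abstract describes matching the target kernel matrix to a submatrix of the source kernel matrix under a Hilbert-Schmidt Independence Criterion. The sketch below is not the authors' formulation or optimization procedure; it only illustrates, under stated assumptions, how a standard empirical HSIC score between a target kernel matrix and a selected source submatrix could be computed. The function names (`hsic`, `matched_hsic`), the fixed integer `assignment` of target points to source points, and the use of linear kernels in the usage lines are all assumptions made for illustration.

```python
import numpy as np

def centering_matrix(n):
    # H = I - (1/n) * 1 1^T, which removes the mean in feature space.
    return np.eye(n) - np.ones((n, n)) / n

def hsic(K, L):
    # Standard (biased) empirical HSIC estimate: tr(K H L H) / (n - 1)^2.
    # K and L are n x n kernel matrices over the same n paired items.
    n = K.shape[0]
    H = centering_matrix(n)
    return np.trace(K @ H @ L @ H) / (n - 1) ** 2

def matched_hsic(K_target, K_source, assignment):
    # assignment[i] is the (hypothetical) index of the source point that
    # target point i is mapped to. The selected rows and columns form the
    # source submatrix that is compared against the target kernel matrix.
    idx = np.asarray(assignment)
    K_sub = K_source[np.ix_(idx, idx)]
    return hsic(K_target, K_sub)

# Usage with synthetic data (assumed, not from the paper): the target points
# are source points 4, 1, 3 expressed in a rotated feature space, so their
# linear kernel matrix equals the corresponding source submatrix.
rng = np.random.default_rng(0)
X_source = rng.normal(size=(6, 3))
Q, _ = np.linalg.qr(rng.normal(size=(3, 3)))  # random orthogonal map
X_target = X_source[[4, 1, 3]] @ Q
K_source = X_source @ X_source.T
K_target = X_target @ X_target.T
score_true = matched_hsic(K_target, K_source, [4, 1, 3])
score_swapped = matched_hsic(K_target, K_source, [1, 4, 3])
```

Because only kernel matrices enter the score, the two domains never need to share a feature representation, which is the sense in which such a comparison is feature space independent. Searching over assignments, which the abstract formulates as an integer optimization problem with a continuous relaxation, is not shown here.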