• DocumentCode
    2210262
  • Title

    Transfer Learning on Heterogenous Feature Spaces via Spectral Transformation

  • Author

    Shi, Xiaoxiao ; Liu, Qi ; Fan, Wei ; Yu, Philip S. ; Zhu, Ruixin

  • Author_Institution
    Dept. of Comput. Sci., Univ. of Illinois at Chicago, Chicago, IL, USA
  • fYear
    2010
  • fDate
    13-17 Dec. 2010
  • Firstpage
    1049
  • Lastpage
    1054
  • Abstract
    Labeled examples are often expensive and time-consuming to obtain. One practically important problem is: can the labeled data from other related sources help predict the target task, even if they have (a) different feature spaces (e.g., image vs. text data), (b) different data distributions, and (c) different output spaces? This paper proposes a solution and discusses the conditions where this is possible and highly likely to produce better results. It works by first using spectral embedding to unify the different feature spaces of the target and source data sets, even when they have completely different feature spaces. The principle is to cast into an optimization objective that preserves the original structure of the data, while at the same time, maximizes the similarity between the two. Second, a judicious sample selection strategy is applied to select only those related source examples. At last, a Bayesian-based approach is applied to model the relationship between different output spaces. The three steps can bridge related heterogeneous sources in order to learn the target task. Among the 12 experiment data sets, for example, the images with wavelet-transformed-based features are used to predict another set of images whose features are constructed from color-histogram space. By using these extracted examples from heterogeneous sources, the models can reduce the error rate by as much as ~50%, compared with the methods using only the examples from the target task.
  • Keywords
    Bayes methods; data mining; feature extraction; image colour analysis; learning (artificial intelligence); optimisation; spectral analysis; wavelet transforms; Bayesian-based approach; color histogram space; data distribution; heterogeneous feature space; image data; labeled data; spectral transformation; text data; wavelet transform; feature space; heterogeneous; spectral; transfer learning;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining (ICDM), 2010 IEEE 10th International Conference on
  • Conference_Location
    Sydney, NSW
  • ISSN
    1550-4786
  • Print_ISBN
    978-1-4244-9131-5
  • Electronic_ISBN
    1550-4786
  • Type

    conf

  • DOI
    10.1109/ICDM.2010.65
  • Filename
    5694083