• DocumentCode
    3528265
  • Title

    Learning sequence kernels

  • Author

    Cortes, Corinna ; Mohri, Mehryar ; Rostamizadeh, Afshin

  • Author_Institution
    Google Res., New York, NY
  • fYear
    2008
  • fDate
    16-19 Oct. 2008
  • Firstpage
    2
  • Lastpage
    8
  • Abstract
    Kernel methods are used to tackle a variety of learning tasks including classification, regression, ranking, clustering, and dimensionality reduction. The appropriate choice of a kernel is often left to the user. But, poor selections may lead to a sub-optimal performance. Instead, sample points can be used to learn a kernel function appropriate for the task by selecting one out of a family of kernels determined by the user. This paper considers the problem of learning sequence kernel functions, an important problem for applications in computational biology, natural language processing, document classification and other text processing areas. For most kernel-based learning techniques, the kernels selected must be positive definite symmetric, which, for sequence data, are found to be rational kernels. We give a general formulation of the problem of learning rational kernels and prove that a large family of rational kernels can be learned efficiently using a simple quadratic program both in the context of support vector machines and kernel ridge regression. This improves upon previous work that generally results in a more costly semi-definite or quadratically constrained quadratic program. Furthermore, in the specific case of kernel ridge regression, we give an alternative solution based on a closed-form solution for the optimal kernel matrix. We also report results of experiments with our kernel learning techniques in classification and regression tasks.
  • Keywords
    biology computing; learning (artificial intelligence); natural language processing; support vector machines; text analysis; classification; clustering; computational biology; dimensionality reduction; document classification; kernel ridge regression; learning; natural language processing; optimal kernel matrix; ranking; regression; sequence kernels; support vector machines; text processing; Closed-form solution; Computational biology; Kernel; Machine learning; Natural language processing; Quadratic programming; Sequences; Support vector machines; Symmetric matrices; Text processing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Machine Learning for Signal Processing, 2008. MLSP 2008. IEEE Workshop on
  • Conference_Location
    Cancun
  • ISSN
    1551-2541
  • Print_ISBN
    978-1-4244-2375-0
  • Electronic_ISBN
    1551-2541
  • Type

    conf

  • DOI
    10.1109/MLSP.2008.4685446
  • Filename
    4685446