• DocumentCode
    3591214
  • Title

    Deriving kernels from MLP probability estimators for large categorization problems

  • Author

    Titov, Ivan ; Henderson, James

  • Author_Institution
    Dept. of Comput. Sci., Geneva Univ., Switzerland
  • Volume
    2
  • fYear
    2005
  • Firstpage
    937
  • Abstract
    In multi-class categorization problems with a very large or unbounded number of classes, it is often not computationally feasible to train and/or test a kernel-based classifier. One solution is to use a fast computation to pre-select a subset of the classes for reranking with a kernel method, but even then tractability can be a problem. We investigate using trained multilayer perceptron probability estimators to derive appropriate kernels for such problems. We propose a kernel derivation method which is specifically designed for reranking problems, and a more efficient variant of this method which is specifically designed for neural networks with large numbers of output units. When applied to a neural network model of natural language parsing, these new methods achieve state-of-the-art performance which improves over the original model.
  • Keywords
    learning (artificial intelligence); multilayer perceptrons; neural nets; pattern classification; kernel-based classifier; multi-class categorization problems; natural language parsing; neural networks; trained multilayer perceptron probability estimators; Computational efficiency; Computer science; Data mining; Electronic mail; Kernel; Multilayer perceptrons; Natural languages; Neural networks; Proposals; Testing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
  • Print_ISBN
    0-7803-9048-2
  • Type

    conf

  • DOI
    10.1109/IJCNN.2005.1555978
  • Filename
    1555978