Title :
Deriving kernels from MLP probability estimators for large categorization problems
Author :
Titov, Ivan ; Henderson, James
Author_Institution :
Dept. of Comput. Sci., Geneva Univ., Switzerland
Abstract :
In multi-class categorization problems with a very large or unbounded number of classes, it is often not computationally feasible to train and/or test a kernel-based classifier. One solution is to use a fast computation to pre-select a subset of the classes for reranking with a kernel method, but even then tractability can be a problem. We investigate using trained multilayer perceptron probability estimators to derive appropriate kernels for such problems. We propose a kernel derivation method which is specifically designed for reranking problems, and a more efficient variant of this method which is specifically designed for neural networks with large numbers of output units. When applied to a neural network model of natural language parsing, these new methods achieve state-of-the-art performance which improves over the original model.
Keywords :
learning (artificial intelligence); multilayer perceptrons; neural nets; pattern classification; kernel-based classifier; multi-class categorization problems; natural language parsing; neural networks; trained multilayer perceptron probability estimators; Computational efficiency; Computer science; Data mining; Electronic mail; Kernel; Multilayer perceptrons; Natural languages; Neural networks; Proposals; Testing;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1555978