Title :
Weight sharing on naive Bayes document model
Author :
Saito, Kazumi ; Nakano, Ryohei
Author_Institution :
NTT Commun. Sci. Labs., Kyoto, Japan
fDate :
31 July-4 Aug. 2005
Abstract :
In this paper, we study weight sharing on the naive Bayes document model. Firstly we consider splitting words into a relatively small number of groups such that words in each group have the same parameter value. This problem can be regarded as a probabilistic parameter sharing task. In this task, we formalize the problem in terms of maximum likelihood estimation, and then propose an algorithm for this purpose. Secondly we focus on an adaptive hyperparameter estimation problem based on prior distributions constructed by using such word groups. This problem can be regarded as a hyperparameter sharing task. In this task, we describe a framework and algorithm, which enables to derive the unique optimal solution in the context of leave-one-out cross validation. In our experiments using a benchmark document set called webkb, we show a series of simulation results using the proposed algorithms.
Keywords :
Bayes methods; data mining; document handling; maximum likelihood estimation; hyperparameter estimation; hyperparameter sharing; leave-one-out cross validation; maximum likelihood estimation; naive Bayes document model; probabilistic parameter sharing; webkb; Electronic mail; Frequency conversion; Laboratories; Maximum likelihood estimation; Neural networks; Polynomials; Robustness; Text mining; Vocabulary;
Conference_Titel :
Neural Networks, 2005. IJCNN '05. Proceedings. 2005 IEEE International Joint Conference on
Print_ISBN :
0-7803-9048-2
DOI :
10.1109/IJCNN.2005.1555895