Title of article :
A discrete mixture-based kernel for SVMs: Application to spam and image categorization
Author/Authors :
Nizar Bouguila، نويسنده , , Ola Amayri، نويسنده ,
Issue Information :
دوماهنامه با شماره پیاپی سال 2009
Pages :
12
From page :
631
To page :
642
Abstract :
In this paper, we investigate the problem of training support vector machines (SVMs) on count data. Multinomial Dirichlet mixture models allow us to model efficiently count data. On the other hand, SVMs permit good discrimination. We propose, then, a hybrid model that appropriately combines their advantages. Finite mixture models are introduced, as an SVM kernel, to incorporate prior knowledge about the nature of data involved in the problem at hand. For the learning of our mixture model, we propose a deterministic annealing component-wise EM algorithm mixed with a minimum description length type criterion. In the context of this model, we compare different kernels. Through some applications involving spam and image database categorization, we find that our data-driven kernel performs better.
Keywords :
MDL , SPAM , Image database , SVM , Maximum likelihood , EM , CEMM , Deterministic annealing , finite mixture models , Multinomial dirichlet , Kernels
Journal title :
Information Processing and Management
Serial Year :
2009
Journal title :
Information Processing and Management
Record number :
1228988
Link To Document :
بازگشت