Title of article :
GAMoN: Discovering M-of-image hypotheses for text classification by a lattice-based Genetic Algorithm Original Research Article
Author/Authors :
Veronica L. Policicchio، نويسنده , , Adriana Pietramala، نويسنده , , Pasquale Rullo، نويسنده ,
Issue Information :
روزنامه با شماره پیاپی سال 2012
Pages :
35
From page :
61
To page :
95
Abstract :
While there has been a long history of rule-based text classifiers, to the best of our knowledge no M-of-N-based approach for text categorization has so far been proposed. In this paper we argue that M-of-N hypotheses are particularly suitable to model the text classification task because of the so-called “family resemblance” metaphor: “the members (i.e., documents) of a family (i.e., category) share some small number of features, yet there is no common feature among all of them. Nevertheless, they resemble each other”. Starting from this conjecture, we provide a sound extension of the M-of-N approach with negation and disjunction, called M-of-image, which enables to best fit the true structure of the data. Based on a thorough theoretical study, we show that the M-of-image hypothesis space has two partial orders that form complete lattices. GAMoN is the task-specific Genetic Algorithm (GA) which, by exploiting the lattice-based structure of the hypothesis space, efficiently induces accurate M-of-image hypotheses.
Journal title :
Artificial Intelligence
Serial Year :
2012
Journal title :
Artificial Intelligence
Record number :
1207924
Link To Document :
بازگشت