Title :
AVT-NBL: an algorithm for learning compact and accurate naive Bayes classifiers from attribute value taxonomies and data
Author :
Zhang, Jun ; Honavar, Vasant
Author_Institution :
Dept. of Comput. Sci., Iowa State Univ., Ames, IA, USA
Abstract :
In many application domains, there is a need for learning algorithms that can effectively exploit attribute value taxonomies (AVT) - hierarchical groupings of attribute values - to learn compact, comprehensible, and accurate classifiers from data - including data that are partially specified. This paper describes AVT-NBL, a natural generalization of the naive Bayes learner (NBL), for learning classifiers from AVT and data. Our experimental results show that AVT-NBL is able to generate classifiers that are substantially more compact and more accurate than those produced by NBL on a broad range of data sets with different percentages of partially specified values. We also show that AVT-NBL is more efficient in its use of training data: AVT-NBL produces classifiers that outperform those produced by NBL using substantially fewer training examples.
Keywords :
Bayes methods; learning (artificial intelligence); pattern classification; AVT-NBL algorithm; attribute value taxonomies; classifier learning; naive Bayes classification; Application software; Artificial intelligence; Computer science; Data mining; Laboratories; Learning; Ontologies; Robustness; Taxonomy; Training data;
Conference_Titel :
Data Mining, 2004. ICDM '04. Fourth IEEE International Conference on
Print_ISBN :
0-7695-2142-8
DOI :
10.1109/ICDM.2004.10083