Title :
Choosing the initial set of exemplars when learning with an NGE-based system
Author :
Figueira, Lucas Baggio ; Nicoletti, Maria Do Carmo
Author_Institution :
Dept. of Comput. Sci., Univ. Fed. de Sao Carlos, Brazil
Abstract :
In the original proposal of the NGE (nested generalized exemplar) system, the induction of a concept is based on an initial set of randomly chosen training examples (named seeds). The number of examples in this set is arbitrary and is generally determined by the user of the system. Empirically, the final results are influenced by the initial choice of seeds. We propose and investigate alternative methods for choosing seeds and empirically evaluate their impact on the learning results in seven knowledge domains, in terms of accuracy and the number of expressions describing the concepts. Despite the additional time investment associated with using a clustering method, and assuming that the accuracy of the induced concept is of major importance, experiments have shown that choosing the initial set of seeds as the centers of clusters can be the best option.
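The two seed-selection strategies contrasted in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not name the clustering method, so plain k-means with Euclidean distance is assumed here, and the function names (`random_seeds`, `cluster_center_seeds`), the toy data, and the choice to map each centroid back to its nearest training example are all hypothetical.

```python
import random
from math import dist  # Euclidean distance, Python 3.8+


def random_seeds(examples, k, rng):
    """Baseline: pick k training examples uniformly at random."""
    return rng.sample(examples, k)


def cluster_center_seeds(examples, k, rng, iterations=20):
    """Assumed variant: seeds are the examples nearest to k-means centroids."""
    centroids = [list(p) for p in rng.sample(examples, k)]
    for _ in range(iterations):
        # Assign each example to its nearest centroid.
        groups = [[] for _ in range(k)]
        for p in examples:
            nearest = min(range(k), key=lambda j: dist(p, centroids[j]))
            groups[nearest].append(p)
        # Move each centroid to the mean of its group (unchanged if empty).
        for j, group in enumerate(groups):
            if group:
                centroids[j] = [sum(col) / len(group) for col in zip(*group)]
    # Map each centroid back to an actual training example.
    # In degenerate cases two centroids may map to the same example.
    return [min(examples, key=lambda p: dist(p, c)) for c in centroids]


# Toy data (hypothetical): two well-separated groups of 2-D points.
data = [(0.0, 0.1), (0.2, 0.0), (0.1, 0.2), (5.0, 5.1), (5.2, 4.9), (4.9, 5.0)]
baseline = random_seeds(data, 2, random.Random(0))
seeds = cluster_center_seeds(data, 2, random.Random(0))
```

On data like this, the random baseline may draw both seeds from the same group, whereas the clustering variant should return one seed per group, at the cost of the extra clustering pass the abstract mentions.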
Keywords :
learning (artificial intelligence); pattern clustering; NGE-based system; clustering method; initial set; nested generalized exemplar system; Clustering methods; Computer science; Euclidean distance; Information technology; Investments; Machine learning; Machine learning algorithms; Nearest neighbor searches; Neural networks; Proposals;
Conference_Title :
Information Technology: Coding and Computing, 2004. Proceedings. ITCC 2004. International Conference on
Print_ISBN :
0-7695-2108-8
DOI :
10.1109/ITCC.2004.1286630