DocumentCode :
2006107
Title :
New Insights into Learning Algorithms and Datasets
Author :
Lee, Jun Won ; Giraud-Carrier, Christophe
Author_Institution :
Dept. of Comput. Sci., Brigham Young Univ., Provo, UT
fYear :
2008
fDate :
11-13 Dec. 2008
Firstpage :
135
Lastpage :
140
Abstract :
We report on three distinct experiments that provide new valuable insights into learning algorithms and datasets. We first describe two effective meta-features that significantly impact the predictive accuracy of a broad range of learning algorithms. We then introduce a new efficient meta-feature that measures the degree of hardness (or difficulty) of datasets and show that it is highly linearly correlated with predictive accuracy. Finally, we use the notion of classifier output difference to cluster learning algorithms and show that learning algorithms from different model classes may demonstrate highly similar behaviors.
Keywords :
learning (artificial intelligence); pattern classification; pattern clustering; classifier output difference; cluster learning algorithm; dataset hardness; meta learning; predictive accuracy; Accuracy; Application software; Clustering algorithms; Computer science; Data mining; Decision trees; Machine learning; Machine learning algorithms; Neural networks; Tree data structures; Algorithm Behavior; Dataset Characterization; Meta-features; Meta-learning;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-0-7695-3495-4
Type :
conf
DOI :
10.1109/ICMLA.2008.8
Filename :
4724966
Link To Document :
بازگشت