Title :
New Insights into Learning Algorithms and Datasets
Author :
Lee, Jun Won ; Giraud-Carrier, Christophe
Author_Institution :
Dept. of Comput. Sci., Brigham Young Univ., Provo, UT
Abstract :
We report on three distinct experiments that provide new valuable insights into learning algorithms and datasets. We first describe two effective meta-features that significantly impact the predictive accuracy of a broad range of learning algorithms. We then introduce a new efficient meta-feature that measures the degree of hardness (or difficulty) of datasets and show that it is highly linearly correlated with predictive accuracy. Finally, we use the notion of classifier output difference to cluster learning algorithms and show that learning algorithms from different model classes may demonstrate highly similar behaviors.
Keywords :
learning (artificial intelligence); pattern classification; pattern clustering; classifier output difference; cluster learning algorithm; dataset hardness; meta learning; predictive accuracy; Accuracy; Application software; Clustering algorithms; Computer science; Data mining; Decision trees; Machine learning; Machine learning algorithms; Neural networks; Tree data structures; Algorithm Behavior; Dataset Characterization; Meta-features; Meta-learning;
Conference_Titel :
Machine Learning and Applications, 2008. ICMLA '08. Seventh International Conference on
Conference_Location :
San Diego, CA
Print_ISBN :
978-0-7695-3495-4
DOI :
10.1109/ICMLA.2008.8