Title of article
Learner excellence biased by data set selection: A case for data characterisation and artificial data sets
Author/Authors
Macià, Núria and Bernadó-Mansilla, Ester and Orriols-Puig, Albert and Kam Ho, Tin
Issue Information
Journal with serial number, year 2013
Pages
13
From page
1054
To page
1066
Abstract
The excellence of a given learner is usually claimed through a performance comparison with other learners over a collection of data sets. Too often, researchers are not aware of the impact of their data selection on the results. Their test beds are small, and the selection of the data sets is not supported by any previous data analysis. Conclusions drawn on such test beds cannot be generalised, because particular data characteristics may favour certain learners unnoticeably. This work raises these issues and proposes the characterisation of data sets using complexity measures, which can be helpful for both guiding experimental design and explaining the behaviour of learners.
Keywords
Supervised learning, Learner assessment, Data complexity
Journal title
PATTERN RECOGNITION
Serial Year
2013
Record number
1735294