Title :
A novel generic algorithm for cluster split iB-fold cross-validation
Author_Institution :
Sch. of Comput. & Math. Sci., Auckland Univ. of Technol., Auckland
Abstract :
This article proposes a point of view (sub-space modelling), a post-processing strategy (relating to sub-optimal model) and a pre-processing strategy involving a novel cross-validation algorithm in machine learning with a focus on small datasets. To overcome the limitations imposed by small datasets, this approach leverages human expertise in being able to visualise or conceptualize clusters at high level using e.g. proximity or similarity, without detailed analysis. Validation in case studies on human motion have shown that it is possible to achieve better modelling results using iB-fold and by leveraging understanding of the data set and the nature of validation.
Keywords :
learning (artificial intelligence); pattern clustering; statistical analysis; cluster split iB-fold cross-validation algorithm; machine learning algorithm; post-processing strategy; pre-processing strategy; suboptimal model; Algorithm design and analysis; Clustering algorithms; Humans; Machine learning; Machine learning algorithms; Neural networks; Prototypes; Software prototyping; Testing; Visualization; Cross-validation; iB-fold; leave-one-out; split cluster; sub-space modelling;
Conference_Titel :
Information Technology Interfaces, 2008. ITI 2008. 30th International Conference on
Conference_Location :
Dubrovnik
Print_ISBN :
978-953-7138-12-7
Electronic_ISBN :
1330-1012
DOI :
10.1109/ITI.2008.4588534