Title :
Clustering Based Classification in Data Mining Method Recommendation
Author :
Kazik, Ondrej ; Peskova, Klara ; Smid, Jakub ; Neruda, Roman
Author_Institution :
Fac. of Math. & Phys., Charles Univ., Prague, Czech Republic
Abstract :
With the growing amount of data available in today´s world, the emphasis is laid on the automatic configuration of data analysis - metal earning. This paper elaborates one of the metal earning sub problems, the data mining method recommendation. Based on a metric over the data features called metadata, we have proposed a solution exploiting clustering of datasets. The agglomerative algorithm is used to construct clustering over the metadata, and the average methods´ performance is computed in each cluster. The ranking of data mining methods is then deduced from the classification of a dataset to a particular cluster. The recommendation algorithm, which is implemented within our data mining multi-agent system, has been tested in various configurations, and the results of these experiments have been compared.
Keywords :
data analysis; data mining; learning (artificial intelligence); meta data; pattern classification; pattern clustering; recommender systems; agglomerative algorithm; clustering based classification; data mining method recommendation; data mining multiagent system; dataset classification; metadata; metalearning subproblem; Clustering algorithms; Data mining; Entropy; Error analysis; Measurement; Training; Metalearning; clustering; data mining; method recommendation;
Conference_Titel :
Machine Learning and Applications (ICMLA), 2013 12th International Conference on
Conference_Location :
Miami, FL
DOI :
10.1109/ICMLA.2013.148