Title :
Assessing the performance of a swarm-based biclustering technique for data imputation
Author :
Veroneze, Rosana ; de França, Fabrício O. ; Von Zuben, Fernando J.
Author_Institution :
Dept. of Comput. Eng. & Ind. Autom. DCA, Univ. of Campinas Unicamp, Campinas, Brazil
Abstract :
Although the missing data problem has been studied for many years, it remains relevant and challenging. Data can be missing for a variety of reasons, and several techniques are capable of processing missing data. Some of them try to estimate the missing values, an approach called imputation. Recently, a biclustering algorithm based on Swarm Intelligence, named SwarmBCluster, was proposed to impute missing data. As it is a novel and promising algorithm, this paper investigates the influence of its parameters on its performance. To this end, the paper compares SwarmBCluster with two other imputation algorithms and then performs a sensitivity analysis. The quality of the imputations is measured with the Root Mean Squared Error (RMSE). The experiments showed that SwarmBCluster achieves good results on the RMSE metric and that a proper choice of parameters can considerably improve the performance of the algorithm.
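For readers unfamiliar with the metric, the RMSE named in the abstract can be sketched with its standard definition: the square root of the mean squared difference between true and estimated values. The abstract does not describe the evaluation protocol, so this sketch assumes the score is computed only over entries whose true values were held out and then imputed. The function name `imputation_rmse` and the sample numbers are hypothetical and are not taken from the paper.

```python
import math


def imputation_rmse(true_values, imputed_values):
    """Standard RMSE between held-out true values and their imputed estimates.

    Assumption: both sequences list the same held-out entries in the same order.
    """
    if len(true_values) != len(imputed_values):
        raise ValueError("sequences must have the same length")
    if not true_values:
        raise ValueError("at least one held-out entry is required")
    squared_errors = [(t - p) ** 2 for t, p in zip(true_values, imputed_values)]
    return math.sqrt(sum(squared_errors) / len(squared_errors))


# Illustrative values only (not results from the paper).
truth = [3.0, 5.0, 2.0, 4.0]
estimates = [2.0, 5.0, 4.0, 4.0]
score = imputation_rmse(truth, estimates)
```

A lower score indicates imputations closer to the held-out values, and a score of zero means every held-out entry was recovered exactly.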
Keywords :
data handling; mean square error methods; optimisation; pattern clustering; SwarmBCluster; ant colony optimization; data imputation; missing data problem; performance assessment; root mean squared error; sensitivity analysis; swarm intelligence; swarm-based biclustering technique; Algorithm design and analysis; Coherence; Data models; Databases; Filtering; Mathematical model; Measurement; Missing data; biclustering
Conference_Title :
Evolutionary Computation (CEC), 2011 IEEE Congress on
Conference_Location :
New Orleans, LA
Print_ISBN :
978-1-4244-7834-7
DOI :
10.1109/CEC.2011.5949644