Title :
Improving classification performance on real data through imputation
Author :
Bratu, C. Vidrighin ; Muresan, T. ; Potolea, R.
Author_Institution :
Tech. Univ. of Cluj-Napoca, Cluj-Napoca
Abstract :
The applicability of learning methods to raw data coming from different areas of human activity is one of the main concerns in data mining research today. This paper emphasizes the need for a sound preprocessing method to improve the quality of the learning process through data imputation. Three classification methods we have previously developed are presented, with a focus on their evaluations. The results prove their increased performance on benchmark data, when compared to similar approaches. Although on real-world data improvements have been observed as well, the case study presented here has revealed the need for a reliable preprocessing method, to enhance the performance of the methods on real, incomplete data. We have carried out preliminary evaluations, on benchmark data, with a new imputation method, based on an ensemble of neural networks.
Keywords :
classification; data mining; learning (artificial intelligence); neural nets; classification; data imputation; data mining; learning; neural networks; sound preprocessing; Artificial neural networks; Classification algorithms; Costs; Data mining; Humans; Learning systems; Machine learning; Medical diagnostic imaging; Medical tests; Testing;
Conference_Titel :
Automation, Quality and Testing, Robotics, 2008. AQTR 2008. IEEE International Conference on
Conference_Location :
Cluj-Napoca
Print_ISBN :
978-1-4244-2576-1
Electronic_ISBN :
978-1-4244-2577-8
DOI :
10.1109/AQTR.2008.4588965