DocumentCode :
509056
Title :
A Novel Two-Phase Method for the Classification of Incomplete Data
Author :
Qu, Xiuyun ; Yuan, Bo ; Liu, Wenhuang
Author_Institution :
Grad. Sch. Shenzhen, Tsinghua Univ., Shenzhen, China
Volume :
3
fYear :
2009
fDate :
26-27 Dec. 2009
Firstpage :
452
Lastpage :
455
Abstract :
The issue of incomplete data exists across the entire field of data mining. In this paper, a novel two-phase method is developed to deal with the challenge of incomplete data on classification problems. In phase I, the dataset is divided into disjoint subsets based on the attributes with missing values. In phase II, each subset is used to train appropriate classification algorithms respectively in parallel. Experimental results show that the proposed scheme works favorably compared to other techniques on both synthesized and real data sets.
Keywords :
data mining; pattern classification; data mining; data sets; feature deletion; incomplete data classification; missing values; two-phase method; Accidents; Blood; Classification algorithms; Data mining; Industrial engineering; Information management; Innovation management; Loss measurement; Machine learning; Testing; classification; feature deletion; imputation; incomplete data; missing values;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Management, Innovation Management and Industrial Engineering, 2009 International Conference on
Conference_Location :
Xi´an
Print_ISBN :
978-0-7695-3876-1
Type :
conf
DOI :
10.1109/ICIII.2009.418
Filename :
5369135
Link To Document :
بازگشت