DocumentCode :
2834323
Title :
A Probability Based Approach for Processing Dimension Missing Data
Author :
Cheng, Yu ; Zhang, Tao
Author_Institution :
Dept. of Biomed. Eng., Tsinghua Univ., Beijing, China
fYear :
2009
fDate :
19-20 Dec. 2009
Firstpage :
1
Lastpage :
4
Abstract :
Processing missing value is one of the most important task in data mining. A great many applications, such as social commercial record, biological systems and remote sensing network, in which not only data values from particular features but even data dimension information may also be missing. Such missing values are known as dimension missing values-standard operation over these data may result in unrepresentable or uncertain problems. To tackle this problem of dealing with dimension missing data, in this paper, we first propose a probabilistic model to managing such data. Then, instead of enumerating all possible cases to recover the missed dimensions, we develop an effective and efficient bound confidence approach to speed up the retrieval process. A concrete evaluation using real data sets is reported, which shows that our method is effective and efficient on dimension incomplete data.
Keywords :
data mining; information retrieval; probability; bound confidence approach; data mining; dimension missing data processing; missing value processing; probability based approach; retrieval process; Automation; Biological systems; Biomedical engineering; Cleaning; Concrete; Data analysis; Data mining; Information retrieval; Remote sensing; Testing;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4994-1
Type :
conf
DOI :
10.1109/ICIECS.2009.5364328
Filename :
5364328
Link To Document :
بازگشت