Title :
A Probability Based Approach for Processing Dimension Missing Data
Author :
Cheng, Yu ; Zhang, Tao
Author_Institution :
Dept. of Biomed. Eng., Tsinghua Univ., Beijing, China
Abstract :
Processing missing value is one of the most important task in data mining. A great many applications, such as social commercial record, biological systems and remote sensing network, in which not only data values from particular features but even data dimension information may also be missing. Such missing values are known as dimension missing values-standard operation over these data may result in unrepresentable or uncertain problems. To tackle this problem of dealing with dimension missing data, in this paper, we first propose a probabilistic model to managing such data. Then, instead of enumerating all possible cases to recover the missed dimensions, we develop an effective and efficient bound confidence approach to speed up the retrieval process. A concrete evaluation using real data sets is reported, which shows that our method is effective and efficient on dimension incomplete data.
Keywords :
data mining; information retrieval; probability; bound confidence approach; data mining; dimension missing data processing; missing value processing; probability based approach; retrieval process; Automation; Biological systems; Biomedical engineering; Cleaning; Concrete; Data analysis; Data mining; Information retrieval; Remote sensing; Testing;
Conference_Titel :
Information Engineering and Computer Science, 2009. ICIECS 2009. International Conference on
Conference_Location :
Wuhan
Print_ISBN :
978-1-4244-4994-1
DOI :
10.1109/ICIECS.2009.5364328