Title :
An efficient method of evaluating the distance between two uncertain objects
Author :
Chen, Hongmei ; Wang, Lizhen ; Liu, Weiyi ; Xiao, Qing
Author_Institution :
Dept. of Comput. Sci. & Eng., Yunnan Univ., Kunming, China
Abstract :
When data mining techniques are applied to uncertain data, their uncertainty has to be considered to obtain high quality results. Usually, an uncertain object is described by a probability density function, a probability density function is approximated by a large amount of sample points, and the distance between two uncertain objects is expressed by the expected distance. Computing the expected distance is costly because it involves double integral using a large amount of sample points for two uncertain objects´ probability density functions. This is critical for some uncertain data mining techniques. In this paper, a simple and efficient formula of evaluating the distance between two uncertain objects is presented. We also give the application of the formula in nearest-neighbor classifying. Experiments with datasets based on UCI datasets and the plant dataset of “Three Parallel Rivers of Yunnan Protected Area” verify the formula is effective and efficient.
Keywords :
data mining; distance measurement; pattern classification; distance evaluation; nearest-neighbor classification; probability density function; uncertain data mining; uncertain object; Automatic control; Automation; Clustering algorithms; Computer science; Costs; Data mining; Information science; Probability density function; Rivers; Uncertainty;
Conference_Titel :
Control and Automation (ICCA), 2010 8th IEEE International Conference on
Conference_Location :
Xiamen
Print_ISBN :
978-1-4244-5195-1
Electronic_ISBN :
1948-3449
DOI :
10.1109/ICCA.2010.5524286