DocumentCode
3219040
Title
An efficient method of evaluating the distance between two uncertain objects
Author
Chen, Hongmei ; Wang, Lizhen ; Liu, Weiyi ; Xiao, Qing
Author_Institution
Dept. of Comput. Sci. & Eng., Yunnan Univ., Kunming, China
fYear
2010
fDate
9-11 June 2010
Firstpage
1259
Lastpage
1264
Abstract
When data mining techniques are applied to uncertain data, their uncertainty has to be considered to obtain high quality results. Usually, an uncertain object is described by a probability density function, a probability density function is approximated by a large amount of sample points, and the distance between two uncertain objects is expressed by the expected distance. Computing the expected distance is costly because it involves double integral using a large amount of sample points for two uncertain objects´ probability density functions. This is critical for some uncertain data mining techniques. In this paper, a simple and efficient formula of evaluating the distance between two uncertain objects is presented. We also give the application of the formula in nearest-neighbor classifying. Experiments with datasets based on UCI datasets and the plant dataset of “Three Parallel Rivers of Yunnan Protected Area” verify the formula is effective and efficient.
Keywords
data mining; distance measurement; pattern classification; distance evaluation; nearest-neighbor classification; probability density function; uncertain data mining; uncertain object; Automatic control; Automation; Clustering algorithms; Computer science; Costs; Data mining; Information science; Probability density function; Rivers; Uncertainty;
fLanguage
English
Publisher
ieee
Conference_Titel
Control and Automation (ICCA), 2010 8th IEEE International Conference on
Conference_Location
Xiamen
ISSN
1948-3449
Print_ISBN
978-1-4244-5195-1
Electronic_ISBN
1948-3449
Type
conf
DOI
10.1109/ICCA.2010.5524286
Filename
5524286
Link To Document