DocumentCode
2923197
Title
Interval Data Clustering with Applications
Author
Peng, Wei ; Li, Tao
Author_Institution
Sch. of Comput. Sci., Florida Int. Univ., Miami, FL
fYear
2006
fDate
Nov. 2006
Firstpage
355
Lastpage
362
Abstract
Interval data is described by a group of variables, each of which contains a range of continuous values instead of the traditional single continuous or discrete value. Traditional data analysis simply replaces each interval by its representative (e.g., center or mean) and ignores the structure information of intervals. In this paper, we study the problem of clustering interval data using the modified or extended interval data dissimilarity measures. Our contributions are two-fold. First, we discuss various approaches for measuring the dissimilarities/distances between interval data, investigate the relations among them, and present a comprehensive experimental study on clustering interval data. We show that the extended interval data clustering achieves better performance than traditional ones and produces more meaningful and explanatory results. Second, we propose a two-stage approach for clustering interval data by exploiting the relations between the traditional distances and the modified distances. Experimental results show the effectiveness of our approach
Keywords
data analysis; data structures; pattern clustering; interval data clustering; interval data dissimilarity measures; traditional data analysis; Application software; Artificial intelligence; Clustering algorithms; Computational efficiency; Computer science; Data analysis; Euclidean distance; Histograms;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 2006. ICTAI '06. 18th IEEE International Conference on
Conference_Location
Arlington, VA
ISSN
1082-3409
Print_ISBN
0-7695-2728-0
Type
conf
DOI
10.1109/ICTAI.2006.71
Filename
4031919
Link To Document