DocumentCode :
1943025
Title :
Ensemble Methods in the Clustering of String Patterns
Author :
Lourenço, André ; Fred, Ana
Author_Institution :
Inst. de Telecomun., Inst. Super. Tecnico, Lisbon
Volume :
1
fYear :
2005
fDate :
5-7 Jan. 2005
Firstpage :
143
Lastpage :
148
Abstract :
We address the problem of clustering of contour images from hardware tools based on string descriptions, in a comparative study of cluster combination techniques. Several clustering algorithms are addressed using both the hierarchical agglomerative concept and partitional approaches. In the later class of algorithms, we explore: an adaptation of the K-means algorithm to string patterns using the median string as cluster representative; the error-correcting parsing approach by Fu; and the very recent spectral clustering approach. These algorithms are applied using several dissimilarity measures, namely: minimum code length based measures; dissimilarity based on the concept of reduction in grammatical complexity; and error-correcting parsing. In a first instance, clustering algorithms are applied individually to the image data set, and results are evaluated in terms of the error rate, taking as ground truth known labeling of the data. In a second step, we combine multiple data partitions, that we call a clustering ensemble, using three state-of-the-art clustering combination techniques. Results show that combination methods lead in general to better data partitioning, as compared to ground truth information.
Keywords :
image processing; pattern clustering; cluster combination techniques; contour images; data partitioning; error-correcting parsing; k-means algorithm; string patterns clustering; Clustering algorithms; Error analysis; Hardware; Length measurement; Partitioning algorithms; Pattern analysis; Shape; Signal analysis; Speech analysis; Telecommunications;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Application of Computer Vision, 2005. WACV/MOTIONS '05 Volume 1. Seventh IEEE Workshops on
Conference_Location :
Breckenridge, CO
Print_ISBN :
0-7695-2271-8
Type :
conf
DOI :
10.1109/ACVMOT.2005.46
Filename :
4129473
Link To Document :
بازگشت