• DocumentCode
    1943025
  • Title

    Ensemble Methods in the Clustering of String Patterns

  • Author

    Lourenço, André ; Fred, Ana

  • Author_Institution
    Inst. de Telecomun., Inst. Super. Tecnico, Lisbon
  • Volume
    1
  • fYear
    2005
  • fDate
    5-7 Jan. 2005
  • Firstpage
    143
  • Lastpage
    148
  • Abstract
    We address the problem of clustering of contour images from hardware tools based on string descriptions, in a comparative study of cluster combination techniques. Several clustering algorithms are addressed using both the hierarchical agglomerative concept and partitional approaches. In the later class of algorithms, we explore: an adaptation of the K-means algorithm to string patterns using the median string as cluster representative; the error-correcting parsing approach by Fu; and the very recent spectral clustering approach. These algorithms are applied using several dissimilarity measures, namely: minimum code length based measures; dissimilarity based on the concept of reduction in grammatical complexity; and error-correcting parsing. In a first instance, clustering algorithms are applied individually to the image data set, and results are evaluated in terms of the error rate, taking as ground truth known labeling of the data. In a second step, we combine multiple data partitions, that we call a clustering ensemble, using three state-of-the-art clustering combination techniques. Results show that combination methods lead in general to better data partitioning, as compared to ground truth information.
  • Keywords
    image processing; pattern clustering; cluster combination techniques; contour images; data partitioning; error-correcting parsing; k-means algorithm; string patterns clustering; Clustering algorithms; Error analysis; Hardware; Length measurement; Partitioning algorithms; Pattern analysis; Shape; Signal analysis; Speech analysis; Telecommunications;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Application of Computer Vision, 2005. WACV/MOTIONS '05 Volume 1. Seventh IEEE Workshops on
  • Conference_Location
    Breckenridge, CO
  • Print_ISBN
    0-7695-2271-8
  • Type

    conf

  • DOI
    10.1109/ACVMOT.2005.46
  • Filename
    4129473