• DocumentCode
    3637030
  • Title

    Influence of distance measure choice on the results of hierarchical clustering

  • Author

    S. Pinjušić;M. Vranić

  • Author_Institution
    Faculty of Electrical Engineering and Computing, University of Zagreb, Unska 3, Croatia
  • fYear
    2010
  • Firstpage
    1256
  • Lastpage
    1261
  • Abstract
    Hierarchical clustering method is used to assign observations into clusters which are further connected to form a hierarchical structure. Observations in the same cluster are close together according to the predetermined distance measure while observations belonging to distinct clusters are afar. This paper presents an implementation of specific distance measure used to calculate distances between observations which are described by a mixture of variable types. Data mining tool ‘Orange’ offers ways to program new modules in an efficient manner so it was used for implementation, testing, data processing and result visualization. Finally, a comparison was made between results obtained by previously available widget and output of newly programmed widget which employed new variable types and new distance measure.
  • Keywords
    "Data mining","Data visualization","Libraries","Clustering methods","Testing","Databases","Graphical user interfaces","Shape measurement","Electric variables measurement","Data processing"
  • Publisher
    ieee
  • Conference_Titel
    MIPRO, 2010 Proceedings of the 33rd International Convention
  • Print_ISBN
    978-1-4244-7763-0
  • Type

    conf

  • Filename
    5533662