• DocumentCode
    1399044
  • Title

    Visualized Analysis of Mixed Numeric and Categorical Data Via Extended Self-Organizing Map

  • Author

    Chung-Chian Hsu ; Shu-Han Lin

  • Author_Institution
    Dept. of Inf. Manage., Nat. Yunlin Univ. of Sci. & Technol., Yunlin, Taiwan
  • Volume
    23
  • Issue
    1
  • fYear
    2012
  • Firstpage
    72
  • Lastpage
    86
  • Abstract
    Many real-world datasets are of mixed types, having numeric and categorical attributes. Even though difficult, analyzing mixed-type datasets is important. In this paper, we propose an extended self-organizing map (SOM), called MixSOM, which utilizes a data structure distance hierarchy to facilitate the handling of numeric and categorical values in a direct, unified manner. Moreover, the extended model regularizes the prototype distance between neighboring neurons in proportion to their map distance so that structures of the clusters can be portrayed better on the map. Extensive experiments on several synthetic and real-world datasets are conducted to demonstrate the capability of the model and to compare MixSOM with several existing models including Kohonen´s SOM, the generalized SOM and visualization-induced SOM. The results show that MixSOM is superior to the other models in reflecting the structure of the mixed-type data and facilitates further analysis of the data such as exploration at various levels of granularity.
  • Keywords
    data analysis; data visualisation; self-organising feature maps; Kohonen SOM; MixSOM; data structure distance hierarchy; extended selforganizing map; generalized SOM; mixed numeric-categorical data; mixed-type datasets; neighboring neurons; visualization-induced SOM; visualized analysis; Aerospace electronics; Algorithm design and analysis; Data visualization; Neurons; Numerical models; Prototypes; Training; Categorical data analysis; clustering methods; mixed-type data analysis; self-organizing map; visualization;
  • fLanguage
    English
  • Journal_Title
    Neural Networks and Learning Systems, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    2162-237X
  • Type

    jour

  • DOI
    10.1109/TNNLS.2011.2178323
  • Filename
    6104220