• DocumentCode
    12823
  • Title

    Unsupervised Hybrid Feature Extraction Selection for High-Dimensional Non-Gaussian Data Clustering with Variational Inference

  • Author

    Wentao Fan ; Bouguila, N. ; Ziou, Djemel

  • Author_Institution
    Concordia Inst. for Inf. Syst. Eng., Concordia Univ., Montreal, QC, Canada
  • Volume
    25
  • Issue
    7
  • fYear
    2013
  • fDate
    Jul-13
  • Firstpage
    1670
  • Lastpage
    1685
  • Abstract
    Clustering has been a subject of extensive research in data mining, pattern recognition, and other areas for several decades. The main goal is to assign samples, which are typically non-Gaussian and expressed as points in high-dimensional feature spaces, to one of a number of clusters. It is well known that in such high-dimensional settings, the existence of irrelevant features generally compromises modeling capabilities. In this paper, we propose a variational inference framework for unsupervised non-Gaussian feature selection, in the context of finite generalized Dirichlet (GD) mixture-based clustering. Under the proposed principled variational framework, we simultaneously estimate, in a closed form, all the involved parameters and determine the complexity (i.e., both model an feature selection) of the GD mixture. Extensive simulations using synthetic data along with an analysis of real-world data and human action videos demonstrate that our variational approach achieves better results than comparable techniques.
  • Keywords
    Gaussian processes; data handling; data mining; feature extraction; inference mechanisms; pattern clustering; unsupervised learning; GD mixture based clustering; data mining; generalized Dirichlet mixture; high dimensional Non-Gaussian data clustering; nonGaussian feature selection; pattern recognition; unsupervised hybrid feature extraction selection; variational inference; Approximation methods; Bayesian methods; Data mining; Data models; Feature extraction; Vectors; Bayesian estimation; Mixture models; feature selection; generalized Dirichlet; human action videos; model selection; unsupervised learning; variational inference;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2012.101
  • Filename
    6200275