DocumentCode
12823
Title
Unsupervised Hybrid Feature Extraction Selection for High-Dimensional Non-Gaussian Data Clustering with Variational Inference
Author
Wentao Fan ; Bouguila, N. ; Ziou, Djemel
Author_Institution
Concordia Inst. for Inf. Syst. Eng., Concordia Univ., Montreal, QC, Canada
Volume
25
Issue
7
fYear
2013
fDate
Jul-13
Firstpage
1670
Lastpage
1685
Abstract
Clustering has been a subject of extensive research in data mining, pattern recognition, and other areas for several decades. The main goal is to assign samples, which are typically non-Gaussian and expressed as points in high-dimensional feature spaces, to one of a number of clusters. It is well known that in such high-dimensional settings, the existence of irrelevant features generally compromises modeling capabilities. In this paper, we propose a variational inference framework for unsupervised non-Gaussian feature selection, in the context of finite generalized Dirichlet (GD) mixture-based clustering. Under the proposed principled variational framework, we simultaneously estimate, in a closed form, all the involved parameters and determine the complexity (i.e., both model an feature selection) of the GD mixture. Extensive simulations using synthetic data along with an analysis of real-world data and human action videos demonstrate that our variational approach achieves better results than comparable techniques.
Keywords
Gaussian processes; data handling; data mining; feature extraction; inference mechanisms; pattern clustering; unsupervised learning; GD mixture based clustering; data mining; generalized Dirichlet mixture; high dimensional Non-Gaussian data clustering; nonGaussian feature selection; pattern recognition; unsupervised hybrid feature extraction selection; variational inference; Approximation methods; Bayesian methods; Data mining; Data models; Feature extraction; Vectors; Bayesian estimation; Mixture models; feature selection; generalized Dirichlet; human action videos; model selection; unsupervised learning; variational inference;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2012.101
Filename
6200275
Link To Document