DocumentCode :
1797567
Title :
Unsupervised robust Bayesian feature selection
Author :
Jianyong Sun ; Aimin Zhou
Author_Institution :
Sch. of Arts, Media & Comput. Games, Abertay Univ., Dundee, UK
fYear :
2014
fDate :
6-11 July 2014
Firstpage :
558
Lastpage :
564
Abstract :
In this paper, we proposed a generative graphical model for unsupervised robust feature selection. The model assumes that the data are independent and identically sampled from a finite mixture of Student-t distribution for dealing with outliers. The Student t-distribution works as the building block for robust clustering and outlier detection. Random variables that represent the features´ saliency are included in the model for feature selection. As a result, the model is expected to simultaneously realise unsupervised clustering, feature selection and outlier detection. The inference is carried out by a tree-structured variational Bayes (VB) algorithm. The feature selection capability is realised by estimating the feature saliencies associated with the features. The adoption of full Bayesian treatment in the model realises automatic model selection. Experimental studies showed that the developed algorithm compares favourably against existing unsupervised Bayesian feature selection algorithm in terms of commonly-used internal and external cluster validity indices on controlled experimental settings and benchmark data sets. The controlled experimental study also showed that the developed algorithm is capable of exposing the outliers and finding the optimal number of components (model selection) accurately.
Keywords :
Bayes methods; feature selection; pattern clustering; statistical distributions; trees (mathematics); VB algorithm; automatic model selection; commonly-used internal cluster; external cluster validity indices; feature selection capability; features saliency; generative graphical model; outlier detection; robust clustering; student-t distribution; tree-structured variational Bayes algorithms; unsupervised Bayesian feature selection algorithm; unsupervised clustering; unsupervised robust Bayesian feature selection; Bayes methods; Clustering algorithms; Data models; Educational institutions; Graphical models; Inference algorithms; Robustness;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Neural Networks (IJCNN), 2014 International Joint Conference on
Conference_Location :
Beijing
Print_ISBN :
978-1-4799-6627-1
Type :
conf
DOI :
10.1109/IJCNN.2014.6889514
Filename :
6889514
Link To Document :
بازگشت