DocumentCode :
3312042
Title :
Selection of the Suitable Neighborhood Size Based on Bayesian Information Criterion
Author :
Shao, Chao ; Zhang, Bin
Author_Institution :
Sch. of Comput. & Inf. Eng., Henan Univ. of Finance & Econ., Zhengzhou, China
Volume :
2
fYear :
2010
fDate :
28-31 May 2010
Firstpage :
367
Lastpage :
371
Abstract :
To select a suitable neighborhood size for manifold learning algorithms efficiently, a new method based on BIC (Bayesian Information Criterion) is used in this paper. Due to the locally Euclidean property of the manifold, the PCA (Principal Component Analysis) reconstruction errors of the neighborhoods without shortcut edges remain small; however, those of the neighborhoods with shortcut edges are relatively quite large. So all the PCA reconstruction errors fall into two clusters when the neighborhood size is unsuitable, or one cluster when the neighborhood size is suitable, which can be detected by BIC. Concretely speaking, if the BIC value of the two-cluster solution is larger than that of the one-cluster solution, all the PCA reconstruction errors fall into two clusters, which means that the neighborhood size is unsuitable, otherwise which means that the neighborhood size is suitable. This method only requires running PCA and computing BIC, whose time complexities are relatively small, but not running the time-consuming manifold learning algorithm as those methods based on residual variance do, so this method is much more efficient than those methods based on residual variance. The effectivity of this method can be verified by experimental results well.
Keywords :
Bayes methods; learning (artificial intelligence); pattern clustering; principal component analysis; PCA reconstruction errors; bayesian information criterion; locally Euclidean property; manifold learning algorithms; principal component analysis; suitable neighborhood size selection; Bayesian methods; Chaos; Clustering algorithms; Conference management; Education; Engineering management; Finance; Financial management; Manifolds; Principal component analysis; BIC; K-means; manifold learning; residual variance; the neighborhood size;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Computational Science and Optimization (CSO), 2010 Third International Joint Conference on
Conference_Location :
Huangshan, Anhui
Print_ISBN :
978-1-4244-6812-6
Electronic_ISBN :
978-1-4244-6813-3
Type :
conf
DOI :
10.1109/CSO.2010.128
Filename :
5532984
Link To Document :
بازگشت