DocumentCode
2652537
Title
Quantifying Features Using False Nearest Neighbors: An Unsupervised Approach
Author
Filho, Jose Augusto Andrade ; Carvalho, Andre C P L F ; Mello, Rodrigo F. ; Alelyani, Salem ; Liu, Huan
Author_Institution
ICMC, USP, Sao Carlos, Brazil
fYear
2011
fDate
7-9 Nov. 2011
Firstpage
994
Lastpage
997
Abstract
Real-world datasets commonly present high dimensional data, which means an increased amount of information. However, this does not always imply an improvement in learning technique performance. Furthermore, some features may be correlated or add unexpected noise, thereby reducing data clustering performance. This has motivated the development of feature selection methods to find the most relevant subset of features to describe data. In this work, we focus on the problem of unsupervised feature selection. The main goal is to define a method to identify the number of features to select after sorting them based on some criterion. This task is done by means of the False Nearest Neighbor technique, which is rooted in chaos theory. Results have shown that this technique gives a good approximate number of features to select. When compared to other techniques, in most of the analyzed cases, it maintains the quality of the generated partitions while selecting fewer features.
Keywords
feature extraction; unsupervised learning; data clustering; false nearest neighbor; features selection; learning technique; quantifying feature; real world datasets; unsupervised feature selection; Chaos; Equations; Glass; Iris; Mutual information; Space vehicles; Time series analysis; Chaos Theory; Clustering; Machine Learning; Unsupervised Feature Selection;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
Conference_Location
Boca Raton, FL
ISSN
1082-3409
Print_ISBN
978-1-4577-2068-0
Electronic_ISBN
1082-3409
Type
conf
DOI
10.1109/ICTAI.2011.170
Filename
6103461
Link To Document