• DocumentCode
    2652537
  • Title

    Quantifying Features Using False Nearest Neighbors: An Unsupervised Approach

  • Author

    Filho, Jose Augusto Andrade ; Carvalho, Andre C P L F ; Mello, Rodrigo F. ; Alelyani, Salem ; Liu, Huan

  • Author_Institution
    ICMC, USP, Sao Carlos, Brazil
  • fYear
    2011
  • fDate
    7-9 Nov. 2011
  • Firstpage
    994
  • Lastpage
    997
  • Abstract
    Real-world datasets commonly present high dimensional data, which means an increased amount of information. However, this does not always imply an improvement in learning technique performance. Furthermore, some features may be correlated or add unexpected noise, thereby reducing data clustering performance. This has motivated the development of feature selection methods to find the most relevant subset of features to describe data. In this work, we focus on the problem of unsupervised feature selection. The main goal is to define a method to identify the number of features to select after sorting them based on some criterion. This task is done by means of the False Nearest Neighbor technique, which is rooted in chaos theory. Results have shown that this technique gives a good approximate number of features to select. When compared to other techniques, in most of the analyzed cases, it maintains the quality of the generated partitions while selecting fewer features.
  • Keywords
    feature extraction; unsupervised learning; data clustering; false nearest neighbor; features selection; learning technique; quantifying feature; real world datasets; unsupervised feature selection; Chaos; Equations; Glass; Iris; Mutual information; Space vehicles; Time series analysis; Chaos Theory; Clustering; Machine Learning; Unsupervised Feature Selection;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Tools with Artificial Intelligence (ICTAI), 2011 23rd IEEE International Conference on
  • Conference_Location
    Boca Raton, FL
  • ISSN
    1082-3409
  • Print_ISBN
    978-1-4577-2068-0
  • Electronic_ISBN
    1082-3409
  • Type

    conf

  • DOI
    10.1109/ICTAI.2011.170
  • Filename
    6103461