• DocumentCode
    1556989
  • Title

    Exploiting Local Coherent Patterns for Unsupervised Feature Ranking

  • Author

    Huang, Qinghua ; Tao, Dacheng ; Li, Xuelong ; Jin, Lianwen ; Wei, Gang

  • Author_Institution
    Sch. of Electron. & Inf. Eng., South China Univ. of Technol., Guangzhou, China
  • Volume
    41
  • Issue
    6
  • fYear
    2011
  • Firstpage
    1471
  • Lastpage
    1482
  • Abstract
    Prior to pattern recognition, feature selection is often used to identify relevant features and discard irrelevant ones for obtaining improved analysis results. In this paper, we aim to develop an unsupervised feature ranking algorithm that evaluates features using discovered local coherent patterns, which are known as biclusters. The biclusters (viewed as submatrices) are discovered from a data matrix. These submatrices are used for scoring relevant features from two aspects, i.e., the interdependence of features and the separability of instances. The features are thereby ranked with respect to their accumulated scores from the total discovered biclusters before the pattern classification. Experimental results show that this proposed method can yield comparable or even better performance in comparison with the well-known Fisher score, Laplacian score, and variance score using three UCI data sets, well improve the results of gene expression data analysis using gene ontology annotation, and finally demonstrate its advantage of unsupervised feature ranking for high-dimensional data.
  • Keywords
    feature extraction; matrix algebra; pattern classification; unsupervised learning; biclusters; data matrix; feature selection; local coherent pattern; pattern classification; unsupervised feature ranking; Algorithm design and analysis; Clustering algorithms; Correlation; Feature extraction; Unsupervised learning; Bicluster score; feature selection; unsupervised learning;
  • fLanguage
    English
  • Journal_Title
    Systems, Man, and Cybernetics, Part B: Cybernetics, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1083-4419
  • Type

    jour

  • DOI
    10.1109/TSMCB.2011.2151256
  • Filename
    5887432