• DocumentCode
    2260948
  • Title

    An Improved LAM Feature Selection Algorithm

  • Author

    Ren, Yong-gong ; Lin, Nan ; Sun, Yu-Qi

  • Author_Institution
    Sch. of Comput. & Inf. Technol., Liaoning Normal Univ., Dalian, China
  • fYear
    2010
  • fDate
    20-22 Aug. 2010
  • Firstpage
    35
  • Lastpage
    38
  • Abstract
    In text categorization, feature selection is an effective feature dimension-reduction methods. To solve the problems of unadaptable high original feature space dimension, too much irrelevance, data redundancy and difficulties in selecting a threshold, we propose an improved LAM feature selection algorithm (ILAMFS). Firstly, combining the gold segmentation and the LAM algorithm based on the characteristics and the category of the correlation analysis, filtering the original feature set, and retaining the feature selection with strong correlation and weak category. Secondly, with the improved LAM algorithm, weighted average and Jaccard coefficient of such thoughts feature subsets make redundancy filtering out redundant features. Finally, we obtain an approximate optimal feature subset. Experimental results show that this method is effective in data dimension on reduction, threshold selection and furthermore, in reducing the computation amount and precision in the feature selection.
  • Keywords
    feature extraction; text analysis; Jaccard coefficient; LAM algorithm; correlation analysis; data dimension; data redundancy; feature dimension reduction method; feature space dimension; feature subset; gold segmentation; improved LAM feature selection algorithm; text categorization; threshold selection; weighted average; Accuracy; Algorithm design and analysis; Approximation algorithms; Classification algorithms; Correlation; Redundancy; Text categorization; Correlation; Features Selection; Redundancy; Weighted Average;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Web Information Systems and Applications Conference (WISA), 2010 7th
  • Conference_Location
    Hohhot
  • Print_ISBN
    978-1-4244-8440-9
  • Type

    conf

  • DOI
    10.1109/WISA.2010.33
  • Filename
    5581379