• DocumentCode
    3142593
  • Title

    A novel feature selection methodology based on outlier detection technologies

  • Author

    Chen, Gang ; Yuanli Cai ; Juan Shi

  • Author_Institution
    Dept. of Autom., Xi´´an Jiaotong Univ., Xi´´an, China
  • fYear
    2011
  • fDate
    27-29 Nov. 2011
  • Firstpage
    363
  • Lastpage
    368
  • Abstract
    Feature selection is becoming more and more important for natural language processing as well as knowledge engineering. In this paper, we induce a simple principle that if an attribute subset has more representativeness, then it should be more self-organized, as a result it should be more insensitive to artificially seeded noise points. Based on that, our novel methodology transforms feature selection problems into outlier detection problems. Because of the characteristics of outlier detection problems, our framework can achieve high tolerance of noises, sub-samplings, and even classification errors in training data sets, which are extraordinary features of our method. Moreover, to evaluate the performance of our method comprehensively, we compare our method with several state-of-the-art methods on a number of real-life data sets, and give all the experiment results, which show that our method can accomplish feature reduction tasks with really high accuracy as well as remarkably low computing complexity.
  • Keywords
    computational complexity; knowledge engineering; natural language processing; statistical analysis; computing complexity; feature reduction task; feature selection methodology; knowledge engineering; natural language processing; outlier detection technology; Argon; Complexity theory; Principal component analysis; Search problems; Sonar; Training data; Transforms; attribute subset evaluator; feature reduction; feature selection; unsupervised feature reduction;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
  • Conference_Location
    Tokushima
  • Print_ISBN
    978-1-61284-729-0
  • Type

    conf

  • DOI
    10.1109/NLPKE.2011.6138226
  • Filename
    6138226