DocumentCode :
3142593
Title :
A novel feature selection methodology based on outlier detection technologies
Author :
Chen, Gang ; Yuanli Cai ; Juan Shi
Author_Institution :
Dept. of Autom., Xi´´an Jiaotong Univ., Xi´´an, China
fYear :
2011
fDate :
27-29 Nov. 2011
Firstpage :
363
Lastpage :
368
Abstract :
Feature selection is becoming more and more important for natural language processing as well as knowledge engineering. In this paper, we induce a simple principle that if an attribute subset has more representativeness, then it should be more self-organized, as a result it should be more insensitive to artificially seeded noise points. Based on that, our novel methodology transforms feature selection problems into outlier detection problems. Because of the characteristics of outlier detection problems, our framework can achieve high tolerance of noises, sub-samplings, and even classification errors in training data sets, which are extraordinary features of our method. Moreover, to evaluate the performance of our method comprehensively, we compare our method with several state-of-the-art methods on a number of real-life data sets, and give all the experiment results, which show that our method can accomplish feature reduction tasks with really high accuracy as well as remarkably low computing complexity.
Keywords :
computational complexity; knowledge engineering; natural language processing; statistical analysis; computing complexity; feature reduction task; feature selection methodology; knowledge engineering; natural language processing; outlier detection technology; Argon; Complexity theory; Principal component analysis; Search problems; Sonar; Training data; Transforms; attribute subset evaluator; feature reduction; feature selection; unsupervised feature reduction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Natural Language Processing andKnowledge Engineering (NLP-KE), 2011 7th International Conference on
Conference_Location :
Tokushima
Print_ISBN :
978-1-61284-729-0
Type :
conf
DOI :
10.1109/NLPKE.2011.6138226
Filename :
6138226
Link To Document :
بازگشت