DocumentCode :
2940305
Title :
GATS-C4.5: An Algorithm for Optimizing Features in Flow Classification
Author :
You Chen ; Lei Dai ; Xue-Qi Cheng
Author_Institution :
Graduate Univ. of Chinese, Beijing
fYear :
2008
fDate :
10-12 Jan. 2008
Firstpage :
466
Lastpage :
470
Abstract :
Flow classifier deals with huge amount of data, which contains irrelevant and redundant features causing slower training and testing process, higher resource consumption as well as poor classification accuracy. Optimizing features, therefore, is an important issue in flow classification. In this paper, we propose a wrapper feature selection algorithm GATS-C4.5 aiming at modeling lightweight flow classifier by (1) using hybrid genetic-tabu approach as search strategy to specify candidate subsets for evaluation; (2) using C4.5 algorithm as wrapper approach to obtain the optimum feature subset. We have examined the feasibility of our algorithm by conducting several experiments on flow datasets which were categorized as WWW, MAIL, P2P, etc. The experimental results show that classifier with our approach can greatly improve computational performance without negative impact on classification accuracy. Further more, our approach is able not only to have smaller resource consumption, but also to have higher classification accuracy than Naive Bayes method with Kernel density estimation after Fast Correlation-Based Filter (NBK-FCBF).
Keywords :
Bayes methods; correlation methods; data handling; genetic algorithms; search problems; GATS-C4.5; fast correlation-based filter; flow classification; genetic algorithm tabu search; kernel density estimation; naive Bayes method; wrapper feature selection algorithm; Accuracy; Computers; Filters; Kernel; Machine learning; Machine learning algorithms; Postal services; Quality of service; Testing; World Wide Web;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Consumer Communications and Networking Conference, 2008. CCNC 2008. 5th IEEE
Conference_Location :
Las Vegas, NV
Print_ISBN :
978-1-4244-1456-7
Electronic_ISBN :
978-1-4244-1457-4
Type :
conf
DOI :
10.1109/ccnc08.2007.110
Filename :
4446408
Link To Document :
بازگشت