DocumentCode
1263682
Title
Balanced feature selection method for Internet traffic classification
Author
Liu, Zhe ; Liu, Quanwei
Author_Institution
Sch. of Soft Eng., South China Univ. of Technol., Guangzhou, China
Volume
1
Issue
2
fYear
2012
fDate
6/1/2012 12:00:00 AM
Firstpage
74
Lastpage
83
Abstract
In Internet traffic classification, the class imbalance problem is mainly addressed by adjusting the class distribution. In the meanwhile, feature selection is also a key factor evoking this problem. Therefore a new filter feature selection method called balanced feature selection (BFS) is proposed. Every feature is measured both locally and globally and then an optimal feature subset is selected by our search model. A certainty coefficient is presented to measure the correlation between a feature and a certain class locally. The symmetric uncertainty is utilised to measure a feature and all classes globally. Through experiments on two real traffic traces using three classification algorithms, BFS is compared with five existing feature selection methods. Results show that it outperforms others by more than 15.29% g-mean improvement. Classification results are averaged over all datasets and classifiers here, 59.54% g-mean, 86.35% Mauc and 91.42% overall accuracy are achieved, respectively, when it is used.
Keywords
Internet; pattern classification; telecommunication traffic; Internet traffic classification; Mauc; balanced feature selection method; class distribution; class imbalance problem; g-mean improvement; search model;
fLanguage
English
Journal_Title
Networks, IET
Publisher
iet
ISSN
2047-4954
Type
jour
DOI
10.1049/iet-net.2011.0049
Filename
6266779
Link To Document