DocumentCode :
2493259
Title :
A cluster number specification-free algorithm in networks intrusion detection
Author :
Xiao, Lizhong ; Wu, Qingtao
Author_Institution :
Dept. of Comput. Sci. & Inf. Eng., Shanghai Inst. of Technol., Shanghai
fYear :
2008
fDate :
25-27 June 2008
Firstpage :
6272
Lastpage :
6276
Abstract :
With the crucial problem of specifying cluster number in clustering algorithm, a cluster number specification-free algorithm, F-CMSVM, is proposed in this paper. Firstly, the data set is classified into two clusters by Fuzzy C-means algorithm (FCM). Then the result is tested by Support Vector Machine (SVM) associated with a fuzzy membership function to confirm whether the data set could be classified. Finally, the process is repeated and the clustering result can be obtained. With this unsupervised algorithm, not only does the training data set need no labeling, but also the cluster number needs no specifying. Experiments over networks connection records from KDD CUP 1999 data set were implemented to evaluate the proposed method. To obtain an appropriate training data set and overcome the low efficiency in processing the high dimensional data set, a cross method and a feature selection algorithm based on mutual information were applied respectively in experiments. The result clearly shows the outstanding performance of the proposed method in decision of cluster number and effect of intrusion detection.
Keywords :
computer networks; fuzzy set theory; number theory; pattern classification; pattern clustering; security of data; support vector machines; unsupervised learning; cluster number specification-free algorithm; feature selection algorithm; fuzzy c-means algorithm; fuzzy membership function; high dimensional data set processing; mutual information; network intrusion detection; support vector machine; unsupervised algorithm; Automation; Clustering algorithms; Fuzzy sets; Intelligent control; Intrusion detection; Labeling; Partitioning algorithms; Support vector machine classification; Support vector machines; Training data; Fuzzy C-means algorithm; Support Vector Machine; cluster number; intrusion detection;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Intelligent Control and Automation, 2008. WCICA 2008. 7th World Congress on
Conference_Location :
Chongqing
Print_ISBN :
978-1-4244-2113-8
Electronic_ISBN :
978-1-4244-2114-5
Type :
conf
DOI :
10.1109/WCICA.2008.4593874
Filename :
4593874
Link To Document :
بازگشت