DocumentCode :
3112166
Title :
Machine learning based internet traffic recognition with statistical approach
Author :
Jaiswal, Rupesh Chandrakant ; Lokhande, S.D.
Author_Institution :
Dept. of Electron. & Telecommun., Pune Inst. of Comput. Technol., Pune, India
fYear :
2013
fDate :
13-15 Dec. 2013
Firstpage :
1
Lastpage :
6
Abstract :
The researchers have started looking for Internet traffic recognition techniques that are independent of `well known´ TCP or UDP port numbers, or interpreting the contents of packet payloads. Newer approaches classify traffic by recognizing statistical patterns in externally observable attributes of the traffic (such as typical packet lengths and inter-arrival times). The main goal is to cluster or classify the Internet traffic flows into groups that have identical statistical properties. The need to deal with Traffic patterns, large datasets and Multidimensional spaces of flow and packet attributes is one of the reasons for the introduction of Machine Learning (ML) techniques in this field. ML techniques are subset of Artificial Intelligence used for traffic recognition. Further, there are four types of Machine Learning, i.e. Classification (Supervised learning), clustering (Un-Supervised learning), Numeric prediction and Association. In this research paper IP traffic recognition through classification process is implemented. Different researchers are calling this process as IP traffic Recognition, IP traffic Identification, and sometimes IP traffic classification. Here Real time internet traffic has been captured using packet capturing tool and datasets has been developed. Also few standard datasets have been used in this research work. Then using standard attribute selection algorithms, a reduced statistical feature dataset has been developed. After that, Six ML algorithms AdaboostM1, C4.5, Random Forest tree, MLP, RBF and SVM with Polykernel function classifiers are used for IP traffic classification. This implementation and analysis shows that Tree based algorithms are effective ML techniques for Internet traffic classification with accuracy up to of 99.7616 %.
Keywords :
IP networks; Internet; learning (artificial intelligence); pattern recognition; statistical analysis; telecommunication traffic; IP traffic classification; IP traffic identification; IP traffic recognition; Internet traffic classification; Internet traffic flows; MLP; RBF; SVM; TCP port numbers; UDP port numbers; artificial intelligence; classification process; machine learning based Internet traffic recognition; packet attributes; packet capturing tool; packet payloads; polykernel function classifiers; random forest tree; real time Internet traffic; reduced statistical feature dataset; standard attribute selection algorithms; statistical pattern recognition; tree based algorithms; unsupervised learning; Accuracy; Classification algorithms; IP networks; Internet; Support vector machines; Training; Vegetation; AdaboostM1; C4.5; Internet Traffic Classification; MLP; Machine Learning; RBF and SVM with Polykernel function; Random Forest tree;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
India Conference (INDICON), 2013 Annual IEEE
Conference_Location :
Mumbai
Print_ISBN :
978-1-4799-2274-1
Type :
conf
DOI :
10.1109/INDCON.2013.6726074
Filename :
6726074
Link To Document :
بازگشت