DocumentCode :
2645712
Title :
Detecting capability evaluate of spider detection techniques
Author :
Chunlong, Fan ; Zhouhua, Yu ; Lei, Xu
Author_Institution :
Dept. of Comput. Sci., Shenyang Inst. of Aeronaut. Eng., Shenyang, China
Volume :
7
fYear :
2010
fDate :
16-18 April 2010
Abstract :
A spider is a program that harvests Internet resources. To regulate and monitor the access behavior of spiders, many Spider Detection Techniques (SDT) have been proposed, based on decision trees, Bayesian networks, and similar methods. At present, the evaluation of these detection techniques relies mainly on manual analysis of web log data. To evaluate the detecting capability of SDT objectively and accurately, this paper proposes an Evaluation Method based on Trap technique (EMT). EMT rests on the statistical hypothesis that the users captured by traps follow a binomial distribution. The trap layout information and the user access information are used to calculate the evaluation indicators. The experimental results indicate that EMT yields accurate and well-founded conclusions.
Keywords :
Internet; belief networks; binomial distribution; decision trees; security of data; statistical analysis; Bayesian network; EMT; SDT; Web log data; binomial distribution theory; capability evaluate; decision tree; detecting capability; evaluation method; harvesting Internet resources; spider detection techniques; statistical hypothesis; trap technique; traps layout information; Aerospace engineering; Bayesian methods; Computer science; Computerized monitoring; Decision trees; Internet; Manuals; Robots; Tin; Uniform resource locators; Spider; binomial distribution; evaluation method; spider detection techniques;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Engineering and Technology (ICCET), 2010 2nd International Conference on
Conference_Location :
Chengdu
Print_ISBN :
978-1-4244-6347-3
Type :
conf
DOI :
10.1109/ICCET.2010.5485230
Filename :
5485230