DocumentCode :
2160643
Title :
Research on improving methods of preprocessing in web log mining
Author :
Zhou, Huaqiang ; Gao, Hongxia ; Xiao, Han
Author_Institution :
School of Computer Science, zhongyuan University of technology, Zhengzhou, China
fYear :
2010
fDate :
4-6 Dec. 2010
Firstpage :
1472
Lastpage :
1474
Abstract :
This paper analyzes the web logs by using data mining technology. Through structure mining, usage mining and content mining, analyzes and studies the existing problems in current mining technology. Aiming at relevant key preprocessing schedule such as Frame page filtration, time-out threshold value setting and long-time setting, etc. this paper puts forward a modified data preprocessing technical method and compares the mining results through tests before and after the modification. Experiment proves that, the modified preprocessing technology is feasible and it can solve problems existing in relevant preprocessing effectively.
Keywords :
Algorithm design and analysis; Computers; Data mining; Decision trees; Filtering; Filtration; Heuristic algorithms; Data mining; ID3 algorithm; preprocessing; web log mining;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Information Science and Engineering (ICISE), 2010 2nd International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4244-7616-9
Type :
conf
DOI :
10.1109/ICISE.2010.5691732
Filename :
5691732
Link To Document :
بازگشت