DocumentCode
2160643
Title
Research on improving methods of preprocessing in web log mining
Author
Zhou, Huaqiang ; Gao, Hongxia ; Xiao, Han
Author_Institution
School of Computer Science, Zhongyuan University of Technology, Zhengzhou, China
fYear
2010
fDate
4-6 Dec. 2010
Firstpage
1472
Lastpage
1474
Abstract
This paper analyzes web logs using data mining technology. Through structure mining, usage mining, and content mining, it analyzes and studies the problems in current mining technology. Targeting key preprocessing steps such as frame page filtering, time-out threshold setting, and long-time setting, the paper puts forward a modified data preprocessing method and compares the mining results from tests before and after the modification. The experiments show that the modified preprocessing technique is feasible and can effectively solve the problems in the relevant preprocessing steps.
Keywords
Algorithm design and analysis; Computers; Data mining; Decision trees; Filtering; Filtration; Heuristic algorithms; ID3 algorithm; preprocessing; web log mining
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Science and Engineering (ICISE), 2010 2nd International Conference on
Conference_Location
Hangzhou, China
Print_ISBN
978-1-4244-7616-9
Type
conf
DOI
10.1109/ICISE.2010.5691732
Filename
5691732