Title :
Research on improving methods of preprocessing in web log mining
Author :
Zhou, Huaqiang ; Gao, Hongxia ; Xiao, Han
Author_Institution :
School of Computer Science, Zhongyuan University of Technology, Zhengzhou, China
Abstract :
This paper analyzes web logs using data mining technology. Through structure mining, usage mining, and content mining, it analyzes and studies the existing problems in current mining technology. Focusing on key preprocessing steps such as frame page filtering, time-out threshold setting, and long-time setting, the paper puts forward a modified data preprocessing method and compares the mining results from tests before and after the modification. The experiments show that the modified preprocessing technique is feasible and that it effectively solves the problems in the relevant preprocessing steps.
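As a rough illustration of what a time-out threshold does during session identification, the sketch below splits one visitor's request times into sessions whenever the gap between consecutive requests exceeds the threshold. This is a generic heuristic, not the modified method the paper proposes; the function name `split_sessions`, the 30-minute default, and the sample timestamps are all assumptions made for the example.

```python
from datetime import datetime, timedelta


def split_sessions(timestamps, timeout=timedelta(minutes=30)):
    """Group one visitor's request times into sessions.

    A new session starts whenever the gap between two consecutive
    requests is larger than `timeout` (an assumed 30-minute default).
    """
    sessions = []
    for ts in sorted(timestamps):
        # Extend the current session if the gap is within the threshold.
        if sessions and ts - sessions[-1][-1] <= timeout:
            sessions[-1].append(ts)
        else:
            sessions.append([ts])
    return sessions


# Hypothetical request times for a single visitor.
requests = [
    datetime(2010, 1, 1, 9, 0),
    datetime(2010, 1, 1, 9, 10),
    datetime(2010, 1, 1, 10, 5),
]
sessions = split_sessions(requests)
print(len(sessions))
```

Lowering or raising `timeout` changes how many sessions the same log yields, which is why the choice of threshold matters for the downstream mining results.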
Keywords :
Algorithm design and analysis; Computers; Data mining; Decision trees; Filtering; Filtration; Heuristic algorithms; ID3 algorithm; preprocessing; web log mining
Conference_Title :
Information Science and Engineering (ICISE), 2010 2nd International Conference on
Conference_Location :
Hangzhou, China
Print_ISBN :
978-1-4244-7616-9
DOI :
10.1109/ICISE.2010.5691732