• DocumentCode
    2160643
  • Title

    Research on improving methods of preprocessing in web log mining

  • Author

    Zhou, Huaqiang ; Gao, Hongxia ; Xiao, Han

  • Author_Institution
    School of Computer Science, zhongyuan University of technology, Zhengzhou, China
  • fYear
    2010
  • fDate
    4-6 Dec. 2010
  • Firstpage
    1472
  • Lastpage
    1474
  • Abstract
    This paper analyzes the web logs by using data mining technology. Through structure mining, usage mining and content mining, analyzes and studies the existing problems in current mining technology. Aiming at relevant key preprocessing schedule such as Frame page filtration, time-out threshold value setting and long-time setting, etc. this paper puts forward a modified data preprocessing technical method and compares the mining results through tests before and after the modification. Experiment proves that, the modified preprocessing technology is feasible and it can solve problems existing in relevant preprocessing effectively.
  • Keywords
    Algorithm design and analysis; Computers; Data mining; Decision trees; Filtering; Filtration; Heuristic algorithms; Data mining; ID3 algorithm; preprocessing; web log mining;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Science and Engineering (ICISE), 2010 2nd International Conference on
  • Conference_Location
    Hangzhou, China
  • Print_ISBN
    978-1-4244-7616-9
  • Type

    conf

  • DOI
    10.1109/ICISE.2010.5691732
  • Filename
    5691732