• DocumentCode
    2133997
  • Title

    Web archiving strategies by using Web mining techniques

  • Author

    Kawano, Hiroyuki

  • Author_Institution
    Dept. of Syst. Sci., Kyoto Univ., Japan
  • Volume
    2
  • fYear
    2003
  • fDate
    28-30 Aug. 2003
  • Firstpage
    915
  • Abstract
    For preserving huge volume of born-digital information in the Internet, national diet library in Japan has been developing a experimental web archiving system, WARP (http://warp.ndl.go.jp/). However, in order to handle monotonously increasing digital information, we consider many difficult problems of long life data preservation from various technical aspects. In this paper, we try to apply web mining techniques to web archiving strategies. Our strategies are based on the experiences of our Mondou web search engine and web robots, which are based on text/web mining technologies.
  • Keywords
    Internet; information storage; search engines; Internet; Mondou web search engine; Web archiving strategies; Web mining techniques; digital information storage; national diet library; web robots; Data mining; Data visualization; Database systems; Internet; Robots; Search engines; Software libraries; Web mining; Web search; Web server;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Communications, Computers and signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
  • Print_ISBN
    0-7803-7978-0
  • Type

    conf

  • DOI
    10.1109/PACRIM.2003.1235932
  • Filename
    1235932