DocumentCode
2133997
Title
Web archiving strategies by using Web mining techniques
Author
Kawano, Hiroyuki
Author_Institution
Dept. of Syst. Sci., Kyoto Univ., Japan
Volume
2
fYear
2003
fDate
28-30 Aug. 2003
Firstpage
915
Abstract
For preserving huge volume of born-digital information in the Internet, national diet library in Japan has been developing a experimental web archiving system, WARP (http://warp.ndl.go.jp/). However, in order to handle monotonously increasing digital information, we consider many difficult problems of long life data preservation from various technical aspects. In this paper, we try to apply web mining techniques to web archiving strategies. Our strategies are based on the experiences of our Mondou web search engine and web robots, which are based on text/web mining technologies.
Keywords
Internet; information storage; search engines; Internet; Mondou web search engine; Web archiving strategies; Web mining techniques; digital information storage; national diet library; web robots; Data mining; Data visualization; Database systems; Internet; Robots; Search engines; Software libraries; Web mining; Web search; Web server;
fLanguage
English
Publisher
ieee
Conference_Titel
Communications, Computers and signal Processing, 2003. PACRIM. 2003 IEEE Pacific Rim Conference on
Print_ISBN
0-7803-7978-0
Type
conf
DOI
10.1109/PACRIM.2003.1235932
Filename
1235932
Link To Document