Title :
Automatic construction of Web directory using hyperlink and anchor text
Author :
Suzuki, Yusuke ; Matsubara, Shigeki ; Yoshikawa, Masatoshi
Author_Institution :
Graduate Sch. of Inf. Sci., Nagoya Univ., Japan
fDate :
30 Oct.-1 Nov. 2005
Abstract :
This paper proposes a technique for automatically constructing Web directories from several sites. To construct the hierarchical structure of the directories, the technique finds Web pages with a super-sub relation, which are connected by hyperlinks, and replaces the relation with a super-sub hierarchical relation between directories. The technique constructs hierarchical directories by iterating the integration of directories. As a result of an experiment using five Web sites, it was possible to construct hierarchical directories containing Web pages from several sites.
Keywords :
Internet; classification; Web directory; Web pages; Web sites; anchor text; hyperlink; super-sub hierarchical relation; Information science; Information technology; Web and internet services; Web pages; World Wide Web;
Conference_Titel :
Natural Language Processing and Knowledge Engineering, 2005. IEEE NLP-KE '05. Proceedings of 2005 IEEE International Conference on
Print_ISBN :
0-7803-9361-9
DOI :
10.1109/NLPKE.2005.1598810