Title :
Web warehousing: an algebra for web information
Author :
Ng, W.-K. ; Lim, E.-P. ; Huang, C.-T. ; Bhowmick, S. ; Qin, F.-Q.
Author_Institution :
Sch. of Appl. Sci., Nanyang Technol. Inst., Singapore
Abstract :
While conventional keyword indexes maintained by web search engines such as Yahoo, Lycos, and World Wide Web Worm work well for most simple keyword searches, they are inadequate when more complex and structured queries involving the underlying hypertext structure of the World Wide Web are desired. Building from a database perspective, existing work to support such queries focuses on constructing SQL-like query languages for the WWW that assume a relational abstraction of the WWW. Nonetheless, the WWW is a directed graph, and imposing a relational abstraction filters out its inherent topological structure. We propose a data model for the WWW that retains its topological structure and construct a web algebra to manipulate objects in this model. The web algebra establishes a formal foundation from which different web query languages can be designed.
Keywords :
Internet; hypermedia; indexing; information retrieval; query languages; Lycos; SQL; Web warehousing; World Wide Web Worm; Yahoo; data model; directed graph; hypertext structure; keyword indexes; relational abstraction; structured queries; topological structure; web algebra; web information; web query languages; web search engines; Algebra; Buildings; Database languages; Keyword search; Relational databases; Search engines; Warehousing; Web search; Web sites; World Wide Web
Conference_Title :
Research and Technology Advances in Digital Libraries, 1998. ADL 98. Proceedings. IEEE International Forum on
Conference_Location :
Santa Barbara, CA
Print_ISBN :
0-8186-8464-X
DOI :
10.1109/ADL.1998.670423