Title :
WIM: an information mining model for the Web
Author :
Baeza-Yates, Ricardo ; Pereira, Álvaro R., Jr. ; Ziviani, Nivio
Author_Institution :
Comput. Sci. Dept., Chile Univ., Santiago, Chile
Abstract :
This paper presents an extended abstract of a model to mine information in applications involving Web and graph analysis, referred to as WIM (Web information mining) Model. We demonstrate the model characteristics using a Web warehouse, where nodes represent Web pages and edges represent hyperlinks. In the model, objects are always sets of nodes and belong to one class. We have physical objects containing attributes directly obtained from Web pages and links, as the title of a Web page or the start and end pages of a link. Logical objects can be created by performing predefined operations on any existing object. WIM has a concise set of eleven operators. In this paper we summarizes the model components and give examples of views. A view is a sequence of operations on objects, and it represents a way to mine information in the graph.
Keywords :
Internet; data mining; data warehouses; graph theory; relational algebra; WIM; Web information mining model; Web pages; Web warehouse; graph analysis; hyperlinks; logical object creation; relational algebra; Algebra; Application software; Computer science; Conferences; HTML; Information analysis; Object oriented modeling; Relational databases; Singular value decomposition; Web pages;
Conference_Titel :
Database and Expert Systems Applications, 2005. Proceedings. Sixteenth International Workshop on
Print_ISBN :
0-7695-2424-9
DOI :
10.1109/DEXA.2005.203