Title :
Web pages clustering based on web patterns
Author :
Kudelka, Milos ; Lehecka, Ondrej ; Snasel, Vaclav ; El-Qawasmeh, Eyas
Author_Institution :
Computer Science Dept., VSB - Technical University of Ostrava, Czech Republic
Abstract :
In this paper was proposed a new approach to semantic analysis of web pages. To prove the efficiency of this approach, we designed a method for analysis and evaluation of web pages. The method is built on a silent agreement between web designers and users. The key aspects of this agreement are web patterns which are used by web designers in their web page implementations. With our method, we can find out whether the pattern is presented on the page with a high level of relevance. The extracted patterns are considered as semantic features representing the contents of Web pages. In this lecture, we explain essentials of our approach as well as key features of our method and context for proper usage.
Keywords :
Computer science; Data mining; Design methodology; HTML; Ontologies; Pattern analysis; Search engines; Taxonomy; Testing; Web pages;
Conference_Titel :
Digital Information Management, 2007. ICDIM '07. 2nd International Conference on
Conference_Location :
Lyon, France
Print_ISBN :
978-1-4244-1475-8
Electronic_ISBN :
978-1-4244-1476-5
DOI :
10.1109/ICDIM.2007.4444299