Title :
Rainbow - multiway semantic analysis of Web sites
Author :
V. Svatek;J. Kosek;M. Labsky;J. Braza;M. Kavalec;M. Vacura;V. Vavra;V. Snasel
Author_Institution :
Dept. of Inf. & Knowledge Eng., Univ. of Econ., Prague, Czech Republic
fDate :
2003
Abstract :
The Rainbow project aims to develop a reusable, modular architecture for web (particularly, website) analysis. Individual knowledge-based modules separately analyse different types of web data and communicate their results via a web-service interface. The output of analysis takes the form of classes (of web resources) predefined in an ontology, extracted text, and/or addresses of retrieved web resources. Within the project, several original methods of analysis as well as of (analytic) knowledge acquisition have been developed. The current domains of investigation are sites of small organisations offering products or services, and pornography sites. The paper is the first systematic overview of the diverse methods developed or envisaged in Rainbow.
Keywords :
"Data mining","HTML","Ontologies","Semantic Web","Uniform resource locators","Companies","Databases","Topology","Knowledge engineering","Computer science"
Conference_Titel :
Database and Expert Systems Applications, 2003. Proceedings. 14th International Workshop on
Print_ISBN :
0-7695-1993-8
DOI :
10.1109/DEXA.2003.1232093