• DocumentCode
    3614662
  • Title

    Rainbow - multiway semantic analysis of Web sites

  • Author

    V. Svatek;J. Kosek;M. Labsky;J. Braza;M. Kavalec;M. Vacura;V. Vavra;V. Snasel

  • Author_Institution
    Dept. of Inf. & Knowledge Eng., Univ. of Econ., Prague, Czech Republic
  • fYear
    2003
  • fDate
    6/25/1905 12:00:00 AM
  • Firstpage
    635
  • Lastpage
    639
  • Abstract
    The Rainbow project aims at the development of a reusable, modular architecture for web (particularly, website) analysis. Individual knowledge-based modules separately analyse different types of web data and communicate the results via web-service interface. The output of analysis has the form of classes (of web resources) predefined in an ontology, extracted text, and/or addresses of retrieved web resources. Within the project, several original methods of analysis as well as (analytic) knowledge acquisition have been developed. The current domains of investigation are sites of small organisations offering products or services, and pornography sites. The paper is the first systematic overview of diverse methods developed or envisaged in Rainbow.
  • Keywords
    "Data mining","HTML","Ontologies","Semantic Web","Uniform resource locators","Companies","Databases","Topology","Knowledge engineering","Computer science"
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Applications, 2003. Proceedings. 14th International Workshop on
  • ISSN
    1529-4188
  • Print_ISBN
    0-7695-1993-8
  • Type

    conf

  • DOI
    10.1109/DEXA.2003.1232093
  • Filename
    1232093