• DocumentCode
    357723
  • Title

    Structuring the Web

  • Author

    Carchiolo, V. ; Longheu, A. ; Malgeri, M.

  • Author_Institution
    Istituto di Inf. e Telecommun., Catania Univ., Italy
  • fYear
    2000
  • fDate
    2000
  • Firstpage
    1123
  • Lastpage
    1127
  • Abstract
    The WWW is a very large and rich information source but with no structure, so locating data of interest may be difficult. In particular a page may be divided into different logical sections of information, whose highlighting may improve both browsing and searching. We propose a simple Web page structuring, by introducing the “semantic block” as a more granular level to categorize information inside a page. We also propose a set of XML tags to be added to the existing HTML tags in order to locate such blocks and to use structured pages both with current and future, structure-aware browsers, reaching the goal of a gradual migration towards a more structured Web. We explore our technique on several Web sites, in order to detect which semantic blocks are needed, also using two simple Java-based tools we developed to add XML tags and manage such structure. Finally, we consider how schema can be represented for a better browsing
  • Keywords
    hypermedia markup languages; information resources; information retrieval; online front-ends; HTML tags; Java-based tools; WWW; Web page structuring; Web sites; XML tags; semantic block; structure-aware browsers; Data structures; Graphics; HTML; Java; Knowledge management; Telecommunications; Web pages; World Wide Web; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database and Expert Systems Applications, 2000. Proceedings. 11th International Workshop on
  • Conference_Location
    London
  • ISSN
    1529-4188
  • Print_ISBN
    0-7695-0680-1
  • Type

    conf

  • DOI
    10.1109/DEXA.2000.875167
  • Filename
    875167