• DocumentCode
    480714
  • Title

    Indexing of Reading Paths for a Structured Information Retrieval on the Web

  • Author

    Gery, M.

  • Author_Institution
    Univ. de Lyon, St. Etienne
  • Volume
    1
  • fYear
    2008
  • fDate
    9-12 Dec. 2008
  • Firstpage
    438
  • Lastpage
    444
  • Abstract
    In this paper, we present a hyperdocument model taking into account the essential aspects of information on the Web: content, composition (logical structure) and non-linear reading (hypertext structure). We have developed a Structured Information Retrieval System (SIRS) based on this model. Its phases of indexing and querying are based on a "reading paths" point of view of the Web: a Web site is considered as a set of potential reading paths, instead of a set of atomic and flat pages. We have developed a specific algorithm to index the reading paths. We present some experiments aimed at evaluating the benefit of our indexing process of reading paths.
  • Keywords
    Internet; indexing; information retrieval; Web site; World Wide Web; hyperdocument model; hypertext structure; indexing process; nonlinear reading; querying; reading paths; structured information retrieval system; Content based retrieval; Context modeling; HTML; Indexing; Information retrieval; Intelligent agent; Intelligent structures; Search engines; Web pages; Web search; reading path; structure; web
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
  • Conference_Location
    Sydney, NSW
  • Print_ISBN
    978-0-7695-3496-1
  • Type

    conf

  • DOI
    10.1109/WIIAT.2008.386
  • Filename
    4740490