DocumentCode
480714
Title
Indexing of Reading Paths for a Structured Information Retrieval on the Web
Author
Gery, M.
Author_Institution
Univ. de Lyon, St. Etienne
Volume
1
fYear
2008
fDate
9-12 Dec. 2008
Firstpage
438
Lastpage
444
Abstract
In this paper, we present a hyperdocument model taking into account the essential aspects of information on the Web: content, composition (logical structure) and non-linear reading (hypertext structure). We have developed a Structured Information Retrieval System (SIRS) based on this model. Its phases of indexing and querying are based on a ldquoreading pathsrdquo point of view of the Web: a Web site is considered as a set of potential reading paths, instead of a set of atomic and flat pages. We have developed an specific algorithm to index the reading paths. We present some experiments aiming at evaluating the interest of our indexing process of reading paths.
Keywords
Internet; indexing; information retrieval; Web site; World Wide Web; hyperdocument model; hypertext structure; indexing process; nonlinear reading; querying; reading paths; structured information retrieval system; Content based retrieval; Context modeling; HTML; Indexing; Information retrieval; Intelligent agent; Intelligent structures; Search engines; Web pages; Web search; indexing; information retrieval; reading path; structure; web;
fLanguage
English
Publisher
ieee
Conference_Titel
Web Intelligence and Intelligent Agent Technology, 2008. WI-IAT '08. IEEE/WIC/ACM International Conference on
Conference_Location
Sydney, NSW
Print_ISBN
978-0-7695-3496-1
Type
conf
DOI
10.1109/WIIAT.2008.386
Filename
4740490
Link To Document