DocumentCode
2583041
Title
An Extended Vector Space Model for XML Information Retrieval
Author
Guo Yongming ; Chen Dehua ; Le Jiajin
Author_Institution
Coll. of Inf. Sci. & Technol., Donghua Univ., Shanghai
fYear
2009
fDate
23-25 Jan. 2009
Firstpage
797
Lastpage
800
Abstract
With the emergence of more and more XML documents, effectively and efficiently retrieving information from XML documents has become an active research area. Since XML documents lie between structured data and unstructured data which describe both content and structure, it is a huge challenge for effectively and efficiently retrieving information from XML documents. This paper develops a novel retrieval model named as extend vector space model which effectively combines XPath and vector space model for XML information retrieval. A prototype system for XML information retrieval based on this retrieval model has been implemented, and several corresponding algorithms have been introduced. The experiments show that this model has effectively improved recall and precision.
Keywords
XML; information retrieval; XML documents; XML information retrieval; XPath; extended vector space model; retrieval model; structured data; unstructured data; Books; Content based retrieval; Data mining; Database languages; Educational institutions; Information retrieval; Prototypes; Q measurement; Space technology; XML; Extended Vectoe Space Model; XML; XPath; informatiion retrieval;
fLanguage
English
Publisher
ieee
Conference_Titel
Knowledge Discovery and Data Mining, 2009. WKDD 2009. Second International Workshop on
Conference_Location
Moscow
Print_ISBN
978-0-7695-3543-2
Type
conf
DOI
10.1109/WKDD.2009.218
Filename
4772056
Link To Document