• DocumentCode
    2583041
  • Title

    An Extended Vector Space Model for XML Information Retrieval

  • Author

    Guo Yongming ; Chen Dehua ; Le Jiajin

  • Author_Institution
    Coll. of Inf. Sci. & Technol., Donghua Univ., Shanghai
  • fYear
    2009
  • fDate
    23-25 Jan. 2009
  • Firstpage
    797
  • Lastpage
    800
  • Abstract
    With the emergence of more and more XML documents, effectively and efficiently retrieving information from XML documents has become an active research area. Since XML documents lie between structured data and unstructured data which describe both content and structure, it is a huge challenge for effectively and efficiently retrieving information from XML documents. This paper develops a novel retrieval model named as extend vector space model which effectively combines XPath and vector space model for XML information retrieval. A prototype system for XML information retrieval based on this retrieval model has been implemented, and several corresponding algorithms have been introduced. The experiments show that this model has effectively improved recall and precision.
  • Keywords
    XML; information retrieval; XML documents; XML information retrieval; XPath; extended vector space model; retrieval model; structured data; unstructured data; Books; Content based retrieval; Data mining; Database languages; Educational institutions; Information retrieval; Prototypes; Q measurement; Space technology; XML; Extended Vectoe Space Model; XML; XPath; informatiion retrieval;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Knowledge Discovery and Data Mining, 2009. WKDD 2009. Second International Workshop on
  • Conference_Location
    Moscow
  • Print_ISBN
    978-0-7695-3543-2
  • Type

    conf

  • DOI
    10.1109/WKDD.2009.218
  • Filename
    4772056