• DocumentCode
    3009886
  • Title

    Improving Content-Oriented XML Retrieval by Exploiting Small Elements

  • Author

    Dopichaj, Philipp

  • Author_Institution
    Univ. of Kaiserslautern, Kaiserslautern
  • fYear
    2007
  • fDate
    3-5 July 2007
  • Firstpage
    68
  • Lastpage
    74
  • Abstract
    XML element retrieval aims at finding the best elements satisfying a user's information need. Elements spanning only a few words, like titles or italicized phrases, are not in themselves useful results, but they can support the relevance of their enclosing elements. For example, if a section's title contains the key words from the user's query, the title itself is unlikely to be a useful result, but the section is very likely to be useful. This paper provides an overview of methods for exploiting small elements for better retrieval results, highlighting their respective advantages and disadvantages. Using the INEX testbed, we show that small elements can indeed provide useful retrieval hints, and we evaluate the trade-offs.
  • Keywords
    XML; content-based retrieval; information needs; INEX testbed; XML element retrieval; content-oriented XML retrieval; information need; user query; Content based retrieval; Context modeling; Engines; HTML; Information retrieval; Sections; Spatial databases; Testing
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Title
    Databases, 2007. BNCOD '07. 24th British National Conference on
  • Conference_Location
    Glasgow
  • Print_ISBN
    0-7695-2912-7
  • Type

    conf

  • DOI
    10.1109/BNCOD.2007.12
  • Filename
    4269819