• DocumentCode
    2432596
  • Title

    Development of an XML Information Retrieval System for Queries on Contents and Structures

  • Author

    Shimizu, T. ; Terada, N. ; Yoshikawa, M.

  • Author_Institution
    Graduate Sch. of Informatics, Kyoto Univ.
  • fYear
    2007
  • fDate
    29-29 Jan. 2007
  • Firstpage
    161
  • Lastpage
    168
  • Abstract
    We have developed an XML information retrieval system which can process queries by keywords or queries by combination of keywords and structural conditions. Queries by keywords are simple yet useful because users are not required to understand XML query languages or XML schema. While issuing queries by combination of keywords and structural conditions requires users to understand query languages and the underlying XML schema, we can restrict the target document fragments and the search conditions using structures in XML. The system was implemented on top of a relational XML database system developed by our group. The system can process both types of queries under a common relational schema. By carefully designing the database schema, the system handles a huge number of document fragments efficiently. For queries by keywords, we have developed a user-friendly interface for displaying search results. Our experiments using INEX test collection show that the system achieved relatively high precision and can process keyword set queries in acceptable search time
  • Keywords
    XML; information retrieval systems; query languages; query processing; XML information retrieval system; XML query languages; XML schema; keyword querying; query process; Content addressable storage; Content based retrieval; Database languages; Displays; Focusing; Informatics; Information retrieval; Information science; User interfaces; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Informatics Research for Development of Knowledge Society Infrastructure, 2007. ICKS 2007. Second International Conference on
  • Conference_Location
    Kyoto
  • Print_ISBN
    0-7695-2811-2
  • Type

    conf

  • DOI
    10.1109/ICKS.2007.9
  • Filename
    4161226