• DocumentCode
    2534286
  • Title

    Indexing relational database content offline for efficient keyword-based search

  • Author

    Su, Qi ; Widom, Jennifer

  • Author_Institution
    Stanford Univ., CA, USA
  • fYear
    2005
  • fDate
    25-27 July 2005
  • Firstpage
    297
  • Lastpage
    306
  • Abstract
    Information retrieval systems such as Web search engines offer convenient keyword-based search interfaces. In contrast, relational database systems require the user to learn SQL and to know the schema of the underlying data even to pose simple searches. We propose an architecture that supports highly efficient keyword-based search over relational databases: A relational database is "crawled" in advance, text-indexing virtual documents that correspond to interconnected database content. At query time, the text index supports keyword-based searches with interactive response, identifying database objects corresponding to the virtual documents matching the query. Our system, EKSO, creates virtual documents from joining relational tuples and uses the DB2 Net Search Extender for indexing and keyword-search processing. Experimental results show that index size is manageable and database updates (which are propagated incrementally as recomputed virtual documents to the text index) do not significantly hinder query performance. We also present a user study confirming the superiority of keyword-based search over SQL for a range of database retrieval tasks.
  • Keywords
    content-based retrieval; database indexing; relational databases; DB2 Net Search Extender; EKSO system; database indexing; database object identification; database retrieval; information retrieval systems; interconnected database content; keyword-based search; query matching; relational database; relational tuples; text-indexing virtual documents; Content based retrieval; Crawlers; Indexes; Indexing; Information retrieval; Keyword search; Neutron spin echo; Relational databases; Search engines; Web search;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Database Engineering and Application Symposium, 2005. IDEAS 2005. 9th International
  • ISSN
    1098-8068
  • Print_ISBN
    0-7695-2404-4
  • Type

    conf

  • DOI
    10.1109/IDEAS.2005.36
  • Filename
    1540920