• DocumentCode
    3123614
  • Title

    Scalable Keyword Search on Large Data Streams

  • Author

    Qin, Lu ; Yu, Jeffrey Xu ; Chang, Lijun ; Tao, Yufei

  • Author_Institution
    Chinese Univ. of Hong Kong, Hong Kong
  • fYear
    2009
  • fDate
    March 29 2009-April 2 2009
  • Firstpage
    1199
  • Lastpage
    1202
  • Abstract
    It is widely realized that the integration of information retrieval (IR) and database (DB) techniques provides users with a broad range of high quality services. A new challenging issue along the same direction is IR-styled m-keyword query processing in a RDBMS framework over an open-ended relational data stream. The capability of supporting m-keyword queries over a relational data stream makes it possible for users to monitor events, that are implicitly interrelated, over a relational data stream in a timely manner. In brief, the problem is to find all connected trees whose size is less than or equal to a user-given threshold in terms of number of nodes for a m-keyword query, {k1, k2, middot middot middot , km}, over a relational data stream on a database schema GS. The difficulty of the problem is related to the number of costly joins to be processed over time, which is affected by the parameters such as the number of keywords (m), the maximum size of connected trees (Tmax), as well as the complexity of the database schema when it is viewed as a schema graph (GS). In this paper, we propose a new demand-driven approach to process such a query over a high speed data stream. We show that we can significantly reduce the number of intermediate results when processing joins over a data stream, and therefore can achieve high efficiency.
  • Keywords
    computational complexity; graph theory; query processing; relational databases; IR-styled m-keyword query processing; RDBMS framework; database techniques; information retrieval; open-ended relational data stream; scalable keyword search; schema graph; Costs; Data engineering; Information retrieval; Keyword search; Monitoring; Product design; Query processing; Relational databases; Tree graphs; relational database stream;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
  • Conference_Location
    Shanghai
  • ISSN
    1084-4627
  • Print_ISBN
    978-1-4244-3422-0
  • Electronic_ISBN
    1084-4627
  • Type

    conf

  • DOI
    10.1109/ICDE.2009.200
  • Filename
    4812500