Title :
SPARK2: Top-k Keyword Query in Relational Databases
Author :
Luo, Yi ; Wang, Wei ; Lin, Xuemin ; Zhou, Xiaofang ; Wang, Jianmin ; Li, Keqiu
Author_Institution :
Lab. Le2i, CNRS Dijon, Dijon, France
Abstract :
With the increasing amount of text data stored in relational databases, there is a demand for RDBMS to support keyword queries over text data. As a search result is often assembled from multiple relational tables, traditional IR-style ranking and query evaluation methods cannot be applied directly. In this paper, we study the effectiveness and the efficiency issues of answering top-k keyword query in relational database systems. We propose a new ranking formula by adapting existing IR techniques based on a natural notion of virtual document. We also propose several efficient query processing methods for the new ranking method. We have conducted extensive experiments on large-scale real databases using two popular RDBMSs. The experimental results demonstrate significant improvement to the alternative approaches in terms of retrieval effectiveness and efficiency.
Keywords :
query processing; question answering (information retrieval); relational databases; text analysis; IR-style ranking; RDBMS; SPARK2; effectiveness issues; efficiency issues; large-scale real database; multiple relational table; query evaluation method; query processing method; relational database; text data storage; top-k keyword query answering; virtual document; Electronic mail; Information retrieval; Keyword search; Query processing; Relational databases; Semantics; Top-k; information retrieval.; keyword search; relational database;
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2011.60