DocumentCode :
3123552
Title :
An Incremental Threshold Method for Continuous Text Search Queries
Author :
Mouratidis, Kyriakos ; Pang, HweeHwa
Author_Institution :
Sch. of Inf. Syst., Singapore Manage. Univ., Singapore
fYear :
2009
fDate :
March 29, 2009 - April 2, 2009
Firstpage :
1187
Lastpage :
1190
Abstract :
A text filtering system monitors a stream of incoming documents to identify those that match the interest profiles of its users. User interests are registered at a server as continuous text search queries. For each query, the server continuously maintains a ranked result list comprising the recent documents (drawn from a sliding window) with the highest similarity to the query. Such a system underlies many text monitoring applications that must cope with heavy document traffic, such as news and email monitoring. In this paper, we propose the first solution for processing continuous text queries efficiently. Our objective is to support a large number of user queries while sustaining high document arrival rates. Our solution indexes the streamed documents with a structure based on the principles of the inverted file, and processes document arrival and expiration events with an incremental threshold-based method. Using a stream of real documents, we experimentally verify the efficiency of our approach, which is at least an order of magnitude faster than a competitor constructed from existing techniques.
Keywords :
information filtering; query processing; text analysis; continuous text search query; incremental threshold method; text filtering system; Conference management; Data engineering; Defense industry; Dictionaries; Filtering; Management information systems; Monitoring; Portfolios; Spatial indexes; Weapons;
fLanguage :
English
Publisher :
IEEE
Conference_Title :
Data Engineering, 2009. ICDE '09. IEEE 25th International Conference on
Conference_Location :
Shanghai
ISSN :
1084-4627
Print_ISBN :
978-1-4244-3422-0
Electronic_ISBN :
1084-4627
Type :
conf
DOI :
10.1109/ICDE.2009.197
Filename :
4812497