DocumentCode
1338858
Title
Answering General Time-Sensitive Queries
Author
Dakka, Wisam ; Gravano, Luis ; Ipeirotis, Panagiotis G.
Author_Institution
Google, Mountain View, CA, USA
Volume
24
Issue
2
fYear
2012
Firstpage
220
Lastpage
235
Abstract
Time is an important dimension of relevance for a large number of searches, such as over blogs and news archives. So far, research on searching over such collections has largely focused on locating topically similar documents for a query. Unfortunately, topic similarity alone is not always sufficient for document ranking. In this paper, we observe that, for an important class of queries that we call time-sensitive queries, the publication time of the documents in a news archive is important and should be considered in conjunction with the topic similarity to derive the final document ranking. Earlier work has focused on improving retrieval for “recency” queries that target recent documents. We propose a more general framework for handling time-sensitive queries and we automatically identify the important time intervals that are likely to be of interest for a query. Then, we build scoring techniques that seamlessly integrate the temporal aspect into the overall ranking mechanism. We present an extensive experimental evaluation using a variety of news article data sets, including TREC data as well as real web data analyzed using the Amazon Mechanical Turk. We examine several techniques for detecting the important time intervals for a query over a news archive and for incorporating this information in the retrieval process. We show that our techniques are robust and significantly improve result quality for time-sensitive queries compared to state-of-the-art retrieval techniques.
Keywords
document handling; query processing; document ranking; information retrieval process; news archive; news article data set; recency query; scoring technique; time-sensitive query; topic similarity; Computational modeling; Google; Histograms; Information retrieval; Mathematical model; Query processing; Search methods; Smoothing methods; Time frequency analysis; Information search and retrieval; processing time-sensitive queries; time-sensitive search.;
fLanguage
English
Journal_Title
Knowledge and Data Engineering, IEEE Transactions on
Publisher
ieee
ISSN
1041-4347
Type
jour
DOI
10.1109/TKDE.2010.187
Filename
5590248
Link To Document