Title :
Scalable top-k spatio-temporal term querying
Author :
Skovsgaard, Anders ; Sidlauskas, Darius ; Jensen, Christian S.
Author_Institution :
Dept. of Comput. Sci., Aarhus Univ., Aarhus, Denmark
fDate :
March 31 2014-April 4 2014
Abstract :
With the rapidly increasing deployment of Internet-connected, location-aware mobile devices, very large and increasing amounts of geo-tagged and timestamped user-generated content, such as microblog posts, are being generated. We present indexing, update, and query processing techniques that are capable of providing the top-k terms seen in posts in a user-specified spatio-temporal range. The techniques enable interactive response times in the millisecond range in a realistic setting where the arrival rate of posts exceeds today´s average tweet arrival rate by a factor of 4-10. The techniques adaptively maintain the most frequent items at various spatial and temporal granularities. They extend existing frequent item counting techniques to maintain exact counts rather than approximations. An extensive empirical study with a large collection of geo-tagged tweets shows that the proposed techniques enable online aggregation and query processing at scale in realistic settings.
Keywords :
Internet; indexing; mobile computing; query processing; social networking (online); Internet-connected device; frequent item counting techniques; geo-tagged content; geo-tagged tweets; indexing technique; interactive response times; location-aware mobile devices; online aggregation; query processing techniques; scalable top-k spatio-temporal term querying; spatial granularities; temporal granularities; timestamped user-generated content; update technique; user-specified spatio-temporal range; Aggregates; Heuristic algorithms; Indexing; Merging; Monitoring; Radiation detectors;
Conference_Titel :
Data Engineering (ICDE), 2014 IEEE 30th International Conference on
Conference_Location :
Chicago, IL
DOI :
10.1109/ICDE.2014.6816647