Title :
Crowdsourcing Enumeration Queries: Estimators and Interfaces
Author :
Trushkowsky, Beth ; Kraska, Tim ; Franklin, Michael J. ; Sarkar, Purnamrita ; Ramachandran, Venketaram
Author_Institution :
Dept. of Comput. Sci., Harvey Mudd Coll., Claremont, CA, USA
Abstract :
Hybrid human/computer database systems promise to greatly expand the usefulness of query processing by incorporating the crowd for data gathering and other tasks. Such systems raise many implementation questions. Perhaps the most fundamental issue is that the closed world assumption underlying relational query semantics does not hold in such systems. As a consequence, the meaning of even simple queries can be called into question. Furthermore, query progress monitoring becomes difficult due to non-uniformities in the arrival of crowd-sourced data and peculiarities of how people work in crowd-sourcing systems. To address these issues, we develop statistical tools that enable users and systems developers to reason about query completeness. These tools can also help drive query execution and crowd-sourcing strategies. We evaluate our techniques using experiments on a popular crowd-sourcing platform.
Keywords :
database management systems; query processing; statistical analysis; user interfaces; crowdsourcing enumeration queries; data gathering; estimator; hybrid human/computer database systems; interface; query completeness; query execution; query progress monitoring; relational query semantics; statistical tools; Computers; Crowdsourcing; Estimation; Sociology; Database design, modeling and management
Journal_Title :
Knowledge and Data Engineering, IEEE Transactions on
DOI :
10.1109/TKDE.2014.2339857