DocumentCode :
257585
Title :
Scaling requirements extraction to the crowd: Experiments with privacy policies
Author :
Breaux, Travis D. ; Schaub, Florian
Author_Institution :
Inst. for Software Res., Carnegie Mellon Univ., Pittsburgh, PA, USA
fYear :
2014
fDate :
25-29 Aug. 2014
Firstpage :
163
Lastpage :
172
Abstract :
Natural language text sources have increasingly been used to develop new methods and tools for extracting and analyzing requirements. To validate these new approaches, researchers rely on a small number of trained experts to perform a labor-intensive manual analysis of the text. The time and resources needed to conduct manual extraction, however, has limited the size of case studies and thus the generalizability of results. To begin to address this issue, we conducted three experiments to evaluate crowdsourcing a manual requirements extraction task to a larger number of untrained workers. In these experiments, we carefully balance worker payment and overall cost, as well as worker training and data quality to study the feasibility of distributing requirements extraction to the crowd. The task consists of extracting descriptions of data collection, sharing and usage requirements from privacy policies. We present results from two pilot studies and a third experiment to justify applying a task decomposition approach to requirements extraction. Our contributions include the task decomposition workflow and three metrics for measuring worker performance. The final evaluation shows a 60% reduction in the cost of manual extraction with a 16% increase in extraction coverage.
Keywords :
data acquisition; data privacy; formal specification; natural language processing; text analysis; crowdsourcing; data collection; data quality; data sharing; descriptions extraction; manual requirements extraction task; natural language text sources; overall cost; privacy policies; task decomposition approach; task decomposition workflow; untrained workers; usage requirements; worker payment; worker performance; worker training; Crowdsourcing; Data mining; Encoding; Manuals; Measurement; Natural languages; Privacy; crowdsourcing; natural language; requirements extraction;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Requirements Engineering Conference (RE), 2014 IEEE 22nd International
Conference_Location :
Karlskrona
Print_ISBN :
978-1-4799-3031-9
Type :
conf
DOI :
10.1109/RE.2014.6912258
Filename :
6912258
Link To Document :
بازگشت