DocumentCode :
3644109
Title :
Logical structure extraction from software requirements documents
Author :
Rehan Rauf;Michał Antkiewicz;Krzysztof Czarnecki
Author_Institution :
Generative Software Development Lab, University of Waterloo, Waterloo, Canada
fYear :
2011
Firstpage :
101
Lastpage :
110
Abstract :
Software requirements documents (SRDs) are often authored in general-purpose rich-text editors, such as MS Word. SRDs contain instances of logical structures, such as use case, business rule, and functional requirement. Automated recognition and extraction of these instances enables advanced requirements management features, such as automated traceability, template conformance checking, guided editing, and interoperability with requirements management tools such as RequisitePro. The variability in content and physical representation of these instances poses challenges to their accurate recognition and extraction. To address these challenges, we present a framework allowing 1) the specification of logical structures in terms of their content, textual rendering, and variability and 2) the extraction of instances of such structures from rich-text documents. Our evaluation involves 36 different logical structures identified in 43 SRDs and shows that the intended content, style, and variability of these structures can be specified in the framework such that their instances can be extracted from the documents with high precision and recall, both close to 100%.
Keywords :
"Unified modeling language","Feature extraction","Software","Organizations","Portable document format","Text analysis","Web pages"
Publisher :
IEEE
Conference_Title :
Requirements Engineering Conference (RE), 2011 19th IEEE International
ISSN :
1090-705X
Print_ISBN :
978-1-4577-0921-0
Type :
conf
DOI :
10.1109/RE.2011.6051638
Filename :
6051638