Title :
Automated Information Extraction from Empirical Software Engineering Literature: Is that possible?
Author :
Cruzes, Daniela ; Mendonca, Manoel ; Basili, Victor ; Shull, Forrest ; Jino, Mario
Author_Institution :
NUPERC/UNIFACS, Salvador
Abstract :
The number of scientific publications is constantly increasing, and the results published on empirical software engineering are growing even faster. Some software engineering publishers have begun to collaborate with research groups to make available repositories of software engineering empirical data. However, these initiatives are limited due to data ownership and privacy issues. As a result, many researchers in the area have adopted systematic reviews as a mean to extract empirical evidence from published material. Systematic reviews are labor intensive and costly. In this paper, we argue that the use of information extraction tools can support systematic reviews and significantly speed up the creation of repositories of SE empirical evidence.
Keywords :
electronic publishing; information retrieval; software engineering; automated information extraction; empirical software engineering literature; software engineering publisher; systematic reviews; Data mining; Drugs; Educational institutions; Information analysis; Natural language processing; Natural languages; Proteins; Software engineering; Software measurement; Text mining;
Conference_Titel :
Empirical Software Engineering and Measurement, 2007. ESEM 2007. First International Symposium on
Conference_Location :
Madrid
Print_ISBN :
978-0-7695-2886-1
DOI :
10.1109/ESEM.2007.62