DocumentCode
492564
Title
Symbolic mining of temporal specifications
Author
Gabel, Mark ; Su, Zhendong
Author_Institution
Dept. of Comput. Sci., Univ. of California, Davis, CA
fYear
2008
fDate
10-18 May 2008
Firstpage
51
Lastpage
60
Abstract
Program specifications are important in many phases of the software development process, but they are often omitted or incomplete. An important class of specifications takes the form of temporal properties that prescribe proper usage of components of a software system. Recent work has focused on the automated inference of temporal specifications from the static or runtime behavior of programs. Many techniques match a specification pattern (represented by a finite state automaton) to all possible combinations of program components and enumerate the possible matches. Such approaches suffer from high space complexity and have not scaled beyond simple, two-letter alternating patterns (e.g. (ab)*). In this paper, we precisely define this form of specification mining and show that its general form is NP-complete. We observe a great deal of regularity in the representation and tracking of all possible combinations of system components. This motivates us to introduce a symbolic algorithm, based on binary decision diagrams (BDDs), that exploits this regularity. Our results show that this symbolic approach expands the tractability of this problem by orders of magnitude in both time and space. It enables us to mine more complex specifications, such as the common three-letter resource acquisition, usage, and release pattern ((ab+c)*). We have implemented our algorithm in a practical tool and used it to find significant specifications in real systems, including Apache Ant and Hibernate. We then used these specifications to find previously unknown bugs.
Keywords
data mining; formal specification; program verification; NP-complete problem; automated inference; binary decision diagram; finite state automaton; high space complexity; program components; program specification; release pattern; resource acquisition; runtime behavior; software development process; software system; specification mining; specification pattern; symbolic algorithm; symbolic mining; temporal properties; temporal specification; usage pattern; Algorithm design and analysis; Boolean functions; Computer bugs; Computer science; Data structures; Learning automata; Pattern matching; Programming; Runtime; Software systems; dynamic analysis; formal specifications; specification mining;
fLanguage
English
Publisher
ieee
Conference_Titel
Software Engineering, 2008. ICSE '08. ACM/IEEE 30th International Conference on
Conference_Location
Leipzig
ISSN
0270-5257
Print_ISBN
978-1-4244-4486-1
Electronic_ISBN
0270-5257
Type
conf
DOI
10.1145/1368088.1368096
Filename
4814116
Link To Document