DocumentCode :
3143365
Title :
Bidirectional mining of non-redundant recurrent rules from a sequence database
Author :
Lo, David ; Ding, Bolin ; Lucia ; Han, Jiawei
Author_Institution :
Sch. of Inf. Syst., Singapore Manage. Univ., Singapore, Singapore
fYear :
2011
fDate :
11-16 April 2011
Firstpage :
1043
Lastpage :
1054
Abstract :
We are interested in scalable mining of a non-redundant set of significant recurrent rules from a sequence database. Recurrent rules have the form “whenever a series of precedent events occurs, eventually a series of consequent events occurs”. They are intuitive and characterize behaviors in many domains. An example is the domain of software specification, in which the rules capture a family of properties beneficial to program verification and bug detection. We enhance a past work on mining recurrent rules by Lo, Khoo, and Liu to perform mining more scalably. We propose a new set of pruning properties embedded in a new mining algorithm. Performance and case studies on benchmark synthetic and real datasets show that our approach is much more efficient and outperforms the state-of-the-art approach in mining recurrent rules by up to two orders of magnitude.
Keywords :
data mining; formal specification; bidirectional mining; bug detection; nonredundant recurrent rule mining; program verification; pruning property; sequence database; software specification domain; Association rules; Complexity theory; Databases; Redundancy; Semantics; Software;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Engineering (ICDE), 2011 IEEE 27th International Conference on
Conference_Location :
Hannover
ISSN :
1063-6382
Print_ISBN :
978-1-4244-8959-6
Electronic_ISBN :
1063-6382
Type :
conf
DOI :
10.1109/ICDE.2011.5767848
Filename :
5767848
Link To Document :
بازگشت