DocumentCode
2485183
Title
Mining Prevalence-Based Ratio Patterns
Author
Zhang, Minghua ; Hsu, Wynne ; Lee, Mong Li
Author_Institution
Nat. Univ. of Singapore, Singapore
Volume
2
fYear
2007
fDate
29-31 Oct. 2007
Firstpage
140
Lastpage
147
Abstract
Association rule mining aims to discover sets of features that occur together. A variation of association rule mining is ratio rule mining. A ratio rule is an eigenvector of the database that describes ratios of features. However, ratio rules are sensitive to outliers. In this work, we design a prevalence-based model for mining ratio patterns from a database. Our model is more robust to noises, and ratio patterns in our model have clear statistic meanings. We develop an algorithm to quickly determine the sets of features and their ratios that satisfy the prevalence requirement. Data structures, such as hash table and hash tree are utilized to further improve the efficiency of the algorithm. Experiments on synthetic data indicates the efficiency and scalability of the proposed algorithm. We also present a case study on US census data.
Keywords
data mining; tree data structures; US census data; association rule mining; data structures; hash table; hash tree; prevalence-based ratio patterns mining; Association rules; Data mining; Educational institutions; Marketing and sales; Noise robustness; Remuneration; Signal to noise ratio; Spatial databases; Transaction databases; Tree data structures;
fLanguage
English
Publisher
ieee
Conference_Titel
Tools with Artificial Intelligence, 2007. ICTAI 2007. 19th IEEE International Conference on
Conference_Location
Patras
ISSN
1082-3409
Print_ISBN
978-0-7695-3015-4
Type
conf
DOI
10.1109/ICTAI.2007.95
Filename
4410371
Link To Document