DocumentCode
441785
Title
Record reduction based on attribute oriented generalization
Author
Wang, Li-zhen ; Chen, Hong-mei
Author_Institution
Sch. of Inf., Yunnan Univ., Kunming, China
Volume
3
fYear
2005
fDate
18-21 Aug. 2005
Firstpage
1693
Abstract
Record reduction is very important in the research and application of KDD. The aim of record reduction is to keep less record count and more information amount. Ratio of record reduction (RRR) and information amount based on semantic proximity (IABSP) are presented as measures. Record reduction is analyzed from two aspects of rules and measures in order to ensure the correction and effectiveness of results. In this paper, record reduction is materialized as record reduction based on attribute oriented generalization (RRBAOG). A new AOG method based on partition, prune and optimization strategies is presented in order to improve the execution efficiency of RRBAOG. Two algorithms of RRBAOG, from bottom to top (FBTT) and from top to bottom (FTTB) are also given. The efficiency of algorithms is analyzed by experiments.
Keywords
data mining; generalisation (artificial intelligence); IABSP; RRBAOG; attribute-oriented generalization; databases; from bottom to top algorithm; from top to bottom algorithm; knowledge discovery; optimization; record reduction; semantic proximity; Algorithm design and analysis; Application software; Computer science; Data mining; Data preprocessing; Databases; Machine learning; Machine learning algorithms; Optimization methods; Partitioning algorithms; Record reduction; attribute-oriented generalization; information amount based on semantic proximity; record reduction based on attribute-oriented generalization;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location
Guangzhou, China
Print_ISBN
0-7803-9091-1
Type
conf
DOI
10.1109/ICMLC.2005.1527217
Filename
1527217
Link To Document