DocumentCode
2328807
Title
Study on algorithms of parallel and distributed data mining calculating process
Author
Fang, Ying-Wu ; Zhao, Xiu-Bing ; Zhang, Guang-Peng ; Wang, Yi ; Sun, Yi ; Zhang, Yong-Fang
Author_Institution
Telecommun. Eng. Inst., Air Force Eng. Univ., Xi´´an, China
Volume
4
fYear
2005
fDate
18-21 Aug. 2005
Firstpage
2084
Abstract
Based on distributed data mining, a kind of parallel and distributed calculating architecture that store partition data information into sub-nodes is introduced by using a thought of partition database and improved Apriori algorithms. It lays emphasis on the data skew in the distributed environment. A converse clustering method is proposed to solve the data skew problem. The corresponding algorithms of parallel and distributed data mining are designed based on the large-scale transaction database. Calculating processes of these algorithms are described in detail. As the parallel and distributed data are processed after effective partition, the transmitted data size is greatly reduced through efficient communication among nodes. The proposed algorithms provide a flexible and extended calculation platform, reduce overhead traffic, and keep a favorable expansibility. The proposed algorithms aim at performing network calculation and finding advantages of network calculation by using a fairly cheap computer. The proposed algorithms can be applied to large parallel or distributed single computer environment.
Keywords
data mining; parallel algorithms; transaction processing; very large databases; converse clustering; data mining; data skew; distributed algorithm; improved Apriori algorithm; large-scale transaction database; network calculation; parallel algorithm; partition database; Algorithm design and analysis; Clustering algorithms; Clustering methods; Computer networks; Data mining; Distributed databases; Large-scale systems; Partitioning algorithms; Telecommunication traffic; Transaction databases; Partition database; converse clustering; data mining; parallel and distributed algorithms;
fLanguage
English
Publisher
ieee
Conference_Titel
Machine Learning and Cybernetics, 2005. Proceedings of 2005 International Conference on
Conference_Location
Guangzhou, China
Print_ISBN
0-7803-9091-1
Type
conf
DOI
10.1109/ICMLC.2005.1527289
Filename
1527289
Link To Document