Title :
Distributed Linear Programming and Resource Management for Data Mining in Distributed Environments
Author :
Dutta, Haimonti ; Kargupta, Hillol
Author_Institution :
Center for Comput. Learning Syst., Columbia Univ., New York, NY
Abstract :
Advances in computing and communication has resulted in very large scale distributed environments in recent years. They are capable of storing large volumes of data and often have multiple compute nodes. However, the inherent heterogeneity of data components, the dynamic nature of distributed systems, the need for information synchronization and data fusion over a network and security and access control issues makes the problem of resource management and monitoring a tremendous challenge. In particular, centralized algorithms for management of resources and data may not be sufficient to manage complex distributed systems. In this paper, we present a distributed algorithm for resource and data management which builds on the traditional simplex algorithm used for solving linear optimization problems. Our distributed algorithm is an exact one meaning its results are identical if run in a centralized setting. We provide extensive analytical results and experiments on simulated data to demonstrate the performance of our algorithm.
Keywords :
data mining; linear programming; data fusion; data mining; distributed environments; distributed linear programming; information synchronization; resource management; Access control; Data mining; Data security; Distributed algorithms; Distributed computing; Information security; Large-scale systems; Linear programming; Monitoring; Resource management; Distributed Linear Programming; Distributed Resource Management Via Linear Programming; Simplex Algorithm;
Conference_Titel :
Data Mining Workshops, 2008. ICDMW '08. IEEE International Conference on
Conference_Location :
Pisa
Print_ISBN :
978-0-7695-3503-6
Electronic_ISBN :
978-0-7695-3503-6
DOI :
10.1109/ICDMW.2008.137