DocumentCode
1097558
Title
Distributed Parallel Support Vector Machines in Strongly Connected Networks
Author
Lu, Yumao ; Roychowdhury, Vwani ; Vandenberghe, Lieven
Author_Institution
Yahoo! Inc., Sunnyvale, CA
Volume
19
Issue
7
fYear
2008
fDate
7/1/2008 12:00:00 AM
Firstpage
1167
Lastpage
1178
Abstract
In this paper, we propose a distributed parallel support vector machine (DPSVM) training mechanism in a configurable network environment for distributed data mining. The basic idea is to exchange support vectors among a strongly connected network (SCN) so that multiple servers may work concurrently on distributed data set with limited communication cost and fast training speed. The percentage of servers that can work in parallel and the communication overhead may be adjusted through network configuration. The proposed algorithm further speeds up through online implementation and synchronization. We prove that the global optimal classifier can be achieved iteratively over an SCN. Experiments on a real-world data set show that the computing time scales well with the size of the training data for most networks. Numerical results show that a randomly generated SCN may achieve better performance than the state of the art method, cascade SVM, in terms of total training time.
Keywords
data mining; iterative methods; network theory (graphs); optimisation; parallel processing; pattern classification; support vector machines; configurable network environment; distributed data mining; distributed parallel support vector machine; global optimal classifier; iterative method; strongly connected network; Convergence; distributed data mining; parallel computing; strongly connected networks (SCNs); support vector machine (SVM);
fLanguage
English
Journal_Title
Neural Networks, IEEE Transactions on
Publisher
ieee
ISSN
1045-9227
Type
jour
DOI
10.1109/TNN.2007.2000061
Filename
4470008
Link To Document