Title :
Attack-resistant frequency counting
Author :
Wu, Bo ; Saia, Jared ; King, Valerie
Author_Institution :
Dept. of Comput. Sci., Univ. of New Mexico, Albuquerque, NM, USA
Abstract :
We present collaborative peer-to-peer algorithms for the problem of approximating frequency counts for popular items distributed across the peers of a large-scale network. Our algorithms are attack-resistant in the sense that they function correctly even in the case where an adaptive and computationally unbounded adversary causes up to a 1/3 fraction of the peers in the network to suffer Byzantine faults. Our algorithms are scalable in the sense that all resource costs are polylogarithmic. Specifically, latency is O(log n); the number of messages and number of bits sent and received by each peer is O(log2n) per item; and number of neighbors of each peer is O(log2n). Our motivation for addressing this problem is to provide a tool for the following three applications: worm and virus detection; spam detection; and distributed data-mining. To the best of our knowledge, our algorithms are the first attack-resistant and scalable algorithms for this problem. Moreover, surprisingly, our algorithms seem to be the first attack-resistant algorithms for any data mining problem.
Keywords :
groupware; large-scale systems; peer-to-peer computing; security of data; Byzantine faults; attack resistant frequency counting; collaborative peer-to-peer algorithms; distributed data mining; large scale network; spam detection; virus detection; worm; Collaborative work; Computer networks; Computer science; Computer worms; Data mining; Fingerprint recognition; Frequency; Large-scale systems; Payloads; Peer to peer computing; Byzantine faults; butterfly; distributed; frequent items;
Conference_Titel :
Parallel & Distributed Processing (IPDPS), 2010 IEEE International Symposium on
Conference_Location :
Atlanta, GA
Print_ISBN :
978-1-4244-6442-5
DOI :
10.1109/IPDPS.2010.5470344