• DocumentCode
    2486055
  • Title

    Distributed randomized algorithms for low-support data mining

  • Author

    Ferro, Alfredo ; Giugno, Rosalba ; Mongiovì, Misael ; Pulvirenti, Alfredo

  • Author_Institution
    Dept. of Math. & Comput. Sci., Univ. of Catania, Catania, Italy
  • fYear
    2009
  • fDate
    23-29 May 2009
  • Firstpage
    1
  • Lastpage
    7
  • Abstract
    Data mining in distributed systems has been facilitated by using high-support association rules. Less attention has been paid to distributed low-support/high-correlation data mining. This has proved useful in several fields such as computational biology, wireless networks, web mining, security and rare events analysis in industrial plants. In this paper we present distributed versions of efficient algorithms for low-support/high-correlation data mining such as Min-Hashing, K-Min-Hashing and Locality-Sensitive-Hashing. Experimental results on real data concerning scalability, speed-up and network traffic are reported.
  • Keywords
    data mining; distributed algorithms; file organisation; randomised algorithms; computational biology; distributed high-correlation data mining; distributed randomized algorithms; high-support association rules; industrial plants; k-min-hashing; locality-sensitive-hashing; low-support data mining; min-hashing; rare events analysis; security; web mining; wireless networks; Association rules; Bioinformatics; Computational biology; Data mining; Matrix decomposition; Partitioning algorithms; Scalability; Transaction databases; Web mining; Workstations;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing, 2009. IPDPS 2009. IEEE International Symposium on
  • Conference_Location
    Rome
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-4244-3751-1
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2009.5161156
  • Filename
    5161156