• DocumentCode
    3240524
  • Title

    Merging Multiple Data Streams on Common Keys over High Performance Networks

  • Author

    Mazzucco, Marco ; Ananthanarayan, Asvin ; Grossman, Robert L. ; Levera, Jorge ; Rao, Gokulnath Bhagavantha

  • Author_Institution
    University of Illinois at Chicago
  • fYear
    2002
  • fDate
    16-22 Nov. 2002
  • Firstpage
    67
  • Lastpage
    67
  • Abstract
    The model for data mining on streaming data assumes that there is a buffer of fixed length and a data stream of infinite length and the challenge is to extract patterns, changes, anomalies, and statistically significant structures by examining the data one time and storing records and derived attributes of length less than N. As data grids, data webs, and semantic webs become more common, mining distributed streaming data will become more and more important. The first step when presented with two or more distributed streams is to merge them using a common key. In this paper, we present two algorithms for merging streaming data using a common key. We also present experimental studies showing these algorithms scale in practice to OC-12 networks.
  • Keywords
    Buffer storage; Clustering algorithms; Computer networks; Data analysis; Data mining; High performance computing; Laboratories; Merging; Sensor systems and applications; Switches;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Supercomputing, ACM/IEEE 2002 Conference
  • ISSN
    1063-9535
  • Print_ISBN
    0-7695-1524-X
  • Type

    conf

  • DOI
    10.1109/SC.2002.10044
  • Filename
    1592903