• DocumentCode
    2754455
  • Title

    Finding Hierarchical Frequent Items in Data Streams

  • Author

    Feng, Wenfeng ; Guo, Qiao ; Zhang, Zhibin

  • Author_Institution
    Network Inf. Center, Beijing Inst. of Technol.
  • Volume
    2
  • fYear
    0
  • fDate
    0-0 0
  • Firstpage
    5972
  • Lastpage
    5976
  • Abstract
    A hierarchical sketch was implemented to summarize the hierarchical structure in stream data. The sketch used a XOR-based pair-wise independent family of hash functions on the hierarchical domain to map stream data items to a three dimensional array of counters of size LtimesDtimesW. Of the counter array, L was the layers in hierarchy, D was the number of uniformly and randomly chosen hash functions, and W was the range of hash functions. Based on the sketch, an algorithm that identified and evaluated the hierarchical frequent items over data streams approximately was implemented. This algorithm has sub-linear time and space costs and is almost exact in statistic meaning
  • Keywords
    approximation theory; data structures; XOR-based pair-wise independent family; approximate algorithm; data streams; hash functions; hierarchical data stream structure; hierarchical frequent items; hierarchical sketch; stream data items mapping; summary data structure; Computer science; Costs; Counting circuits; Data engineering; Data structures; Electronic mail; Gold; Intelligent control; Intelligent networks; Statistics; approximate algorithms; data stream; hierarchical frequent items; summary data structure;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Intelligent Control and Automation, 2006. WCICA 2006. The Sixth World Congress on
  • Conference_Location
    Dalian
  • Print_ISBN
    1-4244-0332-4
  • Type

    conf

  • DOI
    10.1109/WCICA.2006.1714225
  • Filename
    1714225