• DocumentCode
    3426135
  • Title

    Detecting Distributed Scans Using High-Performance Query-Driven Visualization

  • Author

    Stockinger, Kurt ; Bethel, E. Wes ; Campbell, Scott ; Dart, Eli ; Wu, Kesheng

  • Author_Institution
    Computational Res. Div., Univ. of California, Berkeley, CA
  • fYear
    2006
  • fDate
    Nov. 2006
  • Firstpage
    39
  • Lastpage
    39
  • Abstract
    Modern forensic analytics applications, like network traffic analysis, perform high-performance hypothesis testing, knowledge discovery and data mining on very large datasets. One essential strategy to reduce the time required for these operations is to select only the most relevant data records for a given computation. In this paper, we present a set of parallel algorithms that demonstrate how an efficient selection mechanism - bitmap indexing - significantly speeds up a common analysis task, namely, computing conditional histogram on very large datasets. We present a thorough study of the performance characteristics of the parallel conditional histogram algorithms. As a case study, we compute conditional histograms for detecting distributed scans hidden in a dataset consisting of approximately 2.5 billion network connection records. We show that these conditional histograms can be computed on interactive time scale (i.e., in seconds). We also show how to progressively modify the selection criteria to narrow the analysis and find the sources of the distributed scans
  • Keywords
    computer networks; data mining; data visualisation; database indexing; parallel algorithms; query processing; statistical analysis; telecommunication computing; telecommunication security; telecommunication traffic; very large databases; bitmap indexing; data mining; distributed scan detection; forensic analytics application; high-performance hypothesis testing; knowledge discovery; network security; network traffic analysis; parallel conditional histogram algorithm; query-driven visualization; very large dataset; Data mining; Data visualization; Forensics; Histograms; Indexing; Parallel algorithms; Performance analysis; Performance evaluation; Telecommunication traffic; Testing; connection analysis; data mining; network; network security; query-driven visualization; visual analytics;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    SC 2006 Conference, Proceedings of the ACM/IEEE
  • Conference_Location
    Tampa, FL
  • Print_ISBN
    0-7695-2700-0
  • Electronic_ISBN
    0-7695-2700-0
  • Type

    conf

  • DOI
    10.1109/SC.2006.25
  • Filename
    4090213