• DocumentCode
    2886902
  • Title

    Effect of Data Repair on Mining Network Streams

  • Author

    Ji Meng Loh ; Dasu, Tamraparni

  • Author_Institution
    AT&T Labs.-Res., Florham Park, NJ, USA
  • fYear
    2012
  • fDate
    10-10 Dec. 2012
  • Firstpage
    226
  • Lastpage
    233
  • Abstract
    Data quality issues have special implications in network data. Data glitches are propagated rapidly along pathways dictated by the hierarchy and topology of the network. In this paper, we use temporal data from a vast data network to study data glitches and their effect on network monitoring tasks such as anomaly detection. We demonstrate the consequences of cleaning the data, and develop targeted and customized cleaning strategies by exploiting the network hierarchy.
  • Keywords
    data mining; anomaly detection; customized cleaning strategies; data glitches; data quality issues; data repair; network data; network hierarchy; network monitoring tasks; network streams mining; temporal data; Cleaning; Context; Data mining; Information management; Maintenance engineering; Measurement; Time series analysis; Big Data; Data glitches; Earth Mover Distance; missing values; network analysis; outliers;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
  • Conference_Location
    Brussels
  • Print_ISBN
    978-1-4673-5164-5
  • Type

    conf

  • DOI
    10.1109/ICDMW.2012.125
  • Filename
    6406445