DocumentCode :
2886902
Title :
Effect of Data Repair on Mining Network Streams
Author :
Ji Meng Loh ; Dasu, Tamraparni
Author_Institution :
AT&T Labs.-Res., Florham Park, NJ, USA
fYear :
2012
fDate :
10-10 Dec. 2012
Firstpage :
226
Lastpage :
233
Abstract :
Data quality issues have special implications in network data. Data glitches are propagated rapidly along pathways dictated by the hierarchy and topology of the network. In this paper, we use temporal data from a vast data network to study data glitches and their effect on network monitoring tasks such as anomaly detection. We demonstrate the consequences of cleaning the data, and develop targeted and customized cleaning strategies by exploiting the network hierarchy.
Keywords :
data mining; anomaly detection; customized cleaning strategies; data glitches; data quality issues; data repair; network data; network hierarchy; network monitoring tasks; network streams mining; temporal data; Cleaning; Context; Data mining; Information management; Maintenance engineering; Measurement; Time series analysis; Big Data; Data glitches; Earth Mover Distance; missing values; network analysis; outliers;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Data Mining Workshops (ICDMW), 2012 IEEE 12th International Conference on
Conference_Location :
Brussels
Print_ISBN :
978-1-4673-5164-5
Type :
conf
DOI :
10.1109/ICDMW.2012.125
Filename :
6406445
Link To Document :
بازگشت