DocumentCode
473301
Title
Correlation-based Attribute Outlier Detection in XML
Author
Koh, Judice L Y ; Lee, Mong Li ; Hsu, Wynne ; Ang, Wee Tiong
Author_Institution
Sch. of Comput., Nat. Univ. of Singapore, Singapore
fYear
2008
fDate
7-12 April 2008
Firstpage
1522
Lastpage
1524
Abstract
Compared to relational data models, the hierarchical structure of semi-structured data such as XML provides semantically meaningful neighbourhoods that help with data cleaning problems such as outlier detection. In this paper, we introduce the concept of a correlated subspace, which leverages the hierarchical relationships between XML attributes to provide contextually informative neighbourhoods for attribute outlier detection. We also design two correlation-based attribute outlier metrics for XML, namely the xO-Measure and the xQ-Measure. The effectiveness of our XML outlier detection approach is supported by experimental results.
Keywords
XML; data structures; correlation-based attribute outlier detection; xO-Measure; xQ-Measure; Cities and towns; Cleaning; Data models; Humans; Object detection; Pattern analysis; Stock markets; Virtual colonoscopy; Watches;
fLanguage
English
Publisher
IEEE
Conference_Titel
Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
Conference_Location
Cancun
Print_ISBN
978-1-4244-1836-7
Electronic_ISBN
978-1-4244-1837-4
Type
conf
DOI
10.1109/ICDE.2008.4497610
Filename
4497610