• DocumentCode
    473301
  • Title

    Correlation-based Attribute Outlier Detection in XML

  • Author

    Koh, Judice L Y ; Lee, Mong Li ; Hsu, Wynne ; Ang, Wee Tiong

  • Author_Institution
    Sch. of Comput., Nat. Univ. of Singapore, Singapore
  • fYear
    2008
  • fDate
    7-12 April 2008
  • Firstpage
    1522
  • Lastpage
    1524
  • Abstract
    Compared to relational data models, the hierarchical structure of semi-structured data such as XML provides semantically meaningful neighbourhoods advancing data cleaning problems such as outlier detection. In this paper, we introduce the concept of correlated subspace that leverages on the hierarchical relationships between XML attributes to provide contextually informative neighbourhoods for attribute outlier detection. We also design two correlation-based attribute outlier metrics for XML, namely the xO-Measure and xQ-Measure. The effectiveness of our XML outlier detection approach is supported with experimental results.
  • Keywords
    XML; data structures; XML; correlation-based attribute outlier detection; xO-Measure; xQ-Measure; Cities and towns; Cleaning; Data models; Humans; Object detection; Pattern analysis; Stock markets; Virtual colonoscopy; Watches; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2008. ICDE 2008. IEEE 24th International Conference on
  • Conference_Location
    Cancun
  • Print_ISBN
    978-1-4244-1836-7
  • Electronic_ISBN
    978-1-4244-1837-4
  • Type

    conf

  • DOI
    10.1109/ICDE.2008.4497610
  • Filename
    4497610