• DocumentCode
    3335188
  • Title

    DIInCX: An Approach to Discovery of Implicit Integrity Constraints from XML Data

  • Author

    Rodrigues, Khaue Rezende ; Mello, Ronaldo Dos Santos

  • Author_Institution
    Univ. Fed. de Santa Catarina-UFSC, Santa Catarina
  • fYear
    2007
  • fDate
    13-15 Aug. 2007
  • Firstpage
    606
  • Lastpage
    611
  • Abstract
    We propose an approach for discovery of implicit semantic integrity constraints (SIC) from XML instances called DIInCX. DIInCX is a process composed by three phases: preprocessing, discovering and conversion. Our motivation with this work is to improve the activity of XML semantic data integration or XML information extraction systems, complementing their resulting XML schemata with SIC rules that cannot be explicitly perceived by a human user. Our approach is validated through experiments that show that the discovered SIC rules are valid, human readable and not complex to be implemented because they are based on simple restrict conditions.
  • Keywords
    XML; data integrity; data mining; programming language semantics; DIInCX; XML information extraction systems; XML schemata; XML semantic data integration; semantic integrity constraints; Association rules; Data mining; Data models; Delta modulation; Humans; Integrated circuit modeling; Itemsets; Silicon carbide; Terminology; XML;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
  • Conference_Location
    Las Vegas, IL
  • Print_ISBN
    1-4244-1500-4
  • Electronic_ISBN
    1-4244-1500-4
  • Type

    conf

  • DOI
    10.1109/IRI.2007.4296687
  • Filename
    4296687