DocumentCode
3335188
Title
DIInCX: An Approach to Discovery of Implicit Integrity Constraints from XML Data
Author
Rodrigues, Khaue Rezende ; Mello, Ronaldo Dos Santos
Author_Institution
Univ. Fed. de Santa Catarina-UFSC, Santa Catarina
fYear
2007
fDate
13-15 Aug. 2007
Firstpage
606
Lastpage
611
Abstract
We propose DIInCX, an approach for discovering implicit semantic integrity constraints (SIC) from XML instances. DIInCX is a process composed of three phases: preprocessing, discovery, and conversion. Our motivation is to improve XML semantic data integration or XML information extraction systems by complementing their resulting XML schemata with SIC rules that a human user cannot explicitly perceive. The approach is validated through experiments showing that the discovered SIC rules are valid, human-readable, and not complex to implement, because they are based on simple restriction conditions.
Keywords
XML; data integrity; data mining; programming language semantics; DIInCX; XML information extraction systems; XML schemata; XML semantic data integration; semantic integrity constraints; Association rules; Data mining; Data models; Delta modulation; Humans; Integrated circuit modeling; Itemsets; Silicon carbide; Terminology
fLanguage
English
Publisher
IEEE
Conference_Titel
Information Reuse and Integration, 2007. IRI 2007. IEEE International Conference on
Conference_Location
Las Vegas, NV
Print_ISBN
1-4244-1500-4
Electronic_ISBN
1-4244-1500-4
Type
conf
DOI
10.1109/IRI.2007.4296687
Filename
4296687