DocumentCode :
480166
Title :
Reinforcing Records for Improving the Quality of Data Integration
Author :
Nie, Tiezheng ; Shen, Derong ; Yu, Ge ; Kou, Yue
Author_Institution :
Northeastern Univ.
Volume :
4
fYear :
2008
fDate :
12-14 Dec. 2008
Firstpage :
512
Lastpage :
515
Abstract :
In data integration, the heterogeneity of sources leads to missing values and to varying expressions of the same value across records, which reduces data quality. In this paper, we propose a novel approach to reinforcing the records of integrated data. By studying the functional dependencies among attributes in the data integration schema, we discover the related attribute that determines an attribute with an uncertain value. Our approach then applies matching algorithms to the values of the related attribute to associate different records, and the uncertain value is reinforced with the consistent value found in an associated record. We also propose algorithms for reinforcing an integrated dataset. Experiments on conference paper data demonstrate the effectiveness and performance of our approach in improving data quality.
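The idea in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm. It assumes a hypothetical functional dependency `title -> venue`, uses a crude normalization step in place of the paper's matching algorithms, and picks the most frequent known value as the "consistent" one. The names `normalize`, `reinforce`, and the sample records are all invented for illustration.

```python
import re
from collections import Counter, defaultdict


def normalize(value):
    """Crude matching key (assumption): lowercase, keep only letters and digits."""
    return re.sub(r"[^a-z0-9]+", " ", value.lower()).strip()


def reinforce(records, determinant, dependent):
    """Fill or unify `dependent` using records whose `determinant` values match.

    Assumes the functional dependency determinant -> dependent holds, so
    records that agree on the determinant should agree on the dependent.
    Returns new records; the input list is left unchanged.
    """
    # Associate records by the normalized value of the determining attribute.
    groups = defaultdict(list)
    for rec in records:
        groups[normalize(rec[determinant])].append(rec)

    result = []
    for rec in records:
        peers = groups[normalize(rec[determinant])]
        # Count the known (non-missing) dependent values among matched records.
        known = Counter(p[dependent] for p in peers if p.get(dependent))
        fixed = dict(rec)
        if known:
            # Use the most frequent known value as the consistent value.
            fixed[dependent] = known.most_common(1)[0][0]
        result.append(fixed)
    return result


# Hypothetical sample data: the second record lacks a venue.
records = [
    {"title": "Reinforcing Records", "venue": "CSSE 2008"},
    {"title": "reinforcing records.", "venue": None},
    {"title": "Another Paper", "venue": None},
]
reinforced = reinforce(records, "title", "venue")
```

In this sketch the second record inherits the venue of the first because their titles match after normalization, while the third record stays unfilled because no matched record supplies a value.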
Keywords :
data integrity; data integration quality; functional dependency; reinforcing dataset; sources heterogeneity; Computer science; Data mining; Filling; Null value; Software engineering; Statistics; Web pages; XML; data integration; quality; reinforce;
fLanguage :
English
Publisher :
IEEE
Conference_Titel :
Computer Science and Software Engineering, 2008 International Conference on
Conference_Location :
Wuhan, Hubei
Print_ISBN :
978-0-7695-3336-0
Type :
conf
DOI :
10.1109/CSSE.2008.626
Filename :
4722670