DocumentCode
3595014
Title
Using ontologies for interoperability of data cleaning operations
Author
Almeida, Ricardo ; Oliveira, Paulo
Author_Institution
Dept. de Eng. Inf., Inst. Politec. do Porto, Porto, Portugal
fYear
2012
Firstpage
1
Lastpage
6
Abstract
The emergence of new business models, namely the establishment of partnerships between organizations, the possibility of companies to add existing data on the web, especially in the semantic web, to their information increase some problems already existing in the databases, particularly related to data quality. Poor data can lead to loss of competitiveness of the organizations holding these data and may even lead to their disappearance, since many of their decision-making are based on them. This makes data cleaning an essential process. The currently existing approaches to solve these problems are closely related with database schemas and specific domains. In order to use this process in different repositories, it is necessary that machines understand these data, i.e., it is necessary an associated semantic. The solution presented includes the use of ontologies: (i) for the specification of data cleaning operations and, (ii) as a way of solving the semantic heterogeneity problems of data stored in different databases. With the cleaning operations defined at the conceptual level and existing mappings between domain ontologies and an ontology associated with a database, they may be instantiated and then proposed to the user to be executed over that database, thus enabling their interoperability.
Keywords
data analysis; database management systems; ontologies (artificial intelligence); open systems; semantic Web; business models; data cleaning operations; data quality; database schemas; decision-making; domain ontologies; interoperability; semantic heterogeneity problems; semantic web; Cleaning; Data models; Databases; OWL; Ontologies; Quality management; Data Cleaning; Data Quality; Interoperability; Ontologies;
fLanguage
English
Publisher
ieee
Conference_Titel
Information Systems and Technologies (CISTI), 2012 7th Iberian Conference on
ISSN
2166-0727
Print_ISBN
978-1-4673-2843-2
Type
conf
Filename
6263214
Link To Document