DocumentCode :
1909840
Title :
Ontologies for Reusing Data Cleaning Knowledge
Author :
Almeida, Ricardo ; Oliveira, Paulo ; Braga, Luís ; Barroso, João
Author_Institution :
Comput. Eng. Dept., Inst. Super. Eng. do Porto - IPP Porto, Porto, Portugal
fYear :
2012
fDate :
19-21 Sept. 2012
Firstpage :
238
Lastpage :
241
Abstract :
The emergence of new business models, namely, the establishment of partnerships between organizations, the chance that companies have of adding existing data on the web, especially in the semantic web, to their information, led to the emphasis on some problems existing in databases, particularly related to data quality. Poor data can result in loss of competitiveness of the organizations holding these data, and may even lead to their disappearance, since many of their decision-making processes are based on these data. For this reason, data cleaning is essential. Current approaches to solve these problems are closely linked to database schemas and specific domains. In order that data cleaning can be used in different repositories, it is necessary for computer systems to understand these data, i.e., an associated semantic is needed. The solution presented in this paper includes the use of ontologies: (i) for the specification of data cleaning operations and, (ii) as a way of solving the semantic heterogeneity problems of data stored in different sources. With data cleaning operations defined at a conceptual level and existing mappings between domain ontologies and an ontology that results from a database, they may be instantiated and proposed to the expert/specialist to be executed over that database, thus enabling their interoperability.
Keywords :
business data processing; data handling; formal specification; ontologies (artificial intelligence); open systems; semantic Web; software reusability; business models; computer systems; data cleaning knowledge reusing; data cleaning operation specification; data quality; database schemas; decision-making processes; domain ontologies; interoperability; semantic Web; semantic heterogeneity problems; Cleaning; Data models; Databases; OWL; Ontologies; Quality management; Semantics; Data Cleaning; Data Quality; Interoperability; Ontologies;
fLanguage :
English
Publisher :
ieee
Conference_Titel :
Semantic Computing (ICSC), 2012 IEEE Sixth International Conference on
Conference_Location :
Palermo
Print_ISBN :
978-1-4673-4433-3
Type :
conf
DOI :
10.1109/ICSC.2012.19
Filename :
6337110
Link To Document :
بازگشت