DocumentCode
559653
Title
Digitizing strategy on the same ontology in heterogeneous data source
Author
Jiaoxiong, Xia ; Mengfang, Li ; Minjie, Bian ; Jun, Xu
Author_Institution
Sch. of Opt.-Electr. & Comput. Eng., Univ. of Shanghai for Sci. & Technol., Shanghai, China
fYear
2011
fDate
24-26 Oct. 2011
Firstpage
81
Lastpage
85
Abstract
The existence of the data objects, having the same ontology in heterogeneous data source (SO-HDS), is always the difficulty in cleaning process. Nowadays, there are several matching algorithms which can detect these data, such as Descartes Method, Enhanced Descartes Method and Priority Queue Algorithm. All these algorithms detect the similarity among the data directly without any pre-process on the original data. In this paper, we put forward a digitizing strategy on matching data objects based on the ontology of data object. When the data objects have the feature of SO-HDS, the storage mode and expression of these data objects can be ignored. We also propose a new data matching algorithm to find out the data objects having SO-HDS with the help of physics store attribute of data object. The new digitizing strategy will reduce the comparison amongst data objects, and keep the accuracy at the same time.
Keywords
data handling; ontologies (artificial intelligence); Descartes method; data object matching; data similarity; digitizing strategy; enhanced Descartes method; heterogeneous data source; ontology; priority queue algorithm; Data Cleaning; Data matching; Digitize; Ontology; the same ontology in heterogeneous data source(SO-HDS);
fLanguage
English
Publisher
ieee
Conference_Titel
Data Mining and Intelligent Information Technology Applications (ICMiA), 2011 3rd International Conference on
Conference_Location
Macao
Print_ISBN
978-1-4673-0231-9
Type
conf
Filename
6108403
Link To Document