• DocumentCode
    559653
  • Title

    Digitizing strategy on the same ontology in heterogeneous data source

  • Author

    Jiaoxiong, Xia ; Mengfang, Li ; Minjie, Bian ; Jun, Xu

  • Author_Institution
    Sch. of Opt.-Electr. & Comput. Eng., Univ. of Shanghai for Sci. & Technol., Shanghai, China
  • fYear
    2011
  • fDate
    24-26 Oct. 2011
  • Firstpage
    81
  • Lastpage
    85
  • Abstract
    The existence of the data objects, having the same ontology in heterogeneous data source (SO-HDS), is always the difficulty in cleaning process. Nowadays, there are several matching algorithms which can detect these data, such as Descartes Method, Enhanced Descartes Method and Priority Queue Algorithm. All these algorithms detect the similarity among the data directly without any pre-process on the original data. In this paper, we put forward a digitizing strategy on matching data objects based on the ontology of data object. When the data objects have the feature of SO-HDS, the storage mode and expression of these data objects can be ignored. We also propose a new data matching algorithm to find out the data objects having SO-HDS with the help of physics store attribute of data object. The new digitizing strategy will reduce the comparison amongst data objects, and keep the accuracy at the same time.
  • Keywords
    data handling; ontologies (artificial intelligence); Descartes method; data object matching; data similarity; digitizing strategy; enhanced Descartes method; heterogeneous data source; ontology; priority queue algorithm; Data Cleaning; Data matching; Digitize; Ontology; the same ontology in heterogeneous data source(SO-HDS);
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Mining and Intelligent Information Technology Applications (ICMiA), 2011 3rd International Conference on
  • Conference_Location
    Macao
  • Print_ISBN
    978-1-4673-0231-9
  • Type

    conf

  • Filename
    6108403