• DocumentCode
    1199741
  • Title

    Rank Aggregation for Automatic Schema Matching

  • Author

    Domshlak, Carmel ; Gal, Avigdor ; Roitman, Haggai

  • Author_Institution
    Fac. of Ind. Eng. & Manage., Technion-Israel Inst. of Technol., Haifa
  • Volume
    19
  • Issue
    4
  • fYear
    2007
  • fDate
    4/1/2007 12:00:00 AM
  • Firstpage
    538
  • Lastpage
    553
  • Abstract
    Schema matching is a basic operation of data integration, and several tools for automating it have been proposed and evaluated in the database community. Research in this area reveals that there is no single schema matcher that is guaranteed to succeed in finding a good mapping for all possible domains and, thus, an ensemble of schema matchers should be considered. In this paper, we introduce schema metamatching, a general framework for composing an arbitrary ensemble of schema matchers and generating a list of best ranked schema mappings. Informally, schema metamatching stands for computing a "consensus" ranking of alternative mappings between two schemata, given the "individual" graded rankings provided by several schema matchers. We introduce several algorithms for this problem, varying from adaptations of some standard techniques for general quantitative rank aggregation to novel techniques specific to the problem of schema matching, and to combinations of both. We provide a formal analysis of the applicability and relative performance of these algorithms and evaluate them empirically on a set of real-world schemata
  • Keywords
    data integrity; database management systems; pattern matching; arbitrary ensemble; automatic schema metamatching; consensus rank aggregation; data integration; database community; individual graded rankings; quantitative rank aggregation; ranked schema mappings; Algorithm design and analysis; Databases; Floods; HTML; Humans; Large-scale systems; Performance analysis; Semantic Web; Web services; XML; Database integration; rank aggregation.; schema matching;
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    1041-4347
  • Type

    jour

  • DOI
    10.1109/TKDE.2007.1010
  • Filename
    4118710