• DocumentCode
    2799721
  • Title

    Benchmarking XML-Schema Matching Algorithms for Improving Automated Tuning

  • Author

    Boukhebouze, Mohamed ; Rifaieh, Rami ; Benharkat, Nabila ; Amghar, Youssef

  • Author_Institution
    Nat. Inst. of Appl. Sci. of Lyon, Lyon
  • fYear
    2007
  • fDate
    13-16 May 2007
  • Firstpage
    917
  • Lastpage
    925
  • Abstract
    Several matching algorithms have recently been developed to automate or semi-automate the discovery of correspondences between XML schemas. These algorithms use a wide range of approaches and matching techniques covering linguistic similarity, structural similarity, constraints, etc. The final matching arithmetically combines the different results stemming from these techniques. The aggregation of the results often uses many parameters and weights that must be adjusted manually. Generally, this task is performed by human experts and requires a perfect understanding of the matching algorithm. In order to reduce human intervention and improve matching quality, we suggest automating the tuning of the various structural parameters used within XML-Schema matching algorithms. In this work, we offer a benchmark, for three tools, that seeks mathematical relations between parameter values and schema topology. Consequently, we propose an algorithm for tuning these parameters for the studied tools.
  • Keywords
    XML; XML-schema matching algorithm benchmarking; automated tuning; human intervention; linguistic similarity; mathematical relations; Floods; Humans; Industrial relations; Information systems; Structural engineering; Supercomputers; Terminology; Topology; Warehousing; Automatic Tuning; Benchmark; Matching; XML Schemas
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computer Systems and Applications, 2007. AICCSA '07. IEEE/ACS International Conference on
  • Conference_Location
    Amman
  • Print_ISBN
    1-4244-1030-4
  • Electronic_ISBN
    1-4244-1031-2
  • Type

    conf

  • DOI
    10.1109/AICCSA.2007.370741
  • Filename
    4231069