• DocumentCode
    713935
  • Title

    Holistic Statistical Open Data integration based on integer linear programming

  • Author

    Berro, Alain ; Megdiche, Imen ; Teste, Olivier

  • Author_Institution
    IRIT, Univ. of Toulouse, Toulouse, France
  • fYear
    2015
  • fDate
    13-15 May 2015
  • Firstpage
    468
  • Lastpage
    479
  • Abstract
    Integrating several Statistical Open Data (SOD) tables is a very promising issue. Various analysis scenarios are hidden behind these statistical data, which makes it important to have a holistic view of them. However, as these data are scattered in several tables, it is a slow and costly process to use existing pairwise schema matching approaches to integrate several schemas of the tables. Hence, we need automatic tools that rapidly converge to a holistic integrated view of data and give a good matching quality. In order to accomplish this objective, we propose a new 0-1 linear program, which automatically resolves the problem of holistic OD integration. It performs global optimal solutions maximizing the profit of similarities between OD graphs. The program encompasses different constraints related to graph structures and matching setup, in particular 1:1 matching. It is solved using a standard solver (CPLEX) and experiments show that it can handle several input graphs and good matching quality compared to existing tools.
  • Keywords
    data integration; graph theory; integer programming; linear programming; statistical analysis; CPLEX; OD graphs; SOD tables; global optimal solutions; graph structures; holistic OD integration; holistic statistical open data integration; integer linear programming; pairwise schema matching approach; standard solver; Companies; Ontologies; Optimization; Plasmas; Polynomials; Production; Transforms; Linear Programming; Schema Matching; Statistical Open Data;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Research Challenges in Information Science (RCIS), 2015 IEEE 9th International Conference on
  • Conference_Location
    Athens
  • Type

    conf

  • DOI
    10.1109/RCIS.2015.7128908
  • Filename
    7128908