• DocumentCode
    2079265
  • Title

    SMARTINT: A system for answering queries over web databases using attribute dependencies

  • Author

    Gummadi, Ravi ; Khulbe, Anupam ; Kalavagattu, Aravind ; Salvi, Sanil ; Kambhampati, Subbarao

  • Author_Institution
    Dept. of Comput. Sci., Arizona State Univ. Tempe, Tempe, AZ, USA
  • fYear
    2010
  • fDate
    1-6 March 2010
  • Firstpage
    1149
  • Lastpage
    1152
  • Abstract
    Many web databases can be seen as providing partial and overlapping information about entities in the world. To answer queries effectively, we need to integrate the information about individual entities that is fragmented over multiple sources. At first blush, this is just the inverse of the traditional database normalization problem: rather than going from a universal relation to normalized tables, we want to reconstruct the universal relation from the given tables (sources). The standard way of reconstructing the entities involves joining the tables. Unfortunately, because the sources are populated in an autonomous and decentralized way, they often lack Primary Key-Foreign Key relations. While tables do share attributes, naive joins over these shared attributes can reconstruct many spurious entities, seriously compromising precision. Our system, SMARTINT, addresses the problem of data integration in such scenarios. Given a query, our system uses Approximate Functional Dependencies (AFDs) to piece together a tree of relevant tables and schemas for joining them. The result tuples produced by our system strike a favorable balance between precision and recall.
  • Keywords
    Internet; database management systems; query processing; SMARTINT system; Web database; approximate functional dependencies; attribute dependencies; data integration; database normalization problem; normalized tables; primary key-foreign key relations; query answering system; universal relation; Computer science; Databases; Engine cylinders; Intrusion detection; Vehicles;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Data Engineering (ICDE), 2010 IEEE 26th International Conference on
  • Conference_Location
    Long Beach, CA
  • Print_ISBN
    978-1-4244-5445-7
  • Electronic_ISBN
    978-1-4244-5444-0
  • Type

    conf

  • DOI
    10.1109/ICDE.2010.5447729
  • Filename
    5447729
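
The abstract's point about spurious entities can be illustrated with a small sketch. This is not the SMARTINT implementation: the tables, attribute names, and the helper functions `naive_join` and `afd_confidence` are hypothetical, and the confidence measure used here (the fraction of rows that survive when each determinant value is mapped to its most common dependent value) is one common way to score an approximate functional dependency, assumed for illustration only.

```python
from collections import Counter, defaultdict


def naive_join(left, right, attr):
    """Join two lists of dict rows on a shared, non-key attribute."""
    index = defaultdict(list)
    for row in right:
        index[row[attr]].append(row)
    return [{**l, **r} for l in left for r in index[l[attr]]]


def afd_confidence(rows, lhs, rhs):
    """Fraction of rows kept if each lhs value maps to its most common rhs value."""
    groups = defaultdict(Counter)
    for row in rows:
        groups[row[lhs]][row[rhs]] += 1
    kept = sum(max(counts.values()) for counts in groups.values())
    return kept / len(rows)


# Two hypothetical sources that share "model" but have no key relationship.
cars = [
    {"vin": "V1", "model": "Civic", "make": "Honda", "price": 9000},
    {"vin": "V2", "model": "Civic", "make": "Honda", "price": 11000},
    {"vin": "V3", "model": "Accord", "make": "Honda", "price": 15000},
]
reviews = [
    {"model": "Civic", "trim": "LX", "rating": 4},
    {"model": "Civic", "trim": "EX", "rating": 5},
    {"model": "Accord", "trim": "LX", "rating": 4},
]

# Every Civic pairs with every Civic review, so the join yields more
# tuples than there are cars; the extras are spurious entities.
joined = naive_join(cars, reviews, "model")
print(len(cars), len(joined))

# "model" determines "make" exactly here, but only approximately determines "price".
print(afd_confidence(cars, "model", "make"))
print(afd_confidence(cars, "model", "price"))
```

In this toy data the join on `model` cannot tell which review belongs to which car, which is the precision problem the abstract describes. A dependency score like the one above is one signal a system could use to decide which attributes are safe to join on.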