• DocumentCode
    2507539
  • Title

    Discovery of high-dimensional inclusion dependencies

  • Author

    Koeller, Andreas ; Rundensteiner, Elke A.

  • Author_Institution
    Dept. of Comput. Sci., Montclair State Univ., Upper Montclair, NJ, USA
  • fYear
    2003
  • fDate
    5-8 March 2003
  • Firstpage
    683
  • Lastpage
    685
  • Abstract
    Determining relationships such as functional or inclusion dependencies within and across databases is important for many applications in information integration. When such information is not available as explicit meta data, it is possible to discover potential dependencies from the source database extents. However, the complexity of such discovery problems is typically exponential in the number of attributes. We have developed an algorithm for the discovery of inclusion dependencies across high-dimensional relations in the order of 100 attributes. This algorithm is the first to efficiently solve the inclusion-dependency discovery problem. This is achieved by mapping it into a progressive series of clique-finding problems in k-uniform hypergraphs and solving those. Extensive experimental studies confirm the algorithm´s efficiency on a variety of real-world data sets.
  • Keywords
    computational complexity; data mining; graph theory; meta data; set theory; clique-finding problem; computational complexity; databases; graph theory; high-dimensional inclusion dependencies discovery; hypergraphs; information integration; meta data; Application software; Association rules; Companies; Computer science; Data mining; Databases; Diseases; Instruments; Medical treatment; Redundancy;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Data Engineering, 2003. Proceedings. 19th International Conference on
  • Print_ISBN
    0-7803-7665-X
  • Type

    conf

  • DOI
    10.1109/ICDE.2003.1260834
  • Filename
    1260834