  • DocumentCode
    1229997
  • Title
    Computational complexity of sorting and joining relations with duplicates
  • Author
    Abdelguerfi, M. ; Sood, Arun K.
  • Author_Institution
    Dept. of Comput. Sci., New Orleans Univ., LA, USA
  • Volume
    3
  • Issue
    4
  • fYear
    1991
  • fDate
    12/1/1991 12:00:00 AM
  • Firstpage
    496
  • Lastpage
    503
  • Abstract
    It is shown that the existence of duplicate values in some attribute columns has a significant impact on the computational complexity of the sorting and joining operations. This is especially true when the number of distinct tuple values is a small fraction of the total number of tuples. The authors characterize a multirelation M(n, L) by its cardinality n and the number of distinct elements L it contains. Under this characterization, the worst-case time complexity of sorting such a multirelation with binary comparisons as basic operations is investigated. Upper and lower bounds on the number of three-branch comparisons needed to sort such a multirelation are established. Thereafter, the methodology used to study the complexity of sorting is applied to the natural join operation. It is shown that the existence of duplicate values in the join attribute columns can be exploited to reduce the computational complexity of the natural join operation.
  • Keywords
    computational complexity; database theory; relational databases; sorting; attribute columns; binary comparisons; cardinality; distinct elements; distinct tuple values; duplicate values; joining; lower bounds; multirelation; natural join operation; relations; three-branch comparisons; upper bounds; worst time complexity; Application software; Computer architecture; Computer science; Joining IEEE; Parallel architectures
  • fLanguage
    English
  • Journal_Title
    Knowledge and Data Engineering, IEEE Transactions on
  • Publisher
    IEEE
  • ISSN
    1041-4347
  • Type
    jour
  • DOI
    10.1109/69.109110
  • Filename
    109110