• DocumentCode
    2643528
  • Title
    A multi-pass algorithm for sorting extremely large data files
  • Author
    Shatnawi, Ali; Alzahouri, Yathrip
  • Author_Institution
    Dept. of Comput. Eng., Jordan Univ. of Sci. & Technol., Irbid, Jordan
  • fYear
    2015
  • fDate
    7-9 April 2015
  • Firstpage
    79
  • Lastpage
    82
  • Abstract
    An extremely large data file is a file that is larger than the main memory by multiple orders of magnitude. Sorting such a file requires an external sorting algorithm, which uses both the hard disk and the main memory to accomplish the sorting task. Since the hard disk is much slower than the main memory, the number of hard disk input/output operations is considered the main performance metric. The proposed method performs fewer disk read/write operations than existing approaches and therefore reduces the total sorting time. The input/output complexity of the proposed algorithm is analyzed and compared with that of other algorithms. The proposed algorithm uses a constant merging order in the merge phase of the external sort, with multiple passes over each set of data. It is shown that the proposed algorithm has lower sort time requirements than previous approaches.
  • Keywords
    computational complexity; file organisation; merging; sorting; constant merging order; data file sorting; external sorting algorithm; extremely large data file; hard disk; input-output complexity; main memory; multipass algorithm; sort time requirements; Algorithm design and analysis; Communication systems; Complexity theory; Computers; Hard disks; Merging; Sorting; External Merge Sort; Large File Processing; Sorting;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Information and Communication Systems (ICICS), 2015 6th International Conference on
  • Conference_Location
    Amman
  • Type
    conf
  • DOI
    10.1109/IACS.2015.7103206
  • Filename
    7103206
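
The abstract refers to an external sort whose merge phase uses a constant merging order and makes multiple passes over the data. The sketch below illustrates only that general idea with a textbook external merge sort; it is not the authors' algorithm, whose details are not given in this record. It simulates the disk in memory, and the names `external_sort_sketch`, `memory_size`, and `merge_order` are hypothetical, chosen for illustration.

```python
import heapq


def external_sort_sketch(records, memory_size, merge_order):
    """Simulate a textbook external merge sort entirely in memory.

    Assumptions (not taken from the paper): `memory_size` is the number of
    records that fit in main memory at once, and `merge_order` is the fixed
    number of runs merged together in every merge step.

    Returns (sorted_records, merge_passes).
    """
    if memory_size < 1:
        raise ValueError("memory_size must be at least 1")
    if merge_order < 2:
        raise ValueError("merge_order must be at least 2")

    # Run-formation phase: sort memory-sized chunks into initial runs.
    runs = [
        sorted(records[i:i + memory_size])
        for i in range(0, len(records), memory_size)
    ]

    # Merge phase: each pass reads and rewrites every record once, so the
    # number of passes is a rough proxy for the total disk I/O.
    merge_passes = 0
    while len(runs) > 1:
        runs = [
            list(heapq.merge(*runs[i:i + merge_order]))
            for i in range(0, len(runs), merge_order)
        ]
        merge_passes += 1

    return (runs[0] if runs else []), merge_passes


if __name__ == "__main__":
    data = [9, 4, 7, 1, 8, 2, 6, 3, 5, 0]
    for k in (2, 3):
        result, passes = external_sort_sketch(data, memory_size=2, merge_order=k)
        print(k, passes, result)
```

In this simulation, raising the merge order lowers the number of passes over the data, which is why the number of read/write operations serves as the performance metric for external sorting.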