DocumentCode
2643528
Title
A multi-pass algorithm for sorting extremely large data files
Author
Shatnawi, Ali ; Alzahouri, Yathrip
Author_Institution
Dept. of Comput. Eng., Jordan Univ. of Sci. & Technol., Irbid, Jordan
fYear
2015
fDate
7-9 April 2015
Firstpage
79
Lastpage
82
Abstract
An extremely large data file is a file that is greater than the size of the main memory by multiple orders of magnitude. Sorting such a file involves external sorting algorithm, which uses both the hard disk and the main memory to accomplish the sorting task. Since the hard disk is much slower than the main memory, the number of hard disk input/output operations is considered the main performance metric. The new proposed method decreases the total number of input/output operations; hence, it reduces the total time of sorting. The proposed method has less number of disk read/write operations than currently existing approaches. The input/output complexity of the proposed algorithm is analyzed and compared with other algorithms. The proposed algorithm uses a constant merging order at the merge phase of the external sort with multiple passes over each set of data. It is shown that the proposed algorithm has lower sort time requirements than previous approaches.
Keywords
computational complexity; file organisation; merging; sorting; constant merging order; data file sorting; external sorting algorithm; extremely large data file; hard disk; input-output complexity; main memory; multipass algorithm; sort time requirements; Algorithm design and analysis; Communication systems; Complexity theory; Computers; Hard disks; Merging; Sorting; External Merge Sort; Large File Processing; Sorting;
fLanguage
English
Publisher
ieee
Conference_Titel
Information and Communication Systems (ICICS), 2015 6th International Conference on
Conference_Location
Amman
Type
conf
DOI
10.1109/IACS.2015.7103206
Filename
7103206
Link To Document