• DocumentCode
    3206527
  • Title

    I/O-Optimal Distribution Sweeping on Private-Cache Chip Multiprocessors

  • Author

    Ajwani, Deepak ; Sitchinava, Nodari ; Zeh, Norbert

  • Author_Institution
    Centre for Unified Comput., Univ. Coll. Cork, Cork, Ireland
  • fYear
    2011
  • fDate
    16-20 May 2011
  • Firstpage
    1114
  • Lastpage
    1123
  • Abstract
    The parallel external memory (PEM) model has been used as a basis for the design and analysis of a wide range of algorithms for private-cache multi-core architectures. As a tool for developing geometric algorithms in this model, a parallel version of the I/O-efficient distribution sweeping framework was introduced recently, and a number of algorithms for problems on axis-aligned objects were obtained using this framework. The obtained algorithms were efficient but not optimal. In this paper, we improve the framework to obtain algorithms with the optimal I/O complexity of O(sortp(N) + K/PB) for a number of problems on axis aligned objects; P denotes the number of cores/processors, B denotes the number of elements that fit in a cache line, N and K denote the sizes of the input and output, respectively, and sortp(N) denotes the I/O complexity of sorting N items using P processors in the PEM model. To obtain the above improvement, we present a new one-dimensional batched range counting algorithm on a sorted list of ranges and points that achieves an I/O complexity of 0((N + K)/PB), where K is the sum of the counts of all the ranges. The key to achieving efficient load balancing among the processors in this algorithm is a new method to count the output without enumerating it, which might be of independent interest.
  • Keywords
    computational geometry; multiprocessing systems; I/O complexity; I/O-efficient distribution sweeping framework; I/O-optimal distribution sweeping; geometric algorithm; load balancing; one-dimensional batched range counting algorithm; parallel external memory model; private-cache chip multiprocessors; private-cache multicore architecture; Compaction; Complexity theory; Computational modeling; Load management; Multicore processing; Program processors; Sorting;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Parallel & Distributed Processing Symposium (IPDPS), 2011 IEEE International
  • Conference_Location
    Anchorage, AK
  • ISSN
    1530-2075
  • Print_ISBN
    978-1-61284-372-8
  • Electronic_ISBN
    1530-2075
  • Type

    conf

  • DOI
    10.1109/IPDPS.2011.106
  • Filename
    6012918