مرکز منطقه ای اطلاع رساني علوم و فناوري - Sesame: A User-Transparent Optimizing Framework for Many-Core Processors

DocumentCode :

611009

Title :

Sesame: A User-Transparent Optimizing Framework for Many-Core Processors

Author :

Jianbin Fang ; Varbanescu, Ana Lucia ; Sips, Henk

Author_Institution :

Parallel & Distrib. Syst. Group, Delft Univ. of Technol., Delft, Netherlands

fYear :

2013

fDate :

13-16 May 2013

Firstpage :

Lastpage :

Abstract :

With the integration of more computational cores and deeper memory hierarchies on modern processors, the performance gap between naively parallel zed code and optimized code becomes much larger than ever before. Very often, bridging the gap involves architecture-specific optimizations. These optimizations are difficult to implement by application programmers, who typically focus on the basic functionality of their code. Therefore, in this thesis, I focus on answering the following research question: "How can we address architecture-specific optimizations in a programmer-friendly way?\´\´ As an answer, I propose an optimizing framework for parallel applications running on many-core processors (textit{Sesame}). Taking a simple parallel zed code provided by the application programmers as input, Sesame chooses and applies the most suitable architecture-specific optimizations, aiming to improve the overall application performance in a user-transparent way. In this short paper, I present the motivation for designing and implementing Sesame, its structure and its modules. Furthermore, I describe the current status of Sesame, discussing our promising results in source-to-source vectorization, automated usage of local memory, and auto-tuning for implementation-specific parameters. Finally, I discuss my work-in-progress and sketch my ideas for finalizing Sesame\´s development and testing.

Keywords :

multiprocessing systems; parallel processing; Sesame; architecture-specific optimizations; computational cores; implementation-specific parameter auto tuning; local memory automated usage; many-core processor; memory hierarchy; optimized code; parallel applications; parallel zed code; source-to-source vectorization; user-transparent optimizing framework; Computational modeling; Graphics processing units; Kernel; Optimization; Parallel processing; Programming; Many-Core Processors; OpenCL; Optimization;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Cluster, Cloud and Grid Computing (CCGrid), 2013 13th IEEE/ACM International Symposium on

Conference_Location :

Delft

Print_ISBN :

978-1-4673-6465-2

Type :

conf

DOI :

10.1109/CCGrid.2013.79

Filename :

6546061

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=611009