مرکز منطقه ای اطلاع رساني علوم و فناوري - Architectural Support for Reducing Parallel Processing Overhead in an Embedded Multiprocessor

DocumentCode :

2372322

Title :

Architectural Support for Reducing Parallel Processing Overhead in an Embedded Multiprocessor

Author :

Wang, Jian ; Sohl, Joar ; Liu, Dake

Author_Institution :

Dept. of Electr. Eng., Linkoping Univ., Linköping, Sweden

fYear :

2010

fDate :

11-13 Dec. 2010

Firstpage :

Lastpage :

Abstract :

The host-multi-SIMD chip multiprocessor (CMP) architecture has been proved to be an efficient architecture for high performance signal processing which explores both task level parallelism by multi-core processing and data level parallelism by SIMD processors. Different from the cache-based memory subsystem in most general purpose processors, this architecture uses on-chip scratchpad memory (SPM) as processor local data buffer and allows software to explicitly control the data movements in the memory hierarchy. This SPM-based solution is more efficient for predictable signal processing in embedded systems where data access patterns are known at design time. The predictable performance is especially important for real time signal processing. According to Amdahl´s law, the nonparallelizable part of an algorithm has critical impact on the overall performance. Implementing an algorithm in a parallel platform usually produces control and communication overhead which is not parallelizable. This paper presents the architectural support in an embedded multiprocessor platform to maximally reduce the parallel processing overhead. The effectiveness of these architecture designs in boosting parallel performance is evaluated by an implementation example of 64×64 complex matrix multiplication. The result shows that the parallel processing overhead is reduced from 369% to 28%.

Keywords :

digital signal processing chips; embedded systems; multiprocessing systems; parallel processing; Amdahl law; cache-based memory subsystem; data level parallelism; embedded multiprocessor; high performance signal processing; host-multi-SIMD chip multiprocessor; multicore processing; on-chip scratchpad memory; parallel processing overhead reduction; Communication overhead; Control overhead; Matrix multiplication; Multiprocessor; Parallel DSP;

fLanguage :

English

Publisher :

ieee

Conference_Titel :

Embedded and Ubiquitous Computing (EUC), 2010 IEEE/IFIP 8th International Conference on

Conference_Location :

Hong Kong

Print_ISBN :

978-1-4244-9719-5

Electronic_ISBN :

978-0-7695-4322-2

Type :

conf

DOI :

10.1109/EUC.2010.17

Filename :

5703497

Link To Document :

https://search.ricest.ac.ir/dl/search/defaultta.aspx?DTC=49&DC=2372322