• DocumentCode
    3579812
  • Title

    Study of BDRM Asynchronous Parallel Computing Model Based on Multiple CUDA Streams

  • Author

    Xuehai Sun ; Lianglong Da ; Yuyang Li

  • Author_Institution
    Navy Underwater Battlefield Environ. Instn., Navy Submarine Acad., Qingdao, China
  • Volume
    1
  • fYear
    2014
  • Firstpage
    181
  • Lastpage
    184
  • Abstract
    To speed up the computation of ocean acoustic fields with Beam-Displacement Ray-Mode (BDRM) theory, a BDRM parallel computing model based on the Compute Unified Device Architecture (CUDA) is designed, exploiting the parallel computing power of the GPU and the characteristics of BDRM theory. The emphasis is on how to compute eigenvalues and eigenfunctions in parallel within the CUDA programming model. Simulation results show that, as the sound source frequency increases, the CPU elapsed time grows quickly while the GPU elapsed time grows slowly. At the same source frequency, the speedup is larger in blue water than in shallow water: at 1000 Hz, the speedups are 7.84× in shallow water and 33.36× in blue water. For large-scale computations, the CUDA-based BDRM parallel model is more efficient than the CPU-based BDRM serial model. It can meet the requirements of fast ocean acoustic field forecasting and of engineering applications.
  • Keywords
    eigenvalues and eigenfunctions; graphics processing units; parallel architectures; parallel programming; BDRM asynchronous parallel computing model; BDRM serial computing model; CPU elapsed time; CUDA programming; Compute Unified Device Architecture; GPU elapsed time; GPU parallel computing ability; beam-displacement ray-mode theory; eigenfunction; eigenvalue; engineering application; graphics processing unit; multiple CUDA stream; ocean acoustic field; sound source frequency; Acoustics; Computational modeling; Eigenvalues and eigenfunctions; Graphics processing units; Instruction sets; Oceans; Parallel processing; BDRM; CUDA; acoustic field; eigenfunction; eigenvalue; parallel computing;
  • fLanguage
    English
  • Publisher
    IEEE
  • Conference_Titel
    Computational Intelligence and Design (ISCID), 2014 Seventh International Symposium on
  • Print_ISBN
    978-1-4799-7004-9
  • Type

    conf

  • DOI
    10.1109/ISCID.2014.104
  • Filename
    7064168