Title :
Implementation of multiple-precision parallel division and square root on distributed-memory parallel computers
Author :
Takahashi, Daisuke
Author_Institution :
Dept. of Inf. & Comput. Sci., Saitama Univ., Urawa, Japan
Abstract :
We present efficient parallel algorithms for multiple-precision division and square root operation of more than several million decimal digits on distributed-memory parallel computers. It is well known that multiple-precision division and square root operations can be reduced to multiple-precision addition, subtraction, and multiplication by using Newton iteration. Because a key operation in fast multiple-precision arithmetic is multiplication, a parallel implementation of floating-point real FFT-based multiplication is used. We also parallelized an operation of releasing propagated carries and borrows in multiple-precision addition, subtraction and multiplication. In parallel implementation of Newton iteration based multiple-precision division and square root operation, there is a tradeoff between load balance and communication overhead on distributed-memory parallel computers. An efficient data distribution for multiple-precision division and square root operation by using Newton iteration is given with confirmation of the theoretical analysis. We achieved high performance multiple-precision calculation of division and square root operation on distributed-memory parallel computers
Keywords :
Newton method; distributed memory systems; fast Fourier transforms; floating point arithmetic; mathematics computing; parallel algorithms; parallel machines; resource allocation; FFT; Newton iteration; addition; communication overhead; data distribution; distributed-memory parallel computers; fast Fourier transform; floating-point arithmetic; high performance; load balancing; multiple-precision arithmetic; multiple-precision parallel division; multiplication; parallel algorithms; square root operation; subtraction; Arithmetic; Concurrent computing; Distributed computing; Fast Fourier transforms; High performance computing; Packaging machines; Parallel algorithms; Parallel processing; Reduced instruction set computing; Software packages;
Conference_Titel :
Parallel Processing, 2000. Proceedings. 2000 International Workshops on
Conference_Location :
Toronto, Ont.
Print_ISBN :
0-7695-0771-9
DOI :
10.1109/ICPPW.2000.869107