Title :
An Efficient GPU Implementation of Ant Colony Optimization for the Traveling Salesman Problem
Author :
Uchida, Akira ; Ito, Yu ; Nakano, Kaoru
Author_Institution :
Dept. of Inf. Eng., Hiroshima Univ., Higashi Hiroshima, Japan
Abstract :
Graphics Processing Units (GPUs) are specialized microprocessors that accelerate graphics operations. Recent GPUs, which have many processing units connected with an off-chip global memory, can be used for general purpose parallel computation. Ant Colony Optimization (ACO) approaches have been introduced as ature-inspired heuristics to find good solutions of the Traveling Salesman Problem (TSP). In ACO approaches, a number of ants traverse the cities of the TSP to find better solutions of the TSP. The ants randomly select next visiting cities based on the probabilities determined by total amounts of their pheromone spread on routes. The main contribution of this paper is to present sophisticated and efficient implementation of one of the ACO approaches on the GPU. In our implementation, we have considered many programming issues of the GPU architecture including coalesced access of global memory, shared memory bank conflicts, etc. In particular, we present a very efficient method for random selection of next cities by a number of ants. Our new method uses iterative random trial which can find next cities in few computational costs with high probability. The experimental results on NVIDIA GeForce GTX 580 show that our implementation for 1002 cities runs in 8.71 seconds, while a conventional CPU implementation runs in 381.95 seconds. Thus, our GPU implementation attains a speed-up factor of 43.47.
Keywords :
ant colony optimisation; graphics processing units; travelling salesman problems; GPU architecture; NVIDIA GeForce GTX 580; ant colony optimization; efficient GPU implementation; general purpose parallel computation; graphics operations; graphics processing units; iterative random trial; nature inspired heuristics; off chip global memory; pheromone; programming; random selection; shared memory bank conflicts; specialized microprocessors; traveling salesman problem; Ant colony optimization; Arrays; Cities and towns; Graphics processing units; Indexes; Instruction sets; Ant Colony Optimization; CUDA; GPU; Parallel Processing; Traveling Salesman Problem;
Conference_Titel :
Networking and Computing (ICNC), 2012 Third International Conference on
Conference_Location :
Okinawa
Print_ISBN :
978-1-4673-4624-5
DOI :
10.1109/ICNC.2012.22