Title :
A reinforcement learning-based Bi-objective routing algorithm for energy harvesting mobile ad-hoc networks
Author :
Maleki, Mehdi ; Hakami, Vesal ; Dehghan, Mehdi
Author_Institution :
Dept. of Comput. Eng., Amirkabir Univ. of Technol., Tehran, Iran
Abstract :
Dynamic topology, lack of a fixed infrastructure and limited energy in mobile ad-hoc networks (MANETs) give rise to a challenging operational environment. MANET routing protocols should consider dynamic network changes (e.g., link qualities and nodes´ residual energy) in such circumstances and be able to adapt to these changes to efficiently handle the traffic flows. In this paper, we present a bi-objective intelligent routing protocol that aims at reducing an expected long-run cost function composed of end-to-end delay and the path energy cost. We formulate the routing problem as a Markov decision process (MDP) which captures both the link state dynamics due to node mobility and energy state dynamics due to nodes´ rechargeable energy sources. We propose a reinforcement learning-based algorithm to approximate the optimal routing policy in the absence of a priori knowledge of the system statistics. We compare the performance of the proposed scheme with that obtained from a value-iteration-based algorithm which assumes perfect statistics.
Keywords :
Markov processes; energy harvesting; iterative methods; learning (artificial intelligence); mobile ad hoc networks; routing protocols; telecommunication computing; telecommunication network routing; telecommunication power management; telecommunication traffic; MANET routing protocols; MDP fomulation; Markov decision process; a priori knowledge absence; dynamic topology; end-to-end delay; energy harvesting mobile Ad-hoc networks; fixed infrastructure; node mobility; nodes rechargeable energy sources; path energy cost; reinforcement learning-based bi-objective routing algorithm; traffic flows; value-iteration-based algorithm; Ad hoc networks; Cost function; Delays; Energy states; Heuristic algorithms; Mobile computing; Routing; MANET; MDP; Reinforcement Learning; end to end delay; network life time; routing;
Conference_Titel :
Telecommunications (IST), 2014 7th International Symposium on
Conference_Location :
Tehran
Print_ISBN :
978-1-4799-5358-5
DOI :
10.1109/ISTEL.2014.7000865