A reinforcement learning-based Bi-objective routing algorithm for energy harvesting mobile ad-hoc networks

Author

Maleki, Mehdi ; Hakami, Vesal ; Dehghan, Mehdi

Author_Institution

Dept. of Comput. Eng., Amirkabir Univ. of Technol., Tehran, Iran

fYear

2014

fDate

9-11 Sept. 2014

Firstpage

1082

Lastpage

1087

Abstract

Dynamic topology, lack of a fixed infrastructure and limited energy in mobile ad-hoc networks (MANETs) give rise to a challenging operational environment. MANET routing protocols should consider dynamic network changes (e.g., link qualities and nodes´ residual energy) in such circumstances and be able to adapt to these changes to efficiently handle the traffic flows. In this paper, we present a bi-objective intelligent routing protocol that aims at reducing an expected long-run cost function composed of end-to-end delay and the path energy cost. We formulate the routing problem as a Markov decision process (MDP) which captures both the link state dynamics due to node mobility and energy state dynamics due to nodes´ rechargeable energy sources. We propose a reinforcement learning-based algorithm to approximate the optimal routing policy in the absence of a priori knowledge of the system statistics. We compare the performance of the proposed scheme with that obtained from a value-iteration-based algorithm which assumes perfect statistics.

Keywords

Markov processes; energy harvesting; iterative methods; learning (artificial intelligence); mobile ad hoc networks; routing protocols; telecommunication computing; telecommunication network routing; telecommunication power management; telecommunication traffic; MANET routing protocols; MDP fomulation; Markov decision process; a priori knowledge absence; dynamic topology; end-to-end delay; energy harvesting mobile Ad-hoc networks; fixed infrastructure; node mobility; nodes rechargeable energy sources; path energy cost; reinforcement learning-based bi-objective routing algorithm; traffic flows; value-iteration-based algorithm; Ad hoc networks; Cost function; Delays; Energy states; Heuristic algorithms; Mobile computing; Routing; MANET; MDP; Reinforcement Learning; end to end delay; network life time; routing;

fLanguage

English

Publisher

ieee

Conference_Titel

Telecommunications (IST), 2014 7th International Symposium on

Conference_Location

Tehran

Print_ISBN

978-1-4799-5358-5

Type

conf

DOI

10.1109/ISTEL.2014.7000865

Filename

7000865