• DocumentCode
    1787263
  • Title

    A reinforcement learning-based Bi-objective routing algorithm for energy harvesting mobile ad-hoc networks

  • Author

    Maleki, Mehdi ; Hakami, Vesal ; Dehghan, Mehdi

  • Author_Institution
    Dept. of Comput. Eng., Amirkabir Univ. of Technol., Tehran, Iran
  • fYear
    2014
  • fDate
    9-11 Sept. 2014
  • Firstpage
    1082
  • Lastpage
    1087
  • Abstract
    Dynamic topology, lack of a fixed infrastructure and limited energy in mobile ad-hoc networks (MANETs) give rise to a challenging operational environment. MANET routing protocols should consider dynamic network changes (e.g., link qualities and nodes´ residual energy) in such circumstances and be able to adapt to these changes to efficiently handle the traffic flows. In this paper, we present a bi-objective intelligent routing protocol that aims at reducing an expected long-run cost function composed of end-to-end delay and the path energy cost. We formulate the routing problem as a Markov decision process (MDP) which captures both the link state dynamics due to node mobility and energy state dynamics due to nodes´ rechargeable energy sources. We propose a reinforcement learning-based algorithm to approximate the optimal routing policy in the absence of a priori knowledge of the system statistics. We compare the performance of the proposed scheme with that obtained from a value-iteration-based algorithm which assumes perfect statistics.
  • Keywords
    Markov processes; energy harvesting; iterative methods; learning (artificial intelligence); mobile ad hoc networks; routing protocols; telecommunication computing; telecommunication network routing; telecommunication power management; telecommunication traffic; MANET routing protocols; MDP fomulation; Markov decision process; a priori knowledge absence; dynamic topology; end-to-end delay; energy harvesting mobile Ad-hoc networks; fixed infrastructure; node mobility; nodes rechargeable energy sources; path energy cost; reinforcement learning-based bi-objective routing algorithm; traffic flows; value-iteration-based algorithm; Ad hoc networks; Cost function; Delays; Energy states; Heuristic algorithms; Mobile computing; Routing; MANET; MDP; Reinforcement Learning; end to end delay; network life time; routing;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Telecommunications (IST), 2014 7th International Symposium on
  • Conference_Location
    Tehran
  • Print_ISBN
    978-1-4799-5358-5
  • Type

    conf

  • DOI
    10.1109/ISTEL.2014.7000865
  • Filename
    7000865