• DocumentCode
    1763861
  • Title

    Delay-Aware Two-Hop Cooperative Relay Communications via Approximate MDP and Stochastic Learning

  • Author

    Rui Wang ; Lau, Vincent K. N.

  • Author_Institution
    South Univ. of Sci. & Technol. of China, Shenzhen, China
  • Volume
    59
  • Issue
    11
  • fYear
    2013
  • fDate
    Nov. 2013
  • Firstpage
    7645
  • Lastpage
    7670
  • Abstract
    In this paper, a low-complexity delay-aware cross-layer scheduling algorithm for two-hop relay communication systems is proposed. The complex interactions of the queues at the source node and the M relay nodes (RSs) are modeled as an infinite horizon average reward Markov decision process (MDP), whose state space involves the joint queue state information (QSI) of the queues at the source node and the M RSs as well as the joint channel state information (CSI) of all S-R and R-D links. To address the curse of dimensionality, an equivalent MDP formulation is first proposed, where the system state depends only on global QSI. Furthermore, using approximate MDP and stochastic learning, an auction-based distributed online learning algorithm is derived, where each node iteratively estimates a per-node value function based on real-time observations of the local CSI and local QSI as well as signaling between relays. The combined distributed learning converges almost surely to a global optimal solution for large arrivals. Finally, it is showed by simulations that the proposed scheme achieves significant gain compared with various baselines such as the conventional CSIT-only control and the throughput optimal control (in stability sense).
  • Keywords
    Markov processes; delays; learning (artificial intelligence); queueing theory; relay networks (telecommunication); stochastic processes; telecommunication computing; CSI; Markov decision process; QSI; RS; approximate MDP; channel state information; delay aware two hop cooperative relay communications; distributed online learning algorithm; queue state information; relay nodes; source node; stochastic learning; Approximation algorithms; Delays; Joints; Optimization; Relays; Stability analysis; Stochastic processes; Cooperative communications; delay-aware resource allocation; distributive algorithm; stochastic optimization;
  • fLanguage
    English
  • Journal_Title
    Information Theory, IEEE Transactions on
  • Publisher
    ieee
  • ISSN
    0018-9448
  • Type

    jour

  • DOI
    10.1109/TIT.2013.2279895
  • Filename
    6587286