• DocumentCode
    1987628
  • Title

    Q-learning cell selection for femtocell networks: Single- and multi-user case

  • Author

    Dhahri, C. ; Ohtsuki, Tomoaki

  • Author_Institution
    Dept. of Comput. & Inf. Sci., Keio Univ., Yokohama, Japan
  • fYear
    2012
  • fDate
    3-7 Dec. 2012
  • Firstpage
    4975
  • Lastpage
    4980
  • Abstract
    In this paper, we focus on user-centered handover decision making in open-access non-stationary femtocell networks. Traditionally, such handover mechanism is usually based on a measured channel/cell quality metric such as the channel capacity (between the user and the target cell). However, the throughput experienced by the user is time-varying because of the channel condition, i.e. owing to the propagation effects or receiver location. In this context, user decision can depend not only on the current state of the network, but also on the future possible states (horizon). To this end, we need to implement a learning algorithm that can predict, based on the past experience, the best performing cell in the future. We present in this paper a reinforcement learning (RL) framework as a generic solution for the cell selection problem in a non-stationary femtocell network that selects, without prior knowledge about the environment, a target cell by exploring past cells behavior and predicting their potential future state based on Q-learning algorithm. Our algorithm aims at balancing the number of handovers and the user capacity taking into account the dynamic change of the environment. Simulation results demonstrate that our solution offers an opportunistic-like capacity performance with less number of handovers.
  • Keywords
    decision making; femtocellular radio; mobility management (mobile radio); Q-learning algorithm; Q-learning cell selection; capacity taking; channel-cell quality metric such; femtocell networks; handover mechanism; learning algorithm; multiuser case; nonstationary femtocell network; open-access nonstationary femtocell networks; opportunistic-like capacity performance; propagation effects; reinforcement learning; single user case; user-centered handover decision making;
  • fLanguage
    English
  • Publisher
    ieee
  • Conference_Titel
    Global Communications Conference (GLOBECOM), 2012 IEEE
  • Conference_Location
    Anaheim, CA
  • ISSN
    1930-529X
  • Print_ISBN
    978-1-4673-0920-2
  • Electronic_ISBN
    1930-529X
  • Type

    conf

  • DOI
    10.1109/GLOCOM.2012.6503908
  • Filename
    6503908