Publications

Detailed Information

Rate Maximization with Reinforcement Learning for Time-Varying Energy Harvesting Broadcast Channels

Cited 0 time in Web of Science Cited 5 time in Scopus
Authors

Kim, Heasung; Shin, Wonjae; Yang, Heecheol; Lee, Nayoung; Lee, Jungwoo

Issue Date
2019-12
Publisher
IEEE
Citation
2019 IEEE GLOBAL COMMUNICATIONS CONFERENCE (GLOBECOM), p. 9013583
Abstract
In this paper, we consider a power allocation optimization technique for a time-varying fading broadcast channel in energy harvesting communication systems, in which a transmitter with a rechargeable battery transmits messages to receivers using the harvested energy. We first prove that the optimal online power allocation policy for the sum rate maximization of the transmitter is an increasing function of harvested energy, remaining battery, and each user's channel gain. We then construct an appropriate neural network by relying on increasing behavior of the optimal policy. This two-step approach, by using an effective function approximation as well as providing a fundamental guideline for neural network design, can prevent us from wasting the representational capacity of neural networks. On the basis of the neural network, we apply the policy gradient method to solve the power allocation problem. To validate the performance of our approach, we compare it with the closed-form the optimal policy in a partially observable Markov problem. Through further experiments, it is observed that our online solution achieves a performance close to the theoretical upper bound of the performance in a time-varying fading broadcast channel.
ISSN
2334-0983
URI
https://hdl.handle.net/10371/186923
DOI
https://doi.org/10.1109/GLOBECOM38437.2019.9013583
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share