Muestra métricas de impacto externas asociadas a la publicación. Para mayor detalle:
| Indexado |
|
||||
| DOI | 10.1109/WCNC51071.2022.9771776 | ||||
| Año | 2022 | ||||
| Tipo | proceedings paper |
Citas Totales
Autores Afiliación Chile
Instituciones Chile
% Participación
Internacional
Autores
Afiliación Extranjera
Instituciones
Extranjeras
In the wireless communication, deep reinforcement learning (DRL) techniques promise performance optimizations at a low cost. Considering the time-varying property of the wireless downlink channels, this paper proposes a deep deterministic policy gradient (DDPG) approach and a hierarchical DDPG (h-DDPG) approach to optimize the sum-rate at the user equipment (UE) side, by jointly designing the power control and the beam-forming at the base station (BS). Our results demonstrate that the proposed DDPG enables continuous data representation through the deterministic policy functions, while the proposed h-DDPG is able to mitigate the sparse reward problem. Both of the two DRL algorithms are superior to the conventional deep Q-learning (DQN) algorithm, in terms of improving the communication performance over the time-varying wireless downlink channels.
| Ord. | Autor | Género | Institución - País |
|---|---|---|---|
| 1 | Liu, Mengfan | - |
Imperial College London - Reino Unido
Imperial Coll London - Reino Unido |
| 2 | Wang, Rui | - |
Tongji University - China
Tongji Univ - China |
| 3 | Xing, Zhe | - |
Tongji University - China
Tongji Univ - China |
| 4 | Soto, Ismael | Hombre |
Universidad de Santiago de Chile - Chile
|
| 5 | IEEE | Corporación |
| Fuente |
|---|
| Fondef |
| National Natural Science Foundation of China |
| Natural Science Foundation of Shanghai |
| Fondo de Fomento al Desarrollo Científico y Tecnológico |
| Science and Technology Innovation Plan Of Shanghai Science and Technology Commission |
| National Science Foundation China |
| Shanghai Science and Technology Innovation Action Plan Project |
| Agradecimiento |
|---|
| VIII. ACKNOWLEDGEMENTS The work of was Rui Wang was supported by the National Science Foundation China under Grant 61771345, Shanghai Science and Technology Innovation Action Plan Project No. 21220713100, and the Natural Science Foundation of Shanghai under Grant 22ZR1465100. The work of Ismael Soto was supported by Project Dicyt-062117S and FONDEF No. ID21I10191. |
| The work of was Rui Wang was supported by the National Science Foundation China under Grant 61771345, Shanghai Science and Technology Innovation Action Plan Project No. 21220713100, and the Natural Science Foundation of Shanghai under Grant 22ZR1465100. The work of Ismael Soto was supported by Project Dicyt-062117S and FONDEF No. ID21I10191. |