Online Trajectory Optimization for the UAV-Enabled Base Station Multicasting System Based on Reinforcement Learning

ZHANG Guangchi, YAN Yulin, CUI Miao, CHEN Wei, ZHANG Jing

Citation: ZHANG Guangchi, YAN Yulin, CUI Miao, CHEN Wei, ZHANG Jing. Online Trajectory Optimization for the UAV-Enabled Base Station Multicasting System Based on Reinforcement Learning[J]. Journal of Electronics & Information Technology, 2022, 44(3): 969-975. doi: 10.11999/JEIT210429


doi: 10.11999/JEIT210429
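Funds: The Science and Technology Plan Project of Guangdong Province (2020A050515010, 2021A0505030015, 2019B010119001), The Special Support Plan for High-Level Talents of Guangdong Province (2019TQ05X409)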
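Details
    About the authors:

    ZHANG Guangchi: male, born in 1982, professor. Research interests: next-generation wireless communication technologies.

    YAN Yulin: female, born in 1996, master's student. Research interests: UAV communications and reinforcement learning.

    CUI Miao: female, born in 1978, lecturer. Research interests: next-generation wireless communication technologies.

    CHEN Wei: male, born in 1979, senior engineer. Research interests: geological hazard monitoring and early warning.

    ZHANG Jing: male, born in 1974, professor-level senior engineer. Research interests: next-generation information and communication technologies.

    Corresponding author:

    CUI Miao cuimiao@gdut.edu.cn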

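  • 1) This paper mainly studies how online trajectory optimization of the UAV base station affects multicast communication performance. For simplicity, the energy consumption of the UAV base station is not considered for now; online trajectory optimization that accounts for energy consumption is one direction for future research. In addition, the system model considered here can be extended to scenarios in which multiple UAVs cooperate in multicast communication.
  • 2) To make it easier to develop the theory of the online optimization algorithm and to evaluate its performance, this paper adopts the LoS air-to-ground (ground-to-air) channel model. In future work, the proposed algorithm can be extended directly to other, more accurate channel models.
  • 3) In future research, if multi-UAV cooperative multicast communication is considered, an interference term can be added to the reward function and collision-avoidance constraints can be imposed.
  • CLC number: TN915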
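To make the LoS channel model mentioned in note 2 concrete, the following is a minimal sketch of one common free-space form, in which the power gain falls off with the squared UAV-to-user distance and the achievable rate follows the Shannon formula. The exact expressions and parameter values used in the paper are not given on this page, so the function names, the reference gain `beta0`, and all numbers below are illustrative assumptions.

```python
import math

def los_gain(uav_xy, user_xy, altitude_m, beta0):
    """Assumed free-space LoS power gain: beta0 / d^2, with d the 3-D UAV-user distance."""
    dx = uav_xy[0] - user_xy[0]
    dy = uav_xy[1] - user_xy[1]
    return beta0 / (dx * dx + dy * dy + altitude_m * altitude_m)

def user_rate_bps(uav_xy, user_xy, altitude_m, beta0, tx_power_w, noise_w, bandwidth_hz):
    """Shannon rate of one ground user for the given UAV position."""
    snr = tx_power_w * los_gain(uav_xy, user_xy, altitude_m, beta0) / noise_w
    return bandwidth_hz * math.log2(1.0 + snr)

# Illustrative values only (not taken from the paper).
users = [(0.0, 0.0), (300.0, 400.0)]
rates = [
    user_rate_bps((0.0, 0.0), u, altitude_m=100.0, beta0=1e-3,
                  tx_power_w=0.1, noise_w=1e-11, bandwidth_hz=1e6)
    for u in users
]
print(rates)
```

Under this model the user farther from the UAV always sees the lower rate, which is why the UAV's position matters for how quickly every user can finish receiving the common information.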


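  • Abstract: To minimize the communication delay of an Unmanned Aerial Vehicle (UAV) Base Station (BS) multicasting system, this paper proposes an online trajectory optimization algorithm. In this system, the UAV BS transmits common information to multiple ground users simultaneously, and the ground users' locations are random in each communication task. To ensure that the ground users receive the complete common information, and in view of the UAV's limited energy, the objective is to minimize the average time the UAV BS needs to complete a communication task. The problem is first formulated as a Markov Decision Process (MDP); the communication delay is then incorporated into the action-value function; finally, the Q-Learning algorithm is used to learn and optimize the UAV trajectory online, thereby minimizing the average communication delay. Simulation results show that, compared with other benchmark schemes, the proposed scheme effectively optimizes the trajectory of the UAV multicasting system online and reduces the completion time of communication tasks.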
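The steps named in the abstract (an MDP formulation, delay built into the action value, Q-Learning) can be sketched with a toy tabular example. Everything here is an assumption made for illustration and is not the paper's actual model: the UAV moves along a 5-cell line, two users sit at fixed cells (the paper draws user locations at random for each task), data arrives in integer units given by a hypothetical `RATE_BY_DISTANCE` table, and delay enters through a reward of -1 per time slot, so that each action value estimates the negative number of remaining slots.

```python
import random
from collections import defaultdict

# Toy environment (all values are illustrative assumptions).
N_CELLS = 5                          # UAV positions 0..4 on a line
USER_CELLS = (0, 4)                  # fixed user positions
RATE_BY_DISTANCE = (4, 2, 1, 0, 0)   # data units per slot vs. cell distance
DEMAND = 6                           # data units each user must receive
ACTIONS = (-1, 0, 1)                 # move left, hover, move right
START = (0, (DEMAND,) * len(USER_CELLS))

def step(state, action_idx):
    """Move the UAV, transmit for one slot, and charge one slot of delay."""
    pos, remaining = state
    pos = min(N_CELLS - 1, max(0, pos + ACTIONS[action_idx]))
    remaining = tuple(
        max(0, r - RATE_BY_DISTANCE[abs(pos - u)])
        for r, u in zip(remaining, USER_CELLS)
    )
    done = all(r == 0 for r in remaining)
    return (pos, remaining), -1.0, done

def train(episodes=3000, alpha=0.5, gamma=1.0, epsilon=0.1, seed=0):
    """Tabular Q-Learning with an epsilon-greedy behaviour policy."""
    rng = random.Random(seed)
    q = defaultdict(lambda: [0.0] * len(ACTIONS))
    for _ in range(episodes):
        state = START
        for _ in range(200):  # cap on episode length
            if rng.random() < epsilon:
                a = rng.randrange(len(ACTIONS))
            else:
                a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
            nxt, reward, done = step(state, a)
            target = reward if done else reward + gamma * max(q[nxt])
            q[state][a] += alpha * (target - q[state][a])
            state = nxt
            if done:
                break
    return q

def greedy_completion_time(q, max_slots=50):
    """Follow the greedy policy; return the slots used, or None if unfinished."""
    state = START
    for t in range(1, max_slots + 1):
        a = max(range(len(ACTIONS)), key=lambda i: q[state][i])
        state, _, done = step(state, a)
        if done:
            return t
    return None

q_table = train()
print(greedy_completion_time(q_table))
```

Because every slot costs -1 and there is no discounting, maximizing the action value is the same as minimizing the task completion time in this toy setting. Handling random user locations would require either adding them to the state or averaging over tasks during training.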
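  • Figure 1  UAV base station multicasting system

    Figure 2  Training process of the Q-Learning-based online optimization algorithm

    Figure 3  Comparison of UAV base station trajectories under different schemes

    Figure 4  Comparison of UAV base station trajectories under different schemes

    Figure 5  Comparison of the average time for the UAV base station to complete multicast tasks with different amounts of information under different schemes

    Figure 6  Average time for the UAV base station to complete 100 communication tasks under different schemes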

Publication history
  • Received:  2021-05-19
  • Revised:  2021-09-16
  • Accepted:  2021-09-16
  • Published online:  2021-12-25
  • Issue date:  2022-03-28
