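Network Selection Algorithm Based on Improved Deep Q-Learning

MA Bin; CHEN Haibo; ZHANG Chao

doi: 10.11999/JEIT200930

1. Chongqing Key Laboratory of Computer Network and Communication Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China
2. Institute of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China

Funding: The Major Project of Science and Technology Research of Chongqing Education Commission (KJZD-M201900602), The Key Project of Science and Technology Research of Chongqing Education Commission (KJZD-M201800603), The Foundation Research and Advanced Exploration Project of Chongqing (CSTC2018jcyjAX0432), The Project of Science Research Innovation of Chongqing Graduate Students (CYS20256)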

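About the authors:
MA Bin: male, born in 1978, professor and doctoral supervisor. His main research interests include heterogeneous wireless networks and cognitive radio networks.

CHEN Haibo: male, born in 1994, master's student. His research interest is heterogeneous wireless networks.

ZHANG Chao: male, born in 1994, master's student. His research interest is heterogeneous wireless networks.

Corresponding author:
CHEN Haibo　860452738@qq.com

CLC number: TN915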
Publication history
- Received: 2020-10-30
- Revised: 2021-05-26
- Published online: 2021-08-24
- Issue date: 2022-01-10


Abstract: In ultra-dense heterogeneous wireless networks with a sleep mechanism, network dynamics increase and handoff performance degrades. To address this problem, a network selection algorithm based on improved deep Q-learning is proposed. First, a deep Q-learning network selection model is constructed from an analysis of the network dynamics. Second, the training samples and weights of the offline training module in this model are transferred to the online decision-making module through transfer learning. Finally, the transferred training samples and weights are used to accelerate the training of the neural network and to obtain the optimal network selection strategy. Experimental results show that the proposed algorithm significantly mitigates the handoff performance degradation that the sleep mechanism causes in highly dynamic networks, and that it reduces the time complexity of the traditional deep Q-learning algorithm during online network selection.

Keywords:
- Ultra-dense heterogeneous wireless network
- Improved deep Q-learning
- Network selection
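
The transfer step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the linear Q-function stands in for the deep Q-network, and the names `LinearQNet`, `train`, `transfer`, and `make_transitions`, the toy reward (the quality score of the chosen network), and all hyperparameters are assumptions made only for illustration. The sketch shows the offline module being trained first, its weights and a subset of its replay samples being copied into the online module, and the online module then being fine-tuned with far fewer steps.

```python
import copy
import random


class LinearQNet:
    """Tiny stand-in for a deep Q-network: Q(s, a) = w[a] . s (assumption)."""

    def __init__(self, n_features, n_actions, seed=0):
        rng = random.Random(seed)
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(n_features)]
                  for _ in range(n_actions)]

    def q_values(self, state):
        return [sum(wi * si for wi, si in zip(row, state)) for row in self.w]

    def update(self, state, action, target, lr=0.05):
        # One gradient step on the squared TD error of the chosen action.
        error = target - self.q_values(state)[action]
        self.w[action] = [wi + lr * error * si
                          for wi, si in zip(self.w[action], state)]


def train(net, replay, steps, gamma=0.5, seed=0):
    """Fit the network on transitions (s, a, r, s') drawn from the replay buffer."""
    rng = random.Random(seed)
    for _ in range(steps):
        s, a, r, s_next = rng.choice(replay)
        target = r + gamma * max(net.q_values(s_next))
        net.update(s, a, target)


def transfer(offline_net, offline_replay, n_samples, seed=0):
    """Build the online module by copying offline weights and replay samples."""
    online_net = copy.deepcopy(offline_net)  # independent copy of the weights
    rng = random.Random(seed)
    k = min(n_samples, len(offline_replay))
    online_replay = rng.sample(offline_replay, k)
    return online_net, online_replay


def make_transitions(n, seed):
    """Hypothetical toy environment: 3 candidate networks, one score each.

    The state holds one quality score per network plus a bias term; the
    reward is the score of the selected network.
    """
    rng = random.Random(seed)

    def random_state():
        return [rng.random(), rng.random(), rng.random(), 1.0]

    transitions = []
    for _ in range(n):
        s = random_state()
        a = rng.randrange(3)
        transitions.append((s, a, s[a], random_state()))
    return transitions


# Offline training module: long training on previously collected samples.
offline_replay = make_transitions(200, seed=1)
offline_net = LinearQNet(n_features=4, n_actions=3)
train(offline_net, offline_replay, steps=5000)

# Online decision module: start from transferred weights and samples,
# add a few fresh observations, and fine-tune briefly.
online_net, online_replay = transfer(offline_net, offline_replay, n_samples=50)
online_replay += make_transitions(20, seed=2)
train(online_net, online_replay, steps=200, seed=3)

# Greedy network selection for a state in which network 0 is clearly best.
state = [0.9, 0.2, 0.1, 1.0]
q = online_net.q_values(state)
best = q.index(max(q))
```

Because the online module starts from the copied weights rather than from a random initialization, the short fine-tuning run only has to adapt to the new observations, which is the intuition behind the reduced online training time reported in the abstract.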

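Figure 1 Flowchart of the proposed algorithm

Figure 2 Terminal mobility model

Figure 3 Simulation scenario of the ultra-dense heterogeneous wireless network

Figure 4 Time overhead of the algorithm

Figure 5 Average signal-to-interference-plus-noise ratio (SINR)

Figure 6 Average throughput

Figure 7 Network call drop rate

Figure 8 Total number of network handoffs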

Table 1 Parameter values of the candidate networks

Network	Received signal strength (dBm)	Path loss (dB)	Noise deviation (dBm)	Throughput (kbps)	Load (count)
MBS1	-85	48	6	1100	68
MBS2	-70	51	9	900	52
SBS1	-78	47	8	2700	16
SBS2	-72	53	8	2600	23
SBS3	-86	50	7	2900	25
SBS4	-95	49	6	3100	20
AP1	-60	45	9	4800	12
AP2	-75	43	6	6400	8
AP3	-71	47	7	5500	10
