Joint Optimization of Edge Selection and Resource Allocation in Digital Twin-assisted Federated Learning
-
摘要: 在基于联邦学习的智能驾驶中,智能网联汽车(ICV)的资源限制和可能出现的设备故障会导致联邦学习训练精度下降、时延和能耗增加等问题。为此该文提出数字孪生辅助联邦学习中的边缘选择和资源分配优化方案。该方案首先提出数字孪生辅助联邦学习机制,使得ICV能够选择在本地或利用其数字孪生体参与联邦学习。其次,通过构建数字孪生辅助联邦学习的计算和通信模型,建立以最小化累积训练时延和能耗为目标的边缘选择和资源分配联合优化问题,并将其转化为部分可观测的马尔可夫决策过程。最后,提出基于多智能体参数化Q网络(MPDQN)的边缘选择和资源分配算法,用于学习近似最优的边缘选择和资源分配策略,以实现联邦学习累积时延和能耗最小化。仿真结果表明,所提算法在保证模型精度的同时,有效降低联邦学习累积训练时延和能耗。Abstract: In intelligent driving based on federated learning, the resource constraints of Intelligent Connected Vehicle (ICV) and possible device failures will lead to the decrease of the precision of federated learning training and the increase of delay and energy consumption. Therefore, an optimization scheme of edge selection and resource allocation in digital twin-assisted federated learning is proposed. Firstly, a digital twin-assisted federated learning mechanism is proposed, allowing ICV to choose to participate in federated learning locally or through its digital twin. Secondly, by constructing a computational and communication model for digital twin-assisted federated learning, an edge selection and computing resource allocation joint optimization problem is established with the objective of minimizing cumulative training delay and energy consumption, and is transformed into a partially observable Markov decision process. Finally, an edge selection and resource allocation algorithm based on Multi-agent Parametrized Deep Q-Networks (MPDQN) is proposed to learn approximately optimal edge selection and resource allocation strategies to minimize federated learning cumulative delay and energy consumption. Simulation results show that the proposed algorithm can effectively reduce cumulative training delay and energy consumption of federated learning training while ensuring model accuracy.
-
Key words:
- Intelligent driving /
- Federated learning /
- Digital twin /
- Deep reinforcement learning
-
算法1 基于MPDQN的边缘选择和资源分配算法 输入:学习率(λd,λc),学习回合数Nmax,概率分布ψ,探索概率ε,小批量大小B,采样数据的学习回合数量Nsam 输出:边缘选择和资源分配策略 (1) 初始化网络参数(θd,θc)和经验回放池 (2) for i=1,2,⋯,Nmax do (3) 收到初始状态s1={sm,1}∀m∈M (4) for 数字孪生辅助联邦学习全局迭代k∈K do (5) for 智能体m∈M do (6) 根据式(22)计算连续动作参数fm(k) 根据ε贪婪策略选择动作am,k={φm(k),fm(k)}: (7) am,k={分布ψ的样本,ε(φm(k),fm(k)),φm(k)=argmaxφQ(sm,k,φm(k),f∗m(k)),1−ε (9) 执行动作am,k,获得瞬时奖励rm,k和下一个状态sm,k+1 (10) 将元组[sm,k,am,k,rm,k,sm,k+1]存入经验回放池Dm (11) 经验回放池Dm中采样一组小批量B的数据样本 (12) 根据式(19)更新TQN的目标函数ym(k) (13) 根据式(20)和式(21)分别计算损失函数L(ϖm,d)和L(ϖm,c) (15) 根据式(22)和式(23)更新网络参数ϖm,d(k+1)和ϖm,c(k+1) (17) if (i>Nsam) then (18) 从经验回放池D中采样一组小批量B的数据样本 (19) 更新参数ϖd,me(k+1)←λ′me∇ϖd,mel(ϖ)和ϖc,me(k+1)←λ′me∇l(ϖc) (21) 融合网络下发最新的参数至各个智能体 (22) end if (23) end for (24) end for (25) end for -
[1] BOUKERCHE A and DE GRANDE R E. Vehicular cloud computing: Architectures, applications, and mobility[J]. Computer Networks, 2018, 135: 171–189. doi: 10.1016/j.comnet.2018.01.004. [2] ARENA F and PAU G. An overview of vehicular communications[J]. Future Internet, 2019, 11(2): 27. doi: 10.3390/fi11020027. [3] BENNIS M. Federated learning and control at the wireless network edge[J]. GetMobile:Mobile Computing and Communications, 2021, 24(3): 9–13. doi: 10.1145/3447853.3447857. [4] CHEN Mingzhe, POOR H V, SAAD W, et al. Convergence time minimization of federated learning over wireless networks[C]. ICC 2020–2020 IEEE International Conference on Communications (ICC), Dublin, Ireland, 2020: 1–6. [5] WU Yiwen, ZHANG Ke, and ZHANG Yan. Digital twin networks: a survey[J]. IEEE Internet of Things Journal, 2021, 8(18): 13789–13804. doi: 10.1109/JIOT.2021.3079510. [6] GRIEVES M and VICKERS J. Digital twin: Mitigating unpredictable, undesirable emergent behavior in complex systems[M]. KAHLEN F J, FLUMERFELT S, and ALVES A. Transdisciplinary Perspectives on Complex Systems: New Findings and Approaches. Cham, Germany: Springer, 2017: 85–113. [7] DAI Yueyue, GUAN Yongliang, LEUNG K K, et al. Reconfigurable intelligent surface for low-latency edge computing in 6G[J]. IEEE Wireless Communications, 2021, 28(6): 72–79. doi: 10.1109/MWC.001.2100229. [8] SUN Wen, LEI Shiyu, WANG Lu, et al. Adaptive federated learning and digital twin for industrial internet of things[J]. IEEE Transactions on Industrial Informatics, 2021, 17(8): 5605–5614. doi: 10.1109/TII.2020.3034674. [9] HUI Yilong, ZHAO Gaosheng, LI Chengle, et al. Digital twins enabled on-demand matching for multi-task federated learning in HetVNets[J]. IEEE Transactions on Vehicular Technology, 2023, 72(2): 2352–2364. doi: 10.1109/TVT.2022.3211005. [10] LU Yunlong, MAHARJAN S, and ZHANG Yan. Adaptive edge association for wireless digital twin networks in 6G[J]. IEEE Internet of Things Journal, 2021, 8(22): 16219–16230. doi: 10.1109/JIOT.2021.3098508. [11] XIONG Jiechao, WANG Qing, YANG Zhuoran, et al. Parametrized deep Q-networks learning: Reinforcement learning with discrete-continuous hybrid action space[J]. arXiv: 1810.06394, 2018. [12] YIN Sixing and YU F R. Resource allocation and trajectory design in UAV-aided cellular networks based on multiagent reinforcement learning[J]. IEEE Internet of Things Journal, 2022, 9(4): 2933–2943. doi: 10.1109/JIOT.2021.3094651. [13] XIAO Han, RASUL K, and VOLLGRAF R. Fashion-MNIST: a novel image dataset for benchmarking machine learning algorithms[J]. arXiv: 1708.07747, 2017. [14] YU Xiangbin, XU Weiye, LEUNG S H, et al. Power allocation for energy efficient optimization of distributed MIMO system with beamforming[J]. IEEE Transactions on Vehicular Technology, 2019, 68(9): 8966–8981. doi: 10.1109/TVT.2019.2931291. [15] ZHANG Jiaxiang, LIU Yiming, QIN Xiaoqi, et al. Energy-efficient federated learning framework for digital twin-enabled industrial internet of things[C]. The IEEE 32nd Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), Helsinki, Finland, 2021: 1160–1166. 期刊类型引用(4)
1. 张仕斌,黄曦,昌燕,闫丽丽,程稳. 大数据环境下量子机器学习的研究进展及发展趋势. 电子科技大学学报. 2021(06): 802-819 . 百度学术
2. 谢丽霞,魏瑞炘. 一种面向物联网节点的综合信任度评估模型. 西安电子科技大学学报. 2019(04): 58-65 . 百度学术
3. 谢丽霞,魏瑞炘. 物联网节点动态信任度评估方法. 计算机应用. 2019(09): 2597-2603 . 百度学术
4. 廖新考,王力生,刘晓建,许晓洁. 网络环境下的个性化信任模型PTM. 计算机科学. 2017(08): 100-106 . 百度学术
其他类型引用(9)
-