一种基于马尔可夫决策过程的认知无线电网络传输调度方案
doi: 10.3724/SP.J.1146.2008.00960
A Transmission and Scheduling Scheme Based on Markov Decision Process in Cognitive Radio Networks
-
摘要: 该文提出了一种适用于认知无线电网络的跨层传输调度方案,即满足掉包率约束的前提下最小化平均功率消耗。此方案被建模为约束马尔可夫决策过程(MDP)。采用拉格朗日乘子法求解此MDP,并且提出了一种黄金分割乘子搜索法。提出两种简化方法,即状态聚合以及行动集缩减来解决维灾问题。仿真结果显示简化方法对该方案的性能影响很小,且该方案的平均功耗最低。Abstract: A cross-layer transmission and scheduling scheme of average power minimization in cognitive radio networks under the constraint of packet drop probability is addressed. The scheme is formulated by constrained Markov Decision Process (MDP). Lagrangian multiplier approach is used to solve the MDP, and a golden section search method is proposed to find the multiplier. Two simplifying methods, namely, state aggregate and action set reduction are employed to cope with the curse of dimensionality. Simulation results show that simplifying methods have little influence on the performance of the scheme and average power consumption of the scheme is the lowest.
-
Hossain E and Bhargava V. Cognitive WirelessCommunication Networks [M]. First Edition, New York:Springer, 2007: 1-301.[2]Djonin D V, et al.. Joint rate and power adaptation for type-Ihybrid ARQ systems over correlated fading channels underdifferent buffer cost constraints [J]. IEEE Transactions. onWireless Communications, 2008, 57(1): 421-435.[3]Bolch G.[J].et al.. Queueing Networks and Markov Chains:Modeling and Performance Evaluation with ComputerScience Applications [M]. Second Edition, New York: JohnWiley Sons.2006,:-[4]Chung Seong Taek and Goldsmith A. Degrees of freedom inadaptive modulation: A unified view [J].IEEE Transactions.on Communications.2001, 49(9):1561-1571[5]Chang H S, et al.. Simulation-based Algorithms for MarkovDecision Processes [M]. First Edition, London: Springer-Verlag, 2007: 9-167.[6]Beutle F J and Ross K W. Optimal policies for controlledmarkov chains with a constraint [J]. Journal of MathematicalAnalysis and Application, 1985, 112(1): 236-252.[7]Hossain M J, et al.. Delay limited optimal and suboptimalpower and bit loading algorithms for OFDM systems overcorrelated fading [C]. IEEE GLOBECOM, St. Louis, USA,Dec. 1-2, 2005: 3448-3453.[8]Pandana C and Liu K J R. Near-optimal reinforcementlearning framework for energy-aware sensor communications[J]. IEEE Transactions. on Wireless Communications, 2005,23(4): 788-797.
计量
- 文章访问数: 3275
- HTML全文浏览量: 104
- PDF下载量: 1345
- 被引次数: 0