一种基于马尔可夫决策过程的认知无线电网络传输调度方案

朱江; 徐斌阳; 李少谦

doi:10.3724/SP.J.1146.2008.00960

留言板

尊敬的读者、作者、审稿人, 关于本刊的投稿、审稿、编辑和出版的任何问题, 您可以本页添加留言。我们将尽快给您答复。谢谢您的支持!

姓名

邮箱

手机号码

标题

留言内容

验证码

一种基于马尔可夫决策过程的认知无线电网络传输调度方案

doi: 10.3724/SP.J.1146.2008.00960 cstr: 32379.14.SP.J.1146.2008.00960

基金项目:

国家自然科学基金(60496313)，国家863计划项目(2005AA123910，2007AA01Z209)和国家973规划项目(2009CB320405)资助课题

计量
- 文章访问数: 3401
- HTML全文浏览量: 145
- PDF下载量: 1348
- 被引次数: 0
出版历程
- 收稿日期: 2008-07-30
- 修回日期: 2009-01-05
- 刊出日期: 2009-08-19

A Transmission and Scheduling Scheme Based on Markov Decision Process in Cognitive Radio Networks

摘要

摘要: 该文提出了一种适用于认知无线电网络的跨层传输调度方案，即满足掉包率约束的前提下最小化平均功率消耗。此方案被建模为约束马尔可夫决策过程(MDP)。采用拉格朗日乘子法求解此MDP，并且提出了一种黄金分割乘子搜索法。提出两种简化方法，即状态聚合以及行动集缩减来解决维灾问题。仿真结果显示简化方法对该方案的性能影响很小，且该方案的平均功耗最低。
- 认知无线电;马尔可夫决策过程;跨层设计;传输调度
Abstract: A cross-layer transmission and scheduling scheme of average power minimization in cognitive radio networks under the constraint of packet drop probability is addressed. The scheme is formulated by constrained Markov Decision Process (MDP). Lagrangian multiplier approach is used to solve the MDP, and a golden section search method is proposed to find the multiplier. Two simplifying methods, namely, state aggregate and action set reduction are employed to cope with the curse of dimensionality. Simulation results show that simplifying methods have little influence on the performance of the scheme and average power consumption of the scheme is the lowest.

HTML全文

参考文献(1)

Hossain E and Bhargava V. Cognitive WirelessCommunication Networks [M]. First Edition, New York:Springer, 2007: 1-301.[2]Djonin D V, et al.. Joint rate and power adaptation for type-Ihybrid ARQ systems over correlated fading channels underdifferent buffer cost constraints [J]. IEEE Transactions. onWireless Communications, 2008, 57(1): 421-435.[3]Bolch G.[J].et al.. Queueing Networks and Markov Chains:Modeling and Performance Evaluation with ComputerScience Applications [M]. Second Edition, New York: JohnWiley Sons.2006,:-[4]Chung Seong Taek and Goldsmith A. Degrees of freedom inadaptive modulation: A unified view [J].IEEE Transactions.on Communications.2001, 49(9):1561-1571[5]Chang H S, et al.. Simulation-based Algorithms for MarkovDecision Processes [M]. First Edition, London: Springer-Verlag, 2007: 9-167.[6]Beutle F J and Ross K W. Optimal policies for controlledmarkov chains with a constraint [J]. Journal of MathematicalAnalysis and Application, 1985, 112(1): 236-252.[7]Hossain M J, et al.. Delay limited optimal and suboptimalpower and bit loading algorithms for OFDM systems overcorrelated fading [C]. IEEE GLOBECOM, St. Louis, USA,Dec. 1-2, 2005: 3448-3453.[8]Pandana C and Liu K J R. Near-optimal reinforcementlearning framework for energy-aware sensor communications[J]. IEEE Transactions. on Wireless Communications, 2005,23(4): 788-797.

施引文献

资源附件(0)

访问统计

计量

文章访问数: 3401
HTML全文浏览量: 145
PDF下载量: 1348
被引次数: 0

留言板

一种基于马尔可夫决策过程的认知无线电网络传输调度方案

doi: 10.3724/SP.J.1146.2008.00960 cstr: 32379.14.SP.J.1146.2008.00960

计量

出版历程

A Transmission and Scheduling Scheme Based on Markov Decision Process in Cognitive Radio Networks

计量

出版历程

目录