面向无线虚拟现实传输的生理信号驱动体验质量优化

吴昌; 彭铭宇; 陈雨昂; 陈艺元; 郭凤谦; 秦晓卫; 卢汉成

doi:10.11999/JEIT260067

面向无线虚拟现实传输的生理信号驱动体验质量优化

doi: 10.11999/JEIT260067 cstr: 32379.14.JEIT260067

中国科学技术大学未来网络实验室合肥 230027

基金项目: 国家自然科学基金联合基金重点项目(U21A20452)

详细信息

作者简介:
吴昌：男，博士生，研究方向为用户中心网络、B5G/6G、智能无线接入网络、业务调度、拥塞控制

彭铭宇：男，硕士生，研究方向为无线通信、边缘计算

陈雨昂：男，博士生，研究方向为虚拟现实流媒体传输、下一代超可靠低延迟通信

陈艺元：男，博士生，研究方向为基于人工智能的生物信号处理

郭凤谦：男，副研究员，研究方向为无线资源优化、多媒体传输

秦晓卫：男，副教授，研究方向为移动通信网络大数据、终端低能耗

卢汉成：男，教授，研究方向为无线通信系统中的资源优化、无线网络边缘的缓存与服务卸载

通讯作者:
卢汉成　hclu@ustc.edu.cn

1¹⁾本文代码可在https://github.com/ChangWu98/LiveVR_PPO获取。
中图分类号: TN911.7
计量
- 文章访问数: 190
- HTML全文浏览量: 106
- PDF下载量: 26
- 被引次数: 0
出版历程
- 收稿日期: 2026-01-20
- 修回日期: 2026-05-18
- 录用日期: 2026-05-29
- 网络出版日期: 2026-06-08

Physiological Signal-driven QoE Optimization for Wireless Virtual Reality Transmission

Information Network Laboratory, University of Science and Technology of China, Hefei 230027, China

Funds: The National Natural Science Foundation of China Joint Fund Key Project (U21A20452)

摘要

摘要: 虚拟现实(VR)流媒体传输中的突发分辨率变化会显著降低用户的体验质量(QoE)，尤其是在从高分辨率向低分辨率切换的过程中。现有的QoE模型与传输方案未能充分解决这类变化对感知的影响。为弥补这一空白，该文提出一种创新的、生理信号驱动的QoE建模与优化框架，该框架充分利用了用户的脑电图(EEG)、心电图(ECG)以及皮肤电活动，能够精确捕捉VR流媒体传输中生理反应与分辨率变化的时间动态，从而实现对分辨率上升所带来收益以及分辨率下降所造成影响的准确量化。通过在一个深度强化学习(DRL)框架下将所提出的QoE模型集成到无线接入网络(RAN)中，该文实现了自适应传输策略，以动态分配无线资源，从而缓解短期信道波动，并根据用户移动性引发的信道变化调整帧分辨率。通过优先保证长期分辨率并尽量减少突发切换，所提方案相较于基线方案实现了88.7%的分辨率提升，并使分辨率切换频率降低了81.0%。实验结果证明了该生理信号驱动策略的有效性，并凸显了边缘人工智能在沉浸式媒体服务中的应用前景。
- 无线虚拟现实 /
- 生理信号 /
- 体验质量 /
- 深度强化学习
Abstract: Objective Virtual Reality (VR) has become a transformative medium for immersive digital experiences because it can deliver high-resolution 360° video with ultra-low Motion-To-Photon (MTP) latency. However, its dependence on wireless transmission creates major challenges. Uncompressed data rates above 1 Gbit/(s·Hz) and latency thresholds below 20 ms place stringent demands on network infrastructure. In mobile scenarios, channel fluctuation and user mobility often compromise service continuity and cause abrupt resolution changes. Traditional Quality of Service (QoS) metrics, such as bandwidth, jitter, and packet loss, provide useful network-level information but cannot adequately reflect subjective user satisfaction. Existing Quality of Experience (QoE) models and Adaptive BitRate (ABR) algorithms often use symmetric metrics, such as Mean Opinion Score (MOS), and overlook the fact that users perceive quality deterioration and quality improvement differently. Sudden resolution downgrading has a stronger negative effect on immersion than the positive effect caused by resolution upgrading. This perceptual asymmetry is consistent with behavioral psychology but remains insufficiently addressed in current transmission schemes. In addition, the separation between Radio Access Network (RAN) resource provisioning and application-layer bitrate adaptation often causes mismatched optimization, video-quality oscillation, and resource underuse. To address these issues, this study establishes a quantitative link between physiological responses and resolution changes. It further develops a physiological signal-driven QoE framework integrated with Deep Reinforcement Learning (DRL) to support adaptive transmission, maximize immersion, and reduce the adverse effects of resolution fluctuation in resource-constrained wireless networks. Methods A two-stage method is adopted, including physiological signal analysis and joint optimization framework design. A controlled VR experiment is conducted to quantify the perceptual effect of resolution changes. Nineteen healthy subjects participate in a viewing task using an eye-tracking VR headset, a 32-channel wireless ElectroEncephaloGraphy (EEG) system, ElectroCardioGraphy (ECG) recording, and Galvanic Skin Response (GSR) sensors. The subjects view natural-scene videos in which the resolution levels, including 8k, 4k, 1080P, 720P, and 480P, switch randomly every 8 s. The collected EEG signals are preprocessed by independent component analysis and band-pass filtering. Event-Related Potential (ERP) components are analyzed, with emphasis on the N200 component in the temporal and occipital regions, which reflects visual processing and attention allocation. A Linear Discriminant Analysis (LDA) classifier is used to distinguish different response types. The analysis focuses on the asymmetry between resolution upgrading and downgrading, and on sensitivity to the magnitude of resolution jumps. Based on these physiological findings, a QoE model is formulated by adding penalty terms for resolution degradation and large-amplitude resolution switching. These penalties are weighted more strongly than upgrade rewards to represent user aversion to quality drops. The model is then integrated into an edge-computing environment through a dual-timescale DRL framework. The framework separates control into two cooperative agents: the Scheduling and Utility (SU) agent and the Resolution Scaling (RS) agent. The SU agent operates at the millisecond timescale and performs real-time wireless resource allocation. It uses a Gated Recurrent Unit (GRU) to extract temporal features from Channel State Information (CSI) and transmission history. It then dynamically allocates bandwidth to improve frame delivery success and maintain fairness under VR frame-deadline constraints. The RS agent operates at the frame timescale and determines the resolution of subsequent video frames. Its decision-making is guided by the physiological signal-driven reward function, which penalizes actions that may trigger negative physiological responses, such as sharp resolution drops, unless channel deterioration makes them necessary. Proximal Policy Optimization (PPO) is selected for both agents because of its stable learning behavior in continuous and discrete action spaces. Simulations are conducted using a 3GPP-based wireless channel module with user mobility, shadow fading, and path loss to create a dynamic network environment. Results and Discussions The physiological experiment and network simulations validate the proposed framework. In the physiological analysis, a clear N200 response is observed approximately 200 ms after resolution changes. The N200 amplitude is significantly larger during resolution downgrading than during resolution upgrading (p < 0.001), indicating that users are more sensitive to quality deterioration. Large resolution jumps, such as changes from 8k to 1080P, also induce stronger neural responses and more concentrated occipital energy than minor adjustments. The LDA classifier achieves an average Area Under the Curve (AUC) of 74.12% across 19 subjects, confirming that neural responses contain discriminative information about the direction of resolution change. The GSR results support these findings. A dual-branch GSR feature extraction and classification model reaches an average AUC of 78.10% in distinguishing upward and downward switching events. By contrast, ECG signals do not show a stable effect under the current experimental setting and analysis granularity. Therefore, the subsequent QoE model is mainly constructed from EEG and GSR findings. In the network performance evaluation, the proposed physiological signal-driven DRL framework is compared with several baselines, including Proportional-Fair (PF) scheduling, equal resource allocation, and traditional congestion control represented by SCReAM. The training curves show that the dual-agent system converges and learns to coordinate capacity provisioning with resolution decisions. The SU agent smooths short-term channel fluctuation and provides a stable capacity basis, which enables the RS agent to make more reliable resolution decisions. Quantitative results show that the proposed scheme improves the average video resolution by up to 88.7% compared with the equal-resource baseline. More critically, the resolution switching frequency is reduced by up to 81.0%. This reduction is essential because frequent switching, especially downward switching, causes user discomfort, as demonstrated by the physiological analysis. By prioritizing long-term resolution stability and penalizing abrupt drops through the physiological signal-driven reward function, the proposed system reduces the “ping-pong” effect commonly observed in traditional ABR algorithms. Compared with schemes using different penalty weights, the proposed method achieves a better balance. It avoids overly conservative behavior under large penalties, which lowers the average resolution, and unstable visual quality under small penalties, which increases resolution fluctuation. The joint optimization also allocates resources preferentially to users with urgent frame deadlines or higher risks of perceptible quality degradation, while maintaining a frame delivery success rate above 99%. Conclusions This paper addresses the conflict between wireless-channel instability and the human need for visually consistent VR streaming. By adopting a physiological signal-driven approach, the asymmetric effect of resolution changes on user experience is quantified, which challenges the symmetric assumptions used in traditional QoE models. Integrating this physiological evidence into a dual-timescale DRL framework enables the RAN to go beyond throughput-oriented optimization. Wireless resource allocation supports stable application-layer adaptation, while application-layer demands guide resource scheduling. The proposed solution improves immersive experience by increasing average resolution and reducing the physiologically disruptive effects of sudden quality degradation. The reduction in resolution switching frequency by more than 80% shows that the system can shield users from network variability. This study also indicates the value of edge intelligence in making resource-allocation decisions based on human perception rather than network statistics alone. Future work should extend the QoE model by considering multisensory factors, such as MTP latency, cybersickness, spatial distortion, stalling, and audiovisual synchronization. Individual differences in physiological sensitivity should also be addressed through personalized modeling. For real-world deployment, privacy protection is essential. Federated learning and local edge updates may allow biometric data to be processed locally while supporting global policy optimization. This work provides a human-centric basis for immersive networking and shifts the focus from QoS to physiologically validated QoE.
- Wireless virtual reality /
- physiological signals /
- Quality of Experience (QoE) /
- Deep Reinforcement Learning (DRL)

HTML全文

图 1 无线VR视频传输系统的结构与工作流程

下载: 全尺寸图片幻灯片

图 2 VR生理实验系统结构及相关结果

下载: 全尺寸图片幻灯片

图 3 GSR特征提取与分类模型架构

下载: 全尺寸图片幻灯片

图 4 采用GRU增强特征提取的多智能体DRL框架

下载: 全尺寸图片幻灯片

图 5 所提方案在不同用户数量下的训练曲线与运行时间

下载: 全尺寸图片幻灯片

图 6 两个智能体在时变信道中的联合决策效果

下载: 全尺寸图片幻灯片

图 7 不同方案在平均视频帧分辨率、切换率以及帧成功率方面的性能对比

下载: 全尺寸图片幻灯片

参考文献(24)

[1]	YAQOOB A, BI Ting, and MUNTEAN G M. A survey on adaptive 360° video streaming: Solutions, challenges and opportunities[J]. IEEE Communications Surveys & Tutorials, 2020, 22(4): 2801–2838. doi: 10.1109/COMST.2020.3006999.
[2]	CHEN Yuang, LU Hancheng, QIN Langtian, et al. Streaming 360° VR video with statistical QoS provisioning in mmWave networks from delay and rate perspectives[J]. IEEE Transactions on Wireless Communications, 2025, 24(6): 4721–4737. doi: 10.1109/TWC.2025.3543615.
[3]	WEN Wen, LI Mu, YAO Yiru, et al. Perceptual quality assessment of virtual reality videos in the wild[J]. IEEE Transactions on Circuits and Systems for Video Technology, 2024, 34(9): 8368–8381. doi: 10.1109/TCSVT.2024.3378352.
[4]	KOUGIOUMTZIDIS G, POULKOV V K, LAZARIDIS P I, et al. Deep reinforcement learning-based resource allocation for QoE enhancement in wireless VR communications[J]. IEEE Access, 2025, 13: 25045–25058. doi: 10.1109/ACCESS.2025.3538546.
[5]	GAO Nianzhen, ZHOU Jiaxi, WAN Guoan, et al. Low-latency VR video processing-transmitting system based on edge computing[J]. IEEE Transactions on Broadcasting, 2024, 70(3): 862–871. doi: 10.1109/TBC.2024.3380455.
[6]	兰诚栋, 饶迎节, 宋彩霞, 等. 基于强化学习的立体全景视频自适应流[J]. 电子与信息学报, 2022, 44(4): 1461–1468. doi: 10.11999/JEIT200908. LAN Chengdong, RAO Yingjie, SONG Caixia, et al. Adaptive streaming of stereoscopic panoramic video based on reinforcement learning[J]. Journal of Electronics & Information Technology, 2022, 44(4): 1461–1468. doi: 10.11999/JEIT200908.
[7]	FIEDLER M, ZEPERNICK H J, and KELKKANEN V. Network-induced temporal disturbances in virtual reality applications[C]. 2019 Eleventh International Conference on Quality of Multimedia Experience, Berlin, Germany, 2019: 1–3. doi: 10.1109/QoMEX.2019.8743304.
[8]	ZHANG Jiayi, BLANDINO S, VARSHNEY N, et al. Multi-user MIMO enabled virtual reality in IEEE 802.11ay WLAN[C]. 2022 IEEE Wireless Communications and Networking Conference, Austin, USA, 2022: 2595–2600. doi: 10.1109/WCNC51071.2022.9771778.
[9]	CHAKARESKI J, KHAN M, ROPITAULT T, et al. 6DOF virtual reality dataset and performance evaluation of millimeter wave vs. free-space-optical indoor communications systems for lifelike mobile VR streaming[C]. 2020 54th Asilomar Conference on Signals, Systems, and Computers, Pacific Grove, USA, 2020: 1051–1058. doi: 10.1109/IEEECONF51394.2020.9443328.
[10]	ANWAR M S, WANG Jing, ULLAH A, et al. Measuring quality of experience for 360-degree videos in virtual reality[J]. Science China Information Sciences, 2020, 63(10): 202301. doi: 10.1007/s11432-019-2734-y.
[11]	FEI Zesong, WANG Fei, WANG Jing, et al. QoE evaluation methods for 360-degree VR video transmission[J]. IEEE Journal of Selected Topics in Signal Processing, 2020, 14(1): 78–88. doi: 10.1109/JSTSP.2019.2956631.
[12]	ZUO Xutong, YANG Jiayu, WANG Mowei, et al. Adaptive bitrate with user-level QoE preference for video streaming[C]. IEEE Conference on Computer Communications, London, United Kingdom, 2022: 1279–1288. doi: 10.1109/INFOCOM48880.2022.9796953.
[13]	JOHANSSON I. Self-clocked rate adaptation for conversational video in LTE[C]. The 2014 ACM SIGCOMM Workshop on Capacity Sharing Workshop, Chicago, USA, 2014: 51–56. doi: 10.1145/2630088.2631976.
[14]	MAURA F, CASASNOVAS M, and BELLALTA B. Experimenting with adaptive bitrate algorithms for virtual reality streaming over Wi-Fi[C]. The 30th Annual International Conference on Mobile Computing and Networking, Washington, USA, 2024: 1930–1937. doi: 10.1145/3636534.3697322.
[15]	BAMPIS C G, LI Zhi, KATSAVOUNIDIS I, et al. Towards perceptually optimized adaptive video streaming-a realistic quality of experience database[J]. IEEE Transactions on Image Processing, 2021, 30: 5182–5197. doi: 10.1109/TIP.2021.3073294.
[16]	AGARWAL B, TOGOU M A, RUFFINI M, et al. QoE-driven optimization in 5G O-RAN-enabled HetNets for enhanced video service quality[J]. IEEE Communications Magazine, 2023, 61(1): 56–62. doi: 10.1109/MCOM.003.2200229.
[17]	3GPP. Technical specification (TS) 38.214 Physical layer procedures for data[S/OL]. 3rd Generation Partnership Project (3GPP), 2025[2026-03-22]. https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=3216.
[18]	BENTALEB A, TAANI B, BEGEN A C, et al. A survey on bitrate adaptation schemes for streaming media over HTTP[J]. IEEE Communications Surveys & Tutorials, 2019, 21(1): 562–585. doi: 10.1109/COMST.2018.2862938.
[19]	曾焕强, 孔庆玮, 陈婧, 等. 沉浸式视频编码技术综述[J]. 电子与信息学报, 2024, 46(2): 602–614. doi: 10.11999/JEIT230097. ZENG Huanqiang, KONG Qingwei, CHEN Jing, et al. Overview of immersive video coding[J]. Journal of Electronics & Information Technology, 2024, 46(2): 602–614. doi: 10.11999/JEIT230097.
[20]	YEZNABAD Y F, HELFERT M, and MUNTEAN G M. QoE-driven cross-layer bitrate allocation approach for MEC-supported adaptive video streaming[J]. IEEE Transactions on Network and Service Management, 2024, 21(6): 6857–6874. doi: 10.1109/TNSM.2024.3453992.
[21]	ABDUL KADER L, AL-SHARGIE F, TARIQ U, et al. One-channel wearable mental stress state monitoring system[J]. Sensors, 2024, 24(16): 5373. doi: 10.3390/s24165373.
[22]	VIDAURRE C, KRÄMER N, BLANKERTZ B, et al. Time domain parameters as a feature for EEG-based brain–computer interfaces[J]. Neural Networks, 2009, 22(9): 1313–1319. doi: 10.1016/j.neunet.2009.07.020.
[23]	BLANKERTZ B, LEMM S, TREDER M, et al. Single-trial analysis and classification of ERP components—A tutorial[J]. NeuroImage, 2011, 56(2): 814–825. doi: 10.1016/j.neuroimage.2010.06.048.
[24]	SCHULMAN J, WOLSKI F, DHARIWAL P, et al. Proximal policy optimization algorithms[J]. arXiv: 1707.06347, 2017. doi: 10.48550/arXiv.1707.06347.