基于D3QN算法的电力无线传感网络用户满意度优化

doi:10.12158/j.2096-3203.2026.03.007

首页 > 过刊浏览>2026年第45卷第3期 >57-62,115. DOI:10.12158/j.2096-3203.2026.03.007

基于D3QN算法的电力无线传感网络用户满意度优化
DOI:
                        10.12158/j.2096-3203.2026.03.007
                    
CSTR:
                        
                    
作者:
                        
                        
                    
作者单位:
作者简介:
通讯作者:
中图分类号:TM734
基金项目:国家重点研发计划资助项目（2024YFB3213400）

User satisfaction optimization of power wireless sensor networks based on the D3QN algorithm

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

在电力无线传感网络（power wireless sensor network, PWSN）中，多用户上行并发接入受限于有限的频谱与功率资源，且不同监测业务对通信可靠性与时延的需求存在显著差异，导致资源调度难以兼顾整体效能与用户体验。文中在正交频分复用（orthogonal frequency division multiplexing, OFDM）上行架构中构建一种能够在异构业务环境实现服务质量差异化保障的联合资源分配机制，同时设计可量化的用户满意度函数，将子载波与功率联合优化建模为一个马尔科夫决策过程（Markov decision process, MDP），并引入双决斗深度Q网络（dueling double deep Q network, D3QN）算法动态调整资源分配策略。此外，为进一步降低计算复杂度，文中提出动作空间下采样机制，能有效提升训练效率。仿真结果表明，文中算法在不同节点规模与子载波配置下均能够快速收敛，相较于传统深度Q网络（deep Q network, DQN）、随机分配与均匀分配方法，文中算法能显著提升用户满意度。

Abstract:

In power wireless sensor networks (PWSNs), concurrent uplink access by multiple users is constrained by limited spectrum and power resources, while heterogeneous monitoring services exhibit markedly different requirements in terms of reliability and latency. These factors make it challenging for resource scheduling to simultaneously satisfy overall system efficiency and user-perceived quality. In this work, a joint resource allocation mechanism capable of providing differentiated quality-of-service guarantees under heterogeneous service demands is formulated within an uplink orthogonal frequency division multiplexing (OFDM) framework. A quantifiable user-satisfaction function is designed, and the joint optimization of subcarrier and power allocation is modeled as a Markov decision process (MDP). A dueling double deep Q network (D3QN) algorithm is further introduced to dynamically adjust the allocation strategy. In addition, an action-space down-sampling mechanism is proposed to reduce computational complexity and enhance training efficiency. Simulation results demonstrate that the proposed algorithm achieves fast convergence under various node densities and subcarrier configurations, and yields significant improvements in user satisfaction compared with conventional DQN, random allocation, and uniform allocation methods.

参考文献

相似文献

引证文献

引用本文

杨景刚,胡成博,朱雪琼,王真,刘洪,李慧.基于D3QN算法的电力无线传感网络用户满意度优化[J].电力工程技术,2026,45(3):57-62,115. YANG Jinggang, HU Chengbo, ZHU Xueqiong, WANG Zhen, LIU Hong, LI Hui. User satisfaction optimization of power wireless sensor networks based on the D3QN algorithm[J]. Electric Power Engineering Technology,2026,45(3):57-62,115.

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2025-09-27
最后修改日期:2025-12-17
在线发布日期: 2026-03-31
出版日期: 2026-03-28

首页

期刊简介

编委会

道德声明与制度

投稿须知

开放获取声明

中英文目录

联系我们

ENGLISH

引用本文

分享

相关视频

文章指标

历史

文章二维码