Journal of System Simulation
Abstract
Abstract: Deep reinforcement learning (DRL) has achieved remarkable success in various domains. Nevertheless, existing policy networks in DRL still face significant challenges in areas such as generalizability, multi-task adaptability, and sample efficiency. Policy representation, as a crucial research direction for enhancing DRL capabilities, aims to improve an agent's adaptability to environmental changes and novel tasks by constructing more efficient and generalizable forms of policy expression. This paper provided a concise overview of key research advances in the field of policy representation. It introduced diverse policy architectures, ranging from traditional multi-layer perceptron (MLP) -based policies to those based on pointer networks, sequence generation models, diffusion models, hypernetworks, modular designs, mixture of experts models, and cross-modal policies based on serialized tokens. The paper sorted out cutting-edge research concerning policy representation methods, specifically addressing how semantic information within policy inputs and intermediate representations is encoded and optimized. It concluded with a summary and discussed prospects for future development.
Recommended Citation
Chen, Zhen; Wu, Zhuoyi; and Zhang, Lin
(2025)
"Research on Policy Representation in Deep Reinforcement Learning,"
Journal of System Simulation: Vol. 37:
Iss.
7, Article 10.
DOI: 10.16182/j.issn1004731x.joss.25-0533
Available at:
https://dc-china-simulation.researchcommons.org/journal/vol37/iss7/10
First Page
1753
Last Page
1769
CLC
TP391.9
Recommended Citation
Chen Zhen, Wu Zhuoyi, Zhang Lin. Research on Policy Representation in Deep Reinforcement Learning[J]. Journal of System Simulation, 2025, 37(7): 1753-1769
DOI
10.16182/j.issn1004731x.joss.25-0533
Included in
Artificial Intelligence and Robotics Commons, Computer Engineering Commons, Numerical Analysis and Scientific Computing Commons, Operations Research, Systems Engineering and Industrial Engineering Commons, Systems Science Commons