Journal of System Simulation
Abstract
Abstract: Method integrating the PPO algorithm with Transformer network architecture is proposed, and curriculum learning strategy is introduced to solve the difficult training convergence and low efficiency of traditional RL methods in complex and dynamic high-degree-of-freedom tasks such as robotic arm ball catching. The Transformer is employed to effectively capture the complex high-dimensional dependency between the robotic arm's state space, ball trajectory, and environmental physical parameters. Curriculum learning progressively increases catching difficulty by designing training tasks from simple to complex objectives. The experimental results show this method increases the ball-catching success rate by over 60% compared to the traditional PPO and features excellent accuracy at tracking balls with real world disturbance characteristics. This method not only enhances the performance and efficiency of dynamic catching for robotic arms in both simulated and real-world disturbance conditions, but also provides a novel solution for complex task control in real-world scenarios.
Recommended Citation
Zhang, Ziyao and Ji, Yunfeng
(2026)
"Simulation of Robotic Arm Ball-catching Strategy Based on Curriculum RL of Transformer,"
Journal of System Simulation: Vol. 38:
Iss.
2, Article 6.
DOI: 10.16182/j.issn1004731x.joss.25-0768
Available at:
https://dc-china-simulation.researchcommons.org/journal/vol38/iss2/6
First Page
321
Last Page
331
CLC
TP391.9
Recommended Citation
Zhang Ziyao, Ji Yunfeng. Simulation of Robotic Arm Ball-catching Strategy Based on Curriculum RL of Transformer[J]. Journal of System Simulation, 2026, 38(2): 321-331.
DOI
10.16182/j.issn1004731x.joss.25-0768
Included in
Artificial Intelligence and Robotics Commons, Computer Engineering Commons, Numerical Analysis and Scientific Computing Commons, Operations Research, Systems Engineering and Industrial Engineering Commons, Systems Science Commons