Simulation of Robotic Arm Grasping Control Based on Proximal Policy Optimization Algorithm

Zhizhuo Zhang; Change Zheng

doi:10.1088/1742-6596/2203/1/012065

Journal of Physics: Conference Series

Paper • The following article is Open access

Simulation of Robotic Arm Grasping Control Based on Proximal Policy Optimization Algorithm

Zhizhuo Zhang¹ and Change Zheng¹

Published under licence by IOP Publishing Ltd
Journal of Physics: Conference Series, Volume 2203, International Conference on Robotics Automation and Intelligent Control (ICRAIC 2021) 26/11/2021 - 28/11/2021 Wuhan Citation Zhizhuo Zhang and Change Zheng 2022 J. Phys.: Conf. Ser. 2203 012065 DOI 10.1088/1742-6596/2203/1/012065

Download Article PDF

Article metrics

1059 Total downloads

Author e-mails

zhangzhizhuo@bjfu.edu.cn

Author affiliations

¹ The school of Technology, Beijing Forestry University, Beijing, China

Buy this article in print

Journal RSS

Sign up for new issue notifications

Abstract

There are many kinds of inverse kinematics solutions for robots. Deep reinforcement learning can make the robot spend a short time to find the optimal inverse kinematics solution. Aiming at the problem of sparse rewards in the process of deep reinforcement learning, this paper proposes an improved PPO algorithm. Firstly, built a simulation environment for the operation of the robotic arm. Secondly, use a convolutional neural network to process the data read by the camera of the robotic arm, obtaining a network about Actor and Critic. Thirdly, based on the principle of inverse kinematics of the robotic arm and the reward mechanism in deep reinforcement learning, design a hierarchical reward function containing motion accuracy to promote the convergence of the PPO algorithm. Finally, compare the improved PPO algorithm with the traditional PPO algorithm. The results show that the improved PPO algorithm has improved both the convergence speed and the operating accuracy.

Export citation and abstract BibTeX RIS

Previous article in issue

Next article in issue

Content from this work may be used under the terms of the Creative Commons Attribution 3.0 licence. Any further distribution of this work must maintain attribution to the author(s) and the title of the work, journal citation and DOI.

Please wait… references are loading.

Simulation of Robotic Arm Grasping Control Based on Proximal Policy Optimization Algorithm

Article metrics

Share this article

Author e-mails

Author affiliations

Abstract