Reinforcement Learning-Based Algorithm to Avoid Obstacles by the Anthropomorphic Robotic Arm
[ 1 ] Instytut Technologii Mechanicznej, Wydział Inżynierii Mechanicznej, Politechnika Poznańska | [ P ] pracownik
2022
artykuł naukowy
angielski
- obstacle avoidance
- positioning
- robotic arm
- reinforcement learning
EN In this paper, the application of the policy gradient Reinforcement Learning-based (RL) method for obstacle avoidance is proposed. This method was successfully used to control the movements of a robot using trial-and-error interactions with its environment. In this paper, an approach based on a Deep Deterministic Policy Gradient (DDPG) algorithm combined with a Hindsight Experience Replay (HER) algorithm for avoiding obstacles has been investigated. In order to ensure that the robot avoids obstacles and reaches the desired position as quickly and as accurately as possible, a special approach to the training and architecture of two RL agents working simultaneously was proposed. The implementation of this RL-based approach was first implemented in a simulation environment, which was used to control the 6-axis robot simulation model. Then, the same algorithm was used to control a real 6-DOF (degrees of freedom) robot. The results obtained in the simulation were compared with results obtained in laboratory conditions.
30.06.2022
6629-1 - 6629-24
Article Number: 6629
CC BY (uznanie autorstwa)
otwarte czasopismo
ostateczna wersja opublikowana
w momencie opublikowania
100
2,7