Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with Reward Shaping
Abstract
Revised Deep Deterministic Policy Gradient and Proximal Policy Optimization algorithms with improved reward shaping demonstrate enhanced obstacle avoidance and navigation performance in robotic control compared to their original counterparts.
In this paper, we investigate the obstacle avoidance and navigation problem in the robotic control area. For solving such a problem, we propose revised Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization algorithms with an improved reward shaping technique. We compare the performances between the original DDPG and PPO with the revised version of both on simulations with a real mobile robot and demonstrate that the proposed algorithms achieve better results.
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper
Collections including this paper 0
No Collection including this paper