Papers
arxiv:2003.12863

Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with Reward Shaping

Published on Apr 10, 2020
Authors:
,

Abstract

Revised Deep Deterministic Policy Gradient and Proximal Policy Optimization algorithms with improved reward shaping demonstrate enhanced obstacle avoidance and navigation performance in robotic control compared to their original counterparts.

In this paper, we investigate the obstacle avoidance and navigation problem in the robotic control area. For solving such a problem, we propose revised Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization algorithms with an improved reward shaping technique. We compare the performances between the original DDPG and PPO with the revised version of both on simulations with a real mobile robot and demonstrate that the proposed algorithms achieve better results.

Community

Sign up or log in to comment

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2003.12863 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2003.12863 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2003.12863 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.