arxiv:2003.12863

Obstacle Avoidance and Navigation Utilizing Reinforcement Learning with Reward Shaping

Published on Apr 10, 2020

Authors:

Abstract

Revised Deep Deterministic Policy Gradient and Proximal Policy Optimization algorithms with improved reward shaping demonstrate enhanced obstacle avoidance and navigation performance in robotic control compared to their original counterparts.

Generated by Qwen/Qwen2.5-Coder-32B-Instruct

In this paper, we investigate the obstacle avoidance and navigation problem in the robotic control area. For solving such a problem, we propose revised Deep Deterministic Policy Gradient (DDPG) and Proximal Policy Optimization algorithms with an improved reward shaping technique. We compare the performances between the original DDPG and PPO with the revised version of both on simulations with a real mobile robot and demonstrate that the proposed algorithms achieve better results.

View arXiv page View PDF Add to collection

Community

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

Upvote

Models citing this paper 0

No model linking this paper

Cite arxiv.org/abs/2003.12863 in a model README.md to link it from this page.

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2003.12863 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2003.12863 in a Space README.md to link it from this page.

Collections including this paper 0

No Collection including this paper

Add this paper to a collection to link it from this page.