keras-io
/

deep-deterministic-policy-gradient

reinforcement learning

deep deterministic policy gradient

Model card Files Files and versions Community

merve HF staff commited on Dec 15, 2021

Commit

4ed6074

•

1 Parent(s): ed1e753

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -12,6 +12,8 @@ This repo contains the model and the notebook [to this Keras example on Deep Det
 Full credits to: [Hemant Singh](https://github.com/amifunny)
 ## Background Information
 Deep Deterministic Policy Gradient (DDPG) is a model-free off-policy algorithm for learning continous actions.
@@ -39,4 +41,3 @@ Second, it uses Experience Replay.
 We store list of tuples (state, action, reward, next_state), and instead of learning only from recent experience, we learn from sampling all of our experience accumulated so far.
-![pendulum_gif](https://i.imgur.com/eEH8Cz6.gif)

 Full credits to: [Hemant Singh](https://github.com/amifunny)
+![pendulum_gif](https://i.imgur.com/eEH8Cz6.gif)
 ## Background Information
 Deep Deterministic Policy Gradient (DDPG) is a model-free off-policy algorithm for learning continous actions.
 We store list of tuples (state, action, reward, next_state), and instead of learning only from recent experience, we learn from sampling all of our experience accumulated so far.