merve HF staff commited on
Commit
5697a92
1 Parent(s): d2a34cd

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -37,4 +37,4 @@ Second, it uses Experience Replay.
37
 
38
  We store list of tuples (state, action, reward, next_state), and instead of learning only from recent experience, we learn from sampling all of our experience accumulated so far.
39
 
40
- ![pendulum_gif](https://imgur.com/eEH8Cz6)
 
37
 
38
  We store list of tuples (state, action, reward, next_state), and instead of learning only from recent experience, we learn from sampling all of our experience accumulated so far.
39
 
40
+ ![pendulum_gif](https://i.imgur.com/eEH8Cz6.gif)