sadra-barikbin commited on
Commit
51fcb1d
1 Parent(s): 9fe1fca

Push agent to the Hub

Browse files
README.md ADDED
@@ -0,0 +1,40 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - LunarLander-v2
4
+ - ppo
5
+ - deep-reinforcement-learning
6
+ - reinforcement-learning
7
+ - custom-implementation
8
+ - deep-rl-course
9
+ model-index:
10
+ - name: PPO
11
+ results:
12
+ - task:
13
+ type: reinforcement-learning
14
+ name: reinforcement-learning
15
+ dataset:
16
+ name: LunarLander-v2
17
+ type: LunarLander-v2
18
+ metrics:
19
+ - type: mean_reward
20
+ value: -101.69 +/- 95.52
21
+ name: mean_reward
22
+ verified: false
23
+ ---
24
+
25
+ # PPO Agent Playing LunarLander-v2
26
+
27
+ This is a trained model of a PPO agent playing LunarLander-v2.
28
+
29
+ # Hyperparameters
30
+ ```python
31
+ {'max_frames': 400000
32
+ 'gamma': 0.95
33
+ 'PPO_epsilon': 0.2
34
+ 'max_t': 99
35
+ 'target_nets_update_freq': 50
36
+ 'lr': 0.00025
37
+ 'entropy_coef': 0.01
38
+ 'env_id': 'LunarLander-v2'}
39
+ ```
40
+
logs/events.out.tfevents.1685131121.f0828af4afe6.1228.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:33de27691f684efdfbf546df0102a7100f2552ae365cf7d2f2484e763e360451
3
+ size 88
logs/events.out.tfevents.1685131323.f0828af4afe6.1228.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c21bf1e7bbdaa43e8247e8aec57e098a797cc38dafd081c14c3da4a20272c7cc
3
+ size 125631
logs/events.out.tfevents.1685132511.f0828af4afe6.1228.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d57354d88ff90a5c5e383bb68a18fa6b92c236373575873160749dc09f6d0821
3
+ size 134009
logs/events.out.tfevents.1685132954.f0828af4afe6.1228.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0bdcdcf4a03df2dd7da6cf3d1b8c2924d27e06d035878cb1583c898eb4a04cc8
3
+ size 240268
logs/events.out.tfevents.1685134337.f0828af4afe6.1228.4 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:05bac3483d65f8da16eb96c1fa8df1177c5f5b90c8cb0adb3bb2693f7f6c9c3a
3
+ size 259797
model.pt ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:32fa5b2589a720db0651c25328263740e2e51b10622a2b696f0c274f88686123
3
+ size 5319
replay.mp4 ADDED
Binary file (52.3 kB). View file
 
results.json ADDED
@@ -0,0 +1 @@
 
 
1
+ {"env_id": "LunarLander-v2", "mean_reward": -101.68873075582687, "std_reward": 95.51958858948541, "n_evaluation_episodes": 10, "eval_datetime": "2023-05-26T21:21:36.222293"}