hishamcse committed on
Commit
5ece54f
1 Parent(s): 450ad75

Upload folder using huggingface_hub

Files changed (4)
  1. README.md +29 -0
  2. agent.pt +3 -0
  3. hyperparameters.json +1 -0
  4. results.json +1 -0
README.md ADDED
@@ -0,0 +1,29 @@
+ ---
+ tags:
+ - SuperMarioBros-v0
+ - RND
+ - CNN
+ - reinforcement-learning
+ - custom-implementation
+ - atari
+ - MontezumaRevengeNoFrameskip-v4
+ model-index:
+ - name: RND-SuperMarioBros-v0
+   results:
+   - task:
+       type: reinforcement-learning
+       name: reinforcement-learning
+     dataset:
+       name: SuperMarioBros-v0
+       type: SuperMarioBros-v0
+     metrics:
+     - type: mean_reward
+       value: 1998.70 +/- 315.66
+       name: mean_reward
+       verified: false
+ ---
+
+ # **RND with CNN** Agent playing **SuperMarioBros-v0**
+ This is a trained model of a **RND-CNN** agent playing **SuperMarioBros-v0** .
+ To learn to use this model and train yours check this notebook on kaggle: https://www.kaggle.com/code/syedjarullahhisham/drl-extra-personal-unit-5-rnd-montezuma-mario-bros
+
agent.pt ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:29ddf058a32db98bee9716c416a6af9c7c90a8edb11e1a09c1f10cc3e51d0554
+ size 21167174
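
The three lines added for agent.pt are a Git LFS pointer, not the checkpoint itself: they record the spec version, the SHA-256 object ID, and the size in bytes of the real file. A minimal sketch of reading such a pointer follows. The helper name `parse_lfs_pointer` is hypothetical and is not part of this commit.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file (one "key value" pair per line)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form "<hash-algorithm>:<hex digest>".
    algorithm, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algorithm": algorithm,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


# Pointer contents copied from the agent.pt hunk above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:29ddf058a32db98bee9716c416a6af9c7c90a8edb11e1a09c1f10cc3e51d0554
size 21167174
"""

info = parse_lfs_pointer(POINTER)
size_mib = info["size_bytes"] / (1024 * 1024)
print(f"{info['hash_algorithm']} digest, {size_mib:.2f} MiB checkpoint")
```

The size field shows that the actual agent.pt checkpoint is about 20 MiB, stored outside the Git history.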
hyperparameters.json ADDED
@@ -0,0 +1 @@
+ {"trainmethod": "RND", "envid": "SuperMarioBros-v0", "maxstepperepisode": 18000, "learningrate": 0.0001, "numenv": 128, "numstep": 128, "gamma": 0.999, "intgamma": 0.99, "lambda": 0.95, "usegae": true, "clipgradnorm": 0.5, "entropy": 0.001, "epoch": 4, "minibatch": 4, "ppoeps": 0.1, "extcoef": 5.0, "intcoef": 1.0, "stickyaction": true, "actionprob": 0.25, "lifedone": false, "obsnormstep": 50}
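
To make the configuration easier to read, the sketch below loads the JSON and derives a few quantities from it. It rests on assumptions the commit does not confirm: that `minibatch` is the number of minibatches per epoch (not a minibatch size), and that `extcoef` and `intcoef` weight the extrinsic and intrinsic advantages in the usual RND way. The function `combine_advantages` is hypothetical and only illustrates that weighting.

```python
import json

# Contents of hyperparameters.json, copied from the hunk above.
HYPERPARAMETERS_JSON = (
    '{"trainmethod": "RND", "envid": "SuperMarioBros-v0", '
    '"maxstepperepisode": 18000, "learningrate": 0.0001, "numenv": 128, '
    '"numstep": 128, "gamma": 0.999, "intgamma": 0.99, "lambda": 0.95, '
    '"usegae": true, "clipgradnorm": 0.5, "entropy": 0.001, "epoch": 4, '
    '"minibatch": 4, "ppoeps": 0.1, "extcoef": 5.0, "intcoef": 1.0, '
    '"stickyaction": true, "actionprob": 0.25, "lifedone": false, '
    '"obsnormstep": 50}'
)
hp = json.loads(HYPERPARAMETERS_JSON)

# Transitions collected per rollout: parallel environments x steps per env.
rollout_size = hp["numenv"] * hp["numstep"]

# ASSUMPTION: "minibatch" counts minibatches per epoch, not samples.
minibatch_size = rollout_size // hp["minibatch"]
updates_per_rollout = hp["epoch"] * hp["minibatch"]


def combine_advantages(ext_adv: float, int_adv: float) -> float:
    """ASSUMPTION: weighted sum of extrinsic and intrinsic advantages."""
    return hp["extcoef"] * ext_adv + hp["intcoef"] * int_adv


print(rollout_size, minibatch_size, updates_per_rollout)
```

Under these assumptions each rollout holds 16384 transitions, split into 4 minibatches of 4096 and reused for 4 epochs. The separate `gamma` and `intgamma` values suggest the extrinsic and intrinsic returns are discounted independently.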
results.json ADDED
@@ -0,0 +1 @@
+ {"env_id": "SuperMarioBros-v0", "mean_reward": 1998.7, "n_evaluation_episodes": 10, "eval_datetime": "2024-07-03T21:57:06.394678"}
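
The README metric (`1998.70 +/- 315.66`) and results.json describe the same evaluation. The sketch below cross-checks the two and estimates the uncertainty of the mean. It assumes the `+/-` term is the standard deviation over the 10 evaluation episodes, which the commit does not state.

```python
import json
import math

# Metric string from the README front matter and the results.json contents.
README_METRIC = "1998.70 +/- 315.66"
RESULTS_JSON = (
    '{"env_id": "SuperMarioBros-v0", "mean_reward": 1998.7, '
    '"n_evaluation_episodes": 10, '
    '"eval_datetime": "2024-07-03T21:57:06.394678"}'
)

results = json.loads(RESULTS_JSON)
mean_text, _, spread_text = README_METRIC.partition("+/-")
readme_mean = float(mean_text)
readme_spread = float(spread_text)

# The two files should report the same mean reward.
means_agree = math.isclose(readme_mean, results["mean_reward"])

# ASSUMPTION: the "+/-" value is a per-episode standard deviation.
standard_error = readme_spread / math.sqrt(results["n_evaluation_episodes"])

print(f"means agree: {means_agree}, standard error ~ {standard_error:.2f}")
```

With only 10 episodes the standard error is roughly 100 reward points, so small differences between runs evaluated this way should be read with caution.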