vwxyzjn commited on
Commit
d02f85a
1 Parent(s): e2e977f

pushing model

Browse files
README.md ADDED
@@ -0,0 +1,56 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - CartPole-v1
4
+ - deep-reinforcement-learning
5
+ - reinforcement-learning
6
+ - custom-implementation
7
+ model-index:
8
+ - name: DQN
9
+ results:
10
+ - task:
11
+ type: reinforcement-learning
12
+ name: reinforcement-learning
13
+ dataset:
14
+ name: CartPole-v1
15
+ type: CartPole-v1
16
+ metrics:
17
+ - type: mean_reward
18
+ value: 70.00 +/- 19.16
19
+ name: mean_reward
20
+ verified: false
21
+ ---
22
+
23
+ # (CleanRL) **DQN** Agent Playing **CartPole-v1**
24
+
25
+ This is a trained model of a DQN agent playing CartPole-v1.
26
+ The model was trained by using [CleanRL](https://github.com/vwxyzjn/cleanrl) and the training code can be
27
+ found [here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/dqn.py).
28
+
29
+
30
+ # Hyperparameters
31
+ ```python
32
+ {'batch_size': 128,
33
+ 'buffer_size': 10000,
34
+ 'capture_video': False,
35
+ 'cuda': False,
36
+ 'end_e': 0.05,
37
+ 'env_id': 'CartPole-v1',
38
+ 'exp_name': 'dqn',
39
+ 'exploration_fraction': 0.5,
40
+ 'gamma': 0.99,
41
+ 'hf_entity': '',
42
+ 'learning_rate': 0.00025,
43
+ 'learning_starts': 10000,
44
+ 'save_model': True,
45
+ 'seed': 1,
46
+ 'start_e': 1,
47
+ 'target_network_frequency': 500,
48
+ 'torch_deterministic': True,
49
+ 'total_timesteps': 10000,
50
+ 'track': False,
51
+ 'train_frequency': 10,
52
+ 'upload_model': True,
53
+ 'wandb_entity': None,
54
+ 'wandb_project_name': 'cleanRL'}
55
+ ```
56
+
events.out.tfevents.1666103406.pop-os.2423.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7f62a6f05d7eee3afe6daec0eeaa931504edeaee9b4f2f5aea25395aa9659212
3
+ size 28923
events.out.tfevents.1666103408.pop-os.2423.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:6c243811698b8eef01c0d5fe3756f620dbc9b18c62ddc156977e25b80c0c9f5f
3
+ size 618
q_network.pth ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:37569e0ac54f61ecf5dde3387ec06184d48efe905501c80dd239668e91bf5884
3
+ size 45783