pushing model

Files changed (7)

README.md +64 -0
dqn_jax.cleanrl_model +0 -0
events.out.tfevents.1671208305.pop-os.1580220.0 +3 -0
replay.mp4 +0 -0
videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-0.mp4 +0 -0
videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-1.mp4 +0 -0
videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-8.mp4 +0 -0

README.md ADDED

	@@ -0,0 +1,64 @@

+---
+tags:
+- CartPole-v1
+- deep-reinforcement-learning
+- reinforcement-learning
+- custom-implementation
+library_name: cleanrl
+model-index:
+- name: DQN
+  results:
+  - task:
+      type: reinforcement-learning
+      name: reinforcement-learning
+    dataset:
+      name: CartPole-v1
+      type: CartPole-v1
+    metrics:
+    - type: mean_reward
+      value: 36.50 +/- 11.32
+      name: mean_reward
+      verified: false
+---
+# (CleanRL) **DQN** Agent Playing **CartPole-v1**
+This is a trained model of a DQN agent playing CartPole-v1.
+The model was trained with [CleanRL](https://github.com/vwxyzjn/cleanrl). The most up-to-date training code is available
+[here](https://github.com/vwxyzjn/cleanrl/blob/master/cleanrl/dqn_jax.py).
+## Command to reproduce the training
+```bash
+curl -OL https://huggingface.co/vwxyzjn/CartPole-v1-dqn_jax-seed1/raw/main/dqn_jax.py
+curl -OL https://huggingface.co/vwxyzjn/CartPole-v1-dqn_jax-seed1/raw/main/pyproject.toml
+curl -OL https://huggingface.co/vwxyzjn/CartPole-v1-dqn_jax-seed1/raw/main/poetry.lock
+poetry install --all-extras
+python dqn_jax.py --save-model --upload-model --hf-entity vwxyzjn --total-timesteps 1000
+```
+## Hyperparameters
+```python
+{'batch_size': 128,
+ 'buffer_size': 10000,
+ 'capture_video': False,
+ 'end_e': 0.05,
+ 'env_id': 'CartPole-v1',
+ 'exp_name': 'dqn_jax',
+ 'exploration_fraction': 0.5,
+ 'gamma': 0.99,
+ 'hf_entity': 'vwxyzjn',
+ 'learning_rate': 0.00025,
+ 'learning_starts': 10000,
+ 'save_model': True,
+ 'seed': 1,
+ 'start_e': 1,
+ 'target_network_frequency': 500,
+ 'total_timesteps': 1000,
+ 'track': False,
+ 'train_frequency': 10,
+ 'upload_model': True,
+ 'wandb_entity': None,
+ 'wandb_project_name': 'cleanRL'}
+```
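The hyperparameters above interact in a way that is easy to miss, so here is a minimal sketch of what they imply. It assumes a linear epsilon decay over `exploration_fraction * total_timesteps` steps and an update rule of the form "train when the step exceeds `learning_starts` and is a multiple of `train_frequency`". Both rules, and the helper names `linear_schedule`, `epsilon_at`, and `count_updates`, are illustrative assumptions and are not taken from this commit.

```python
# Values copied from the hyperparameter dump in the README above.
hp = {
    "start_e": 1.0,
    "end_e": 0.05,
    "exploration_fraction": 0.5,
    "total_timesteps": 1000,
    "learning_starts": 10000,
    "train_frequency": 10,
}


def linear_schedule(start_e, end_e, duration, t):
    """Assumed schedule: decay linearly from start_e to end_e over `duration` steps, then stay flat."""
    slope = (end_e - start_e) / duration
    return max(start_e + slope * t, end_e)


def epsilon_at(t, hp=hp):
    """Exploration rate at step t under the assumed linear schedule."""
    duration = hp["exploration_fraction"] * hp["total_timesteps"]
    return linear_schedule(hp["start_e"], hp["end_e"], duration, t)


def count_updates(hp=hp):
    """Number of gradient updates under the assumed update rule (hypothetical helper)."""
    return sum(
        1
        for t in range(hp["total_timesteps"])
        if t > hp["learning_starts"] and t % hp["train_frequency"] == 0
    )


if __name__ == "__main__":
    for step in (0, 250, 500, 999):
        print(step, round(epsilon_at(step), 4))
    print("updates:", count_updates())
```

Under these assumptions, `learning_starts` (10000) exceeds `total_timesteps` (1000), so the run would finish before any update is performed. That would be consistent with the low reported `mean_reward`, though the commit itself does not state this.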

dqn_jax.cleanrl_model ADDED

Binary file (43.9 kB).

events.out.tfevents.1671208305.pop-os.1580220.0 ADDED

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fb054c09e3c7a920037cf9695dbb707fdc85ce5da6bcb1195825bcf793fb1da5
+size 5677
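The three added lines are a Git LFS pointer, not the TensorBoard event file itself: they record the spec version, the SHA-256 object ID, and the size in bytes of the real file. As a rough sketch, such a pointer can be read into a dictionary as follows; `parse_lfs_pointer` is a hypothetical helper written for illustration, not part of Git LFS or CleanRL.

```python
def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict (illustrative helper)."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "oid_algo": algo,
        "oid": digest,
        "size": int(fields["size"]),
    }


# Pointer content copied from the diff above, without the leading '+' markers.
pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:fb054c09e3c7a920037cf9695dbb707fdc85ce5da6bcb1195825bcf793fb1da5
size 5677
"""

info = parse_lfs_pointer(pointer)
print(info["oid_algo"], info["size"])
```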

replay.mp4 ADDED

Binary file (5.85 kB).

videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-0.mp4 ADDED

Binary file (4.04 kB).

videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-1.mp4 ADDED

Binary file (6.25 kB).

videos/CartPole-v1__dqn_jax__1__1671208305-eval/rl-video-episode-8.mp4 ADDED

Binary file (5.85 kB).