JiajingChen
/

1

Reinforcement Learning

sample-factory

TensorBoard

deep-reinforcement-learning

Eval Results

Model card Files Files and versions Metrics Training metrics Community

JiajingChen commited on Feb 18, 2024

Commit

b35bb01

•

1 Parent(s): 22b1fc0

Upload README.md with huggingface_hub

Browse files

Files changed (1) hide show

README.md +35 -16

README.md CHANGED Viewed

@@ -1,37 +1,56 @@
 ---
-library_name: stable-baselines3
 tags:
-- PandaReachDense-v3
 - deep-reinforcement-learning
 - reinforcement-learning
-- stable-baselines3
 model-index:
-- name: A2C
   results:
   - task:
       type: reinforcement-learning
       name: reinforcement-learning
     dataset:
-      name: PandaReachDense-v3
-      type: PandaReachDense-v3
     metrics:
     - type: mean_reward
-      value: -0.21 +/- 0.08
       name: mean_reward
       verified: false
 ---
-# **A2C** Agent playing **PandaReachDense-v3**
-This is a trained model of a **A2C** agent playing **PandaReachDense-v3**
-using the [stable-baselines3 library](https://github.com/DLR-RM/stable-baselines3).
-## Usage (with Stable-baselines3)
-TODO: Add your code
-```python
-from stable_baselines3 import ...
-from huggingface_sb3 import load_from_hub
-...
 ```

 ---
+library_name: sample-factory
 tags:
 - deep-reinforcement-learning
 - reinforcement-learning
+- sample-factory
 model-index:
+- name: APPO
   results:
   - task:
       type: reinforcement-learning
       name: reinforcement-learning
     dataset:
+      name: doom_health_gathering_supreme
+      type: doom_health_gathering_supreme
     metrics:
     - type: mean_reward
+      value: 9.92 +/- 2.85
       name: mean_reward
       verified: false
 ---
+A(n) **APPO** model trained on the **doom_health_gathering_supreme** environment.
+This model was trained using Sample-Factory 2.0: https://github.com/alex-petrenko/sample-factory.
+Documentation for how to use Sample-Factory can be found at https://www.samplefactory.dev/
+## Downloading the model
+After installing Sample-Factory, download the model with:
 ```
+python -m sample_factory.huggingface.load_from_hub -r JiajingChen/1
+```
+## Using the model
+To run the model after download, use the `enjoy` script corresponding to this environment:
+```
+python -m .usr.local.lib.python3.10.dist-packages.colab_kernel_launcher --algo=APPO --env=doom_health_gathering_supreme --train_dir=./train_dir --experiment=1
+```
+You can also upload models to the Hugging Face Hub using the same script with the `--push_to_hub` flag.
+See https://www.samplefactory.dev/10-huggingface/huggingface/ for more details
+## Training with this model
+To continue training with this model, use the `train` script corresponding to this environment:
+```
+python -m .usr.local.lib.python3.10.dist-packages.colab_kernel_launcher --algo=APPO --env=doom_health_gathering_supreme --train_dir=./train_dir --experiment=1 --restart_behavior=resume --train_for_env_steps=10000000000
+```
+Note, you may have to adjust `--train_for_env_steps` to a suitably high number as the experiment will resume at the number of steps it concluded at.