File size: 1,443 Bytes
77c626a
 
 
 
 
 
 
d2cd292
 
77c626a
 
3d07200
77c626a
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
---
tags:
- deep-reinforcement-learning
- reinforcement-learning
- decision-transformer
- gym-continous-control

pipeline_tag: reinforcement-learning

---
# Decision Transformer model trained on medium trajectories sampled from the Gym HalfCheetah environment
This is a trained [Decision Transformer](https://arxiv.org/abs/2106.01345) model trained on medium trajectories sampled from the Gym HalfCheetah environment

<video src="https://huggingface.co/edbeeching/decision-transformer-gym-halfcheetah-medium/resolve/main/halfcheetah-medium.mp4" controls autoplay loop></video>

The following normlization coeficients are required to use this model:

mean = [-0.06845774,  0.01641455, -0.18354906, -0.27624607, -0.34061527, -0.09339716, -0.21321271, -0.08774239,  5.1730075,  -0.04275195, -0.03610836,  0.14053793, 0.06049833,  0.09550975,  0.067391,    0.00562739,  0.01338279] 

std = [0.07472999,  0.30234998,  0.3020731, 0.34417078, 0.17619242,  0.5072056,  0.25670078,  0.32948127,  1.2574149,   0.7600542,   1.9800916,   6.5653625,  7.4663677,   4.472223, 10.566964, 5.6719327,   7.498259]

See our [Blog Post](https://colab.research.google.com/drive/1K3UuajwoPY1MzRKNkONNRS3gS5DxZ-qF?usp=sharing),  [Colab notebook](https://colab.research.google.com/drive/1K3UuajwoPY1MzRKNkONNRS3gS5DxZ-qF?usp=sharing) or [Example Script](https://github.com/huggingface/transformers/tree/main/examples/research_projects/decision_transformer) for usage.