|
--- |
|
tags: |
|
- decisionTransformer |
|
- deep reinforcement |
|
datasets: |
|
- edbeeching/decision_transformer_gym_replay |
|
license: |
|
- mit |
|
--- |
|
|
|
### Running training |
|
- Num examples = 1000 |
|
- Num Epochs = 120 |
|
- Instantaneous batch size per device = 64 |
|
- Total train batch size = 64 |
|
- Gradient Accumulation steps = 1 |
|
- Total optimization steps = 1920 |
|
|
|
### Train Output |
|
- global_step = 1920 |
|
- train_runtime = 1849.2158 |
|
- train_samples_per_second = 64.892 |
|
- train_steps_per_second = 1.038 |
|
- train_loss = 0.04717305501302083 |
|
- epoch = 120.0 |
|
|
|
### Dataset |
|
- edbeeching/decision_transformer_gym_replay |
|
- halfcheetah-expert-v2 |