nicklashansen
commited on
Commit
•
8fb2a82
1
Parent(s):
9895c7c
Update README.md
Browse files
README.md
CHANGED
@@ -16,7 +16,7 @@ Official release of TD-MPC2 model checkpoints for the paper
|
|
16 |
|
17 |
[Nicklas Hansen](https://nicklashansen.github.io), [Hao Su](https://cseweb.ucsd.edu/~haosu)\*, [Xiaolong Wang](https://xiaolonw.github.io)\* (UC San Diego)
|
18 |
|
19 |
-
**Quick links:** [[Website]](https://nicklashansen.github.io/td-mpc2) [[Paper]](https://arxiv.org/abs/2310.16828) [[Dataset]](https://
|
20 |
|
21 |
|
22 |
## Model Details
|
@@ -33,7 +33,7 @@ We open-source a total of 324 TD-MPC2 model checkpoints, including 12 multi-task
|
|
33 |
### Model Sources
|
34 |
|
35 |
- **Repository:** [https://github.com/nicklashansen/tdmpc2](https://github.com/nicklashansen/tdmpc2)
|
36 |
-
- **Paper:** [https://
|
37 |
|
38 |
## Uses
|
39 |
|
@@ -57,7 +57,7 @@ We describe the training procedure for single-task and multi-task model checkpoi
|
|
57 |
|
58 |
### Training Procedure (Single-task)
|
59 |
|
60 |
-
Single-task checkpoints are trained using the [official implementation](https://github.com/nicklashansen/tdmpc2) with default hyperparameters. All models have 5M parameters. Most, but not all, models are trained until convergence. Refer to the individual task curves in our [paper](https://
|
61 |
|
62 |
### Training Procedure (Multi-task)
|
63 |
|
|
|
16 |
|
17 |
[Nicklas Hansen](https://nicklashansen.github.io), [Hao Su](https://cseweb.ucsd.edu/~haosu)\*, [Xiaolong Wang](https://xiaolonw.github.io)\* (UC San Diego)
|
18 |
|
19 |
+
**Quick links:** [[Website]](https://nicklashansen.github.io/td-mpc2) [[Paper]](https://arxiv.org/abs/2310.16828) [[Dataset]](https://huggingface.co/datasets/nicklashansen/tdmpc2)
|
20 |
|
21 |
|
22 |
## Model Details
|
|
|
33 |
### Model Sources
|
34 |
|
35 |
- **Repository:** [https://github.com/nicklashansen/tdmpc2](https://github.com/nicklashansen/tdmpc2)
|
36 |
+
- **Paper:** [https://arxiv.org/abs/2310.16828](https://arxiv.org/abs/2310.16828)
|
37 |
|
38 |
## Uses
|
39 |
|
|
|
57 |
|
58 |
### Training Procedure (Single-task)
|
59 |
|
60 |
+
Single-task checkpoints are trained using the [official implementation](https://github.com/nicklashansen/tdmpc2) with default hyperparameters. All models have 5M parameters. Most, but not all, models are trained until convergence. Refer to the individual task curves in our [paper](https://arxiv.org/abs/2310.16828) for a detailed breakdown of model performance on each task.
|
61 |
|
62 |
### Training Procedure (Multi-task)
|
63 |
|