nicklashansen commited on
Commit
8fb2a82
1 Parent(s): 9895c7c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -3
README.md CHANGED
@@ -16,7 +16,7 @@ Official release of TD-MPC2 model checkpoints for the paper
16
 
17
  [Nicklas Hansen](https://nicklashansen.github.io), [Hao Su](https://cseweb.ucsd.edu/~haosu)\*, [Xiaolong Wang](https://xiaolonw.github.io)\* (UC San Diego)
18
 
19
- **Quick links:** [[Website]](https://nicklashansen.github.io/td-mpc2) [[Paper]](https://arxiv.org/abs/2310.16828) [[Dataset]](https://www.tdmpc2.com/dataset)
20
 
21
 
22
  ## Model Details
@@ -33,7 +33,7 @@ We open-source a total of 324 TD-MPC2 model checkpoints, including 12 multi-task
33
  ### Model Sources
34
 
35
  - **Repository:** [https://github.com/nicklashansen/tdmpc2](https://github.com/nicklashansen/tdmpc2)
36
- - **Paper:** [https://www.tdmpc2.com](https://arxiv.org/abs/2310.16828)
37
 
38
  ## Uses
39
 
@@ -57,7 +57,7 @@ We describe the training procedure for single-task and multi-task model checkpoi
57
 
58
  ### Training Procedure (Single-task)
59
 
60
- Single-task checkpoints are trained using the [official implementation](https://github.com/nicklashansen/tdmpc2) with default hyperparameters. All models have 5M parameters. Most, but not all, models are trained until convergence. Refer to the individual task curves in our [paper](https://www.tdmpc2.com) for a detailed breakdown of model performance on each task.
61
 
62
  ### Training Procedure (Multi-task)
63
 
 
16
 
17
  [Nicklas Hansen](https://nicklashansen.github.io), [Hao Su](https://cseweb.ucsd.edu/~haosu)\*, [Xiaolong Wang](https://xiaolonw.github.io)\* (UC San Diego)
18
 
19
+ **Quick links:** [[Website]](https://nicklashansen.github.io/td-mpc2) [[Paper]](https://arxiv.org/abs/2310.16828) [[Dataset]](https://huggingface.co/datasets/nicklashansen/tdmpc2)
20
 
21
 
22
  ## Model Details
 
33
  ### Model Sources
34
 
35
  - **Repository:** [https://github.com/nicklashansen/tdmpc2](https://github.com/nicklashansen/tdmpc2)
36
+ - **Paper:** [https://arxiv.org/abs/2310.16828](https://arxiv.org/abs/2310.16828)
37
 
38
  ## Uses
39
 
 
57
 
58
  ### Training Procedure (Single-task)
59
 
60
+ Single-task checkpoints are trained using the [official implementation](https://github.com/nicklashansen/tdmpc2) with default hyperparameters. All models have 5M parameters. Most, but not all, models are trained until convergence. Refer to the individual task curves in our [paper](https://arxiv.org/abs/2310.16828) for a detailed breakdown of model performance on each task.
61
 
62
  ### Training Procedure (Multi-task)
63