arth-shukla
committed on
Update README.md
README.md
CHANGED
@@ -6,6 +6,13 @@ viewer: false

 # Model Checkpoints for ManiSkill-HAB

+**[Paper](https://arxiv.org/abs/2412.13211)**
+| **[Website](https://arth-shukla.github.io/mshab)**
+| **[Code](https://github.com/arth-shukla/mshab)**
+| **[Models](https://huggingface.co/arth-shukla/mshab_checkpoints)**
+| **[(Full) Dataset](https://arth-shukla.github.io/mshab/#dataset-section)**
+| **[Supplementary](https://sites.google.com/view/maniskill-hab)**
+
 RL (SAC, PPO) and IL (BC) baselines for ManiSkill-HAB. Each checkpoint includes a torch checkpoint `policy.pt` (model, optimizer/scheduler state, other trainable parameters) and a train config `config.yml` with hyperparemeters and env kwargs.

 RL Pick/Place policies are trained using SAC due to improved performance, while Open/Close is trained with PPO for wall-time efficiency (see Appendix A.4.3). All-object RL policies are under `all/` directories, while per-object policies are under directories labeled by the object name. IL policies do not have per-object Pick/Place variants.
@@ -14,4 +21,23 @@ RL Pick/Place policies are trained using SAC due to improved performance, while
 To download these policies, run the following:
 ```
 huggingface-cli download arth-shukla/mshab_checkpoints --local-dir mshab_checkpoints
+```
+
+If you use ManiSkill-HAB in your work, please consider citing the following:
+
+```
+@article{shukla2024maniskillhab,
+author = {Arth Shukla and Stone Tao and Hao Su},
+title = {ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks},
+journal = {CoRR},
+volume = {abs/2412.13211},
+year = {2024},
+url = {https://doi.org/10.48550/arXiv.2412.13211},
+doi = {10.48550/ARXIV.2412.13211},
+eprinttype = {arXiv},
+eprint = {2412.13211},
+timestamp = {Mon, 09 Dec 2024 01:29:24 +0100},
+biburl = {https://dblp.org/rec/journals/corr/abs-2412-13211.bib},
+bibsource = {dblp computer science bibliography, https://dblp.org}
+}
 ```
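The README text in this diff says each checkpoint pairs a `policy.pt` with a `config.yml`, and that all-object policies live under `all/` directories. As a rough illustration, the sketch below mirrors the download command in Python and then lists complete checkpoint directories in a local copy. It is not part of the repository: `download_checkpoints` and `find_checkpoints` are hypothetical helper names, and the sketch assumes that both files of a checkpoint sit in the same directory and that `all` appears as a path component. The actual directory layout may differ.

```python
from pathlib import Path


def download_checkpoints(local_dir="mshab_checkpoints"):
    """Fetch the checkpoint repo, mirroring the huggingface-cli command (needs network)."""
    # Imported lazily so find_checkpoints works without huggingface_hub installed.
    from huggingface_hub import snapshot_download

    path = snapshot_download(
        repo_id="arth-shukla/mshab_checkpoints", local_dir=local_dir
    )
    return Path(path)


def find_checkpoints(root):
    """List directories under root that contain both policy.pt and config.yml.

    Returns {"all": [...], "other": [...]} with POSIX-style paths relative to
    root. "all" holds directories with an `all` path component (assumed to be
    the all-object policies); everything else goes under "other".
    """
    root = Path(root)
    found = {"all": [], "other": []}
    for policy in sorted(root.rglob("policy.pt")):
        ckpt_dir = policy.parent
        if not (ckpt_dir / "config.yml").is_file():
            continue  # skip directories missing the train config
        rel = ckpt_dir.relative_to(root)
        key = "all" if "all" in rel.parts else "other"
        found[key].append(rel.as_posix())
    return found
```

Calling `find_checkpoints(download_checkpoints())` would list whatever complete checkpoints the downloaded copy contains; nothing here loads the weights themselves.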