arth-shukla commited on
Commit
1928b2d
·
verified ·
1 Parent(s): 49c03ef

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +26 -0
README.md CHANGED
@@ -6,6 +6,13 @@ viewer: false
6
 
7
  # Model Checkpoints for ManiSkill-HAB
8
 
 
 
 
 
 
 
 
9
  RL (SAC, PPO) and IL (BC) baselines for ManiSkill-HAB. Each checkpoint includes a torch checkpoint `policy.pt` (model, optimizer/scheduler state, other trainable parameters) and a train config `config.yml` with hyperparemeters and env kwargs.
10
 
11
  RL Pick/Place policies are trained using SAC due to improved performance, while Open/Close is trained with PPO for wall-time efficiency (see Appendix A.4.3). All-object RL policies are under `all/` directories, while per-object policies are under directories labeled by the object name. IL policies do not have per-object Pick/Place variants.
@@ -14,4 +21,23 @@ RL Pick/Place policies are trained using SAC due to improved performance, while
14
  To download these policies, run the following:
15
  ```
16
  huggingface-cli download arth-shukla/mshab_checkpoints --local-dir mshab_checkpoints
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
17
  ```
 
6
 
7
  # Model Checkpoints for ManiSkill-HAB
8
 
9
+ **[Paper](https://arxiv.org/abs/2412.13211)**
10
+ | **[Website](https://arth-shukla.github.io/mshab)**
11
+ | **[Code](https://github.com/arth-shukla/mshab)**
12
+ | **[Models](https://huggingface.co/arth-shukla/mshab_checkpoints)**
13
+ | **[(Full) Dataset](https://arth-shukla.github.io/mshab/#dataset-section)**
14
+ | **[Supplementary](https://sites.google.com/view/maniskill-hab)**
15
+
16
  RL (SAC, PPO) and IL (BC) baselines for ManiSkill-HAB. Each checkpoint includes a torch checkpoint `policy.pt` (model, optimizer/scheduler state, other trainable parameters) and a train config `config.yml` with hyperparemeters and env kwargs.
17
 
18
  RL Pick/Place policies are trained using SAC due to improved performance, while Open/Close is trained with PPO for wall-time efficiency (see Appendix A.4.3). All-object RL policies are under `all/` directories, while per-object policies are under directories labeled by the object name. IL policies do not have per-object Pick/Place variants.
 
21
  To download these policies, run the following:
22
  ```
23
  huggingface-cli download arth-shukla/mshab_checkpoints --local-dir mshab_checkpoints
24
+ ```
25
+
26
+ If you use ManiSkill-HAB in your work, please consider citing the following:
27
+
28
+ ```
29
+ @article{shukla2024maniskillhab,
30
+ author = {Arth Shukla and Stone Tao and Hao Su},
31
+ title = {ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks},
32
+ journal = {CoRR},
33
+ volume = {abs/2412.13211},
34
+ year = {2024},
35
+ url = {https://doi.org/10.48550/arXiv.2412.13211},
36
+ doi = {10.48550/ARXIV.2412.13211},
37
+ eprinttype = {arXiv},
38
+ eprint = {2412.13211},
39
+ timestamp = {Mon, 09 Dec 2024 01:29:24 +0100},
40
+ biburl = {https://dblp.org/rec/journals/corr/abs-2412-13211.bib},
41
+ bibsource = {dblp computer science bibliography, https://dblp.org}
42
+ }
43
  ```