variante committed on
Commit
6f67077
1 Parent(s): 508fdf7

Update README.md

Files changed (1)
  1. README.md +33 -3
README.md CHANGED
@@ -1,3 +1,33 @@
- ---
- license: apache-2.0
- ---
+ ---
+ inference: false
+ license: apache-2.0
+ datasets:
+ - VIMA/VIMA-Data
+ tags:
+ - llara
+ - robotics
+ - vlm
+ pipeline_tag: object-detection
+ ---
+ <br>
+ <br>
+
+ # Model Card
+
+ This repository contains three object detection models released with the paper **LLaRA: Supercharging Robot Learning Data for Vision-Language Policy**.
+
+ [Xiang Li](https://xxli.me)<sup>1</sup>, [Cristina Mata](https://openreview.net/profile?id=~Cristina_Mata1)<sup>1</sup>, [Jongwoo Park](https://github.com/jongwoopark7978)<sup>1</sup>, [Kumara Kahatapitiya](https://www3.cs.stonybrook.edu/~kkahatapitiy)<sup>1</sup>, [Yoo Sung Jang](https://yjang43.github.io/)<sup>1</sup>, [Jinghuan Shang](https://elicassion.github.io/)<sup>1</sup>, [Kanchana Ranasinghe](https://kahnchana.github.io/)<sup>1</sup>, [Ryan Burgert](https://ryanndagreat.github.io/)<sup>1</sup>, [Mu Cai](https://pages.cs.wisc.edu/~mucai/)<sup>2</sup>, [Yong Jae Lee](https://pages.cs.wisc.edu/~yongjaelee/)<sup>2</sup>, and [Michael S. Ryoo](http://michaelryoo.com/)<sup>1</sup>
+
+ <sup>1</sup>Stony Brook University <sup>2</sup>University of Wisconsin-Madison
+
+ ## Model details
+
+ **Model type:**
+ This repository contains three models, each trained on one of three subsets converted from [VIMA-Data](https://huggingface.co/datasets/VIMA/VIMA-Data).
+ For the conversion code, see [convert_vima.ipynb](https://github.com/LostXine/LLaRA/blob/main/datasets/convert_vima.ipynb).
+
+ **Paper or resources for more information:**
+ https://github.com/LostXine/LLaRA
+
+ **Where to send questions or comments about the model:**
+ https://github.com/LostXine/LLaRA/issues
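
The metadata block added in this commit is YAML front matter delimited by `---` lines. As a rough illustration of how those fields can be read back, here is a minimal sketch that handles only the flat `key: value` and `- item` forms used above. The function name `parse_front_matter` and the `CARD` constant are hypothetical, values are kept as plain strings (so `inference` comes back as `"false"`, not a boolean), and a real YAML parser would be the safer choice for anything more complex.

```python
def parse_front_matter(text):
    """Split a model card into (metadata, body), supporting a small YAML subset."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}, text  # no front matter present
    # Find the closing '---' delimiter.
    end = None
    for i in range(1, len(lines)):
        if lines[i].strip() == "---":
            end = i
            break
    if end is None:
        return {}, text  # unterminated front matter: treat everything as body

    meta, key = {}, None
    for line in lines[1:end]:
        stripped = line.strip()
        if not stripped:
            continue
        if stripped.startswith("- ") and key is not None:
            # List item belonging to the most recently seen key.
            if not isinstance(meta[key], list):
                meta[key] = []
            meta[key].append(stripped[2:].strip())
        elif ":" in stripped:
            key, _, value = stripped.partition(":")
            key = key.strip()
            value = value.strip()
            # A key with no inline value starts a list.
            meta[key] = value if value else []
    body = "\n".join(lines[end + 1:])
    return meta, body


# Hypothetical sample: the front matter from this commit plus the card title.
CARD = """---
inference: false
license: apache-2.0
datasets:
- VIMA/VIMA-Data
tags:
- llara
- robotics
- vlm
pipeline_tag: object-detection
---
# Model Card
"""

meta, body = parse_front_matter(CARD)
print(meta["pipeline_tag"], meta["tags"])
```

The sketch deliberately avoids third-party dependencies; it does not handle nested mappings, quoted strings, or multi-line values.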