Upload README.md
Browse files
README.md
ADDED
@@ -0,0 +1,52 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Instruction for downloading data from the sft-data repository.
|
2 |
+
|
3 |
+
First, you would want to log in and access the huggingface data through using
|
4 |
+
|
5 |
+
```py
|
6 |
+
from huggingface_hub import login
|
7 |
+
login()
|
8 |
+
```
|
9 |
+
|
10 |
+
Then, you could either download the zip file of the all the sft data folders, which would look like
|
11 |
+
|
12 |
+
```py
|
13 |
+
from huggingface_hub import hf_hub_download
|
14 |
+
hf_hub_download(repo_id="LEVI-Project/sft-data", filename="sft-data.zip")
|
15 |
+
```
|
16 |
+
|
17 |
+
Notice that the `sft-data.zip` file above has the following structure:
|
18 |
+
|
19 |
+
```
|
20 |
+
sft-data
|
21 |
+
β README.md This README file.
|
22 |
+
ββββalf Folder for ALFWORLD.
|
23 |
+
β β alfworld.json The JSON file for ALFWORLD.
|
24 |
+
β ββββalf_data_folder Folder for the ALFWORLD environment.
|
25 |
+
β β alf_image_id_0 Folder 0 for ALFWORLD image data
|
26 |
+
β β alf_image_id_1 Folder 1 for ALFWORLD image data
|
27 |
+
β β alf_image_id_2 Folder 3 for ALFWORLD image data
|
28 |
+
β β alf_image_id_3 Folder 3 for ALFWORLD image data
|
29 |
+
β β alf_image_id_4 Folder 4 for ALFWORLD image data
|
30 |
+
ββββblackjack Folder for blackjack environment in the `gym_cards`
|
31 |
+
β blackjack_data_folder Folder for blackjack image data.
|
32 |
+
β blackjack.json The JSON file for blackjack.
|
33 |
+
ββββezpoints Folder for ezpoints environment in the `gym_cards`.
|
34 |
+
β ezpoints_data_folder Folder for ezpoints image data.
|
35 |
+
β ezpoints.json The JSON file for ezpoints.
|
36 |
+
ββββpoints24 Folder for points24 environment in the `gym_cards`.
|
37 |
+
β points24_data_folder Folder for points24 image data.
|
38 |
+
β points24.json The JSON file for points24.
|
39 |
+
ββββnumberline Folder for numberline environment in the `gym_cards`
|
40 |
+
β numberline_data_folder Folder for numberline image data.
|
41 |
+
β numberline.json The JSON file for numberline.
|
42 |
+
```
|
43 |
+
|
44 |
+
Also, you could choose to download the files for any environment out of the five ones. For example, you should be using the following code for downloading data from blackjack.
|
45 |
+
|
46 |
+
```py
|
47 |
+
from huggingface_hub import hf_hub_download
|
48 |
+
hf_hub_download(repo_id="LEVI-Project/llava-data", filename="blackjack.zip") # zip folder for image data folder
|
49 |
+
hf_hub_download(repo_id="LEVI-Project/llava-data", filename="blackjack.json") # JSON file
|
50 |
+
```
|
51 |
+
|
52 |
+
For ALFWORLD, notice that the zip file for the image data folder is `alf_data_folder.zip`.
|