yuslzp commited on
Commit
dba1671
β€’
1 Parent(s): 4819d9b

Upload README.md

Browse files
Files changed (1) hide show
  1. README.md +52 -0
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ # Instruction for downloading data from the sft-data repository.
2
+
3
+ First, you would want to log in and access the huggingface data through using
4
+
5
+ ```py
6
+ from huggingface_hub import login
7
+ login()
8
+ ```
9
+
10
+ Then, you could either download the zip file of the all the sft data folders, which would look like
11
+
12
+ ```py
13
+ from huggingface_hub import hf_hub_download
14
+ hf_hub_download(repo_id="LEVI-Project/sft-data", filename="sft-data.zip")
15
+ ```
16
+
17
+ Notice that the `sft-data.zip` file above has the following structure:
18
+
19
+ ```
20
+ sft-data
21
+ β”‚ README.md This README file.
22
+ └───alf Folder for ALFWORLD.
23
+ β”‚ β”‚ alfworld.json The JSON file for ALFWORLD.
24
+ β”‚ └───alf_data_folder Folder for the ALFWORLD environment.
25
+ β”‚ β”‚ alf_image_id_0 Folder 0 for ALFWORLD image data
26
+ β”‚ β”‚ alf_image_id_1 Folder 1 for ALFWORLD image data
27
+ β”‚ β”‚ alf_image_id_2 Folder 3 for ALFWORLD image data
28
+ β”‚ β”‚ alf_image_id_3 Folder 3 for ALFWORLD image data
29
+ β”‚ β”‚ alf_image_id_4 Folder 4 for ALFWORLD image data
30
+ └───blackjack Folder for blackjack environment in the `gym_cards`
31
+ β”‚ blackjack_data_folder Folder for blackjack image data.
32
+ β”‚ blackjack.json The JSON file for blackjack.
33
+ └───ezpoints Folder for ezpoints environment in the `gym_cards`.
34
+ β”‚ ezpoints_data_folder Folder for ezpoints image data.
35
+ β”‚ ezpoints.json The JSON file for ezpoints.
36
+ └───points24 Folder for points24 environment in the `gym_cards`.
37
+ β”‚ points24_data_folder Folder for points24 image data.
38
+ β”‚ points24.json The JSON file for points24.
39
+ └───numberline Folder for numberline environment in the `gym_cards`
40
+ β”‚ numberline_data_folder Folder for numberline image data.
41
+ β”‚ numberline.json The JSON file for numberline.
42
+ ```
43
+
44
+ Also, you could choose to download the files for any environment out of the five ones. For example, you should be using the following code for downloading data from blackjack.
45
+
46
+ ```py
47
+ from huggingface_hub import hf_hub_download
48
+ hf_hub_download(repo_id="LEVI-Project/llava-data", filename="blackjack.zip") # zip folder for image data folder
49
+ hf_hub_download(repo_id="LEVI-Project/llava-data", filename="blackjack.json") # JSON file
50
+ ```
51
+
52
+ For ALFWORLD, notice that the zip file for the image data folder is `alf_data_folder.zip`.