Mahesh Sinha
commited on
Commit
•
0aeb7c7
1
Parent(s):
9492d4e
Fixing Data Readme
Browse files- data/README.md +4 -4
data/README.md
CHANGED
@@ -10,10 +10,10 @@ curl https://github.com/teknium1/GPTeacher/blob/main/Roleplay/roleplay-similarit
|
|
10 |
## Convert the JSON data files to JSONL.
|
11 |
|
12 |
```shell
|
13 |
-
python3 ./scripts/alpaca_json_to_jsonl.py --
|
14 |
-
python3 ./scripts/alpaca_json_to_jsonl.py --
|
15 |
-
python3 ./scripts/alpaca_json_to_jsonl.py --
|
16 |
-
python3 ./scripts/alpaca_json_to_jsonl.py --
|
17 |
```
|
18 |
---
|
19 |
|
|
|
10 |
## Convert the JSON data files to JSONL.
|
11 |
|
12 |
```shell
|
13 |
+
python3 ./scripts/alpaca_json_to_jsonl.py --file data/alpaca_data_gpt4.json --output data/alpaca_data_gpt4.jsonl
|
14 |
+
python3 ./scripts/alpaca_json_to_jsonl.py --file data/raw/vicuna_cleaned.json --output data/vicuna_cleaned.jsonl
|
15 |
+
python3 ./scripts/alpaca_json_to_jsonl.py --file data/raw/roleplay-similarity_0.6-instruct-dataset.json --output data/roleplay-similarity_0.6-instruct-dataset.jsonl
|
16 |
+
python3 ./scripts/alpaca_json_to_jsonl.py --file data/raw/gpt4-instruct-similarity-0.6-dataset.json --output data/gpt4-instruct-similarity-0.6-dataset.jsonl
|
17 |
```
|
18 |
---
|
19 |
|