TriadParty
/

deepsex-34b

Text Generation

feature-extraction

Not-For-All-Audiences

Model card Files Files and versions Community

zhouliang commited on Dec 5, 2023

Commit

33502f3

•

1 Parent(s): 629e9cc

Update README.md

Files changed (1) hide show

README.md +30 -0

README.md CHANGED Viewed

@@ -18,6 +18,21 @@ Here are the steps to make this model:
 3. Prepare the limarp+pippa data set, clean it into alpaca format, and use [goliath-120b](https://huggingface.co/alpindale/goliath-120b), which is good at role-playing, to score each question and answer pair, and filter out the high-quality ones. 30k data.
 4. Use the data in 3 for sft on the base model obtained in 2, 6 epochs, r=16 alpha=32 for fine-tuning.
 *Effect*:
 Proficient in role-playing skills, while being highly accepted on NSFW, pure love words will appear from time to time. like:
 ```#3
@@ -44,6 +59,21 @@ Support me [here](https://ko-fi.com/mikolisa) :)
 3. 准备limarp+pippa数据集，统一清洗为alpaca格式，并且使用比较擅长角色扮演的[goliath-120b](https://huggingface.co/alpindale/goliath-120b)对每个问答对进行打分，筛选出其中质量高的大约30k数据。
 4. 对2中得到的base模型使用3中的数据进行sft，6个epochs，r=16 alpha=32进行微调。
 *效果*
 熟练的角色扮演技能，在NSFW上有很高接受度的同时，会时不时的出现纯爱的话语。如：
 ```#3

 3. Prepare the limarp+pippa data set, clean it into alpaca format, and use [goliath-120b](https://huggingface.co/alpindale/goliath-120b), which is good at role-playing, to score each question and answer pair, and filter out the high-quality ones. 30k data.
 4. Use the data in 3 for sft on the base model obtained in 2, 6 epochs, r=16 alpha=32 for fine-tuning.
+*Format*
+alpaca
+```[
+  {
+    "instruction": "user instruction (required)",
+    "input": "user input (optional)",
+    "output": "model response (required)",
+    "history": [
+      ["user instruction in the first round (optional)", "model response in the first round (optional)"],
+      ["user instruction in the second round (optional)", "model response in the second round (optional)"]
+    ]
+  }
+]```
 *Effect*:
 Proficient in role-playing skills, while being highly accepted on NSFW, pure love words will appear from time to time. like:
 ```#3
 3. 准备limarp+pippa数据集，统一清洗为alpaca格式，并且使用比较擅长角色扮演的[goliath-120b](https://huggingface.co/alpindale/goliath-120b)对每个问答对进行打分，筛选出其中质量高的大约30k数据。
 4. 对2中得到的base模型使用3中的数据进行sft，6个epochs，r=16 alpha=32进行微调。
+*格式*
+alpaca
+```[
+  {
+    "instruction": "user instruction (required)",
+    "input": "user input (optional)",
+    "output": "model response (required)",
+    "history": [
+      ["user instruction in the first round (optional)", "model response in the first round (optional)"],
+      ["user instruction in the second round (optional)", "model response in the second round (optional)"]
+    ]
+  }
+]```
 *效果*
 熟练的角色扮演技能，在NSFW上有很高接受度的同时，会时不时的出现纯爱的话语。如：
 ```#3