cognitivecomputations
/

dolphin-llama-13b

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ehartford commited on Jul 23, 2023

Commit

b442d12

•

1 Parent(s): 25fd655

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -21,7 +21,7 @@ After uncensoring, deduping, and cleaning, our dataset consists of:
 - 842,610 instructions of FLANv2 augmented with GPT-4 completions
 - 2,625,353 instructions of FLANv2 augmented with GPT-3.5 completions
-We followed the submix and system prompt distribution outlined in the Orca paper. With a few exceptions. We included all 75k of CoT in the FLAN-1m dataset rather than sampling that. Also, we found that many items were duplicated, so we removed duplicates, resulting in 3.5m instructs in the ChatGPT dataset.
 Then we filtered out instances of alignment, refusal, avoidance, and bias, in order to produce an uncensored model upon which can be layered your personalized alignment LoRA.

 - 842,610 instructions of FLANv2 augmented with GPT-4 completions
 - 2,625,353 instructions of FLANv2 augmented with GPT-3.5 completions
+We followed the submix and system prompt distribution outlined in the Orca paper. With a few exceptions. We included all 75k of CoT in the FLAN-1m dataset rather than sampling that. Also, we found that many items were duplicated, so we removed duplicates.
 Then we filtered out instances of alignment, refusal, avoidance, and bias, in order to produce an uncensored model upon which can be layered your personalized alignment LoRA.