adamo1139 committed on
Commit
79249ca
1 Parent(s): abee49d

Update README.md

Files changed (1)
  1. README.md +6 -3
README.md CHANGED
@@ -21,10 +21,11 @@ A chat with uncensored assistant.<|im_end|>
 {prompt}<|im_end|>
 <|im_start|>assistant
 
-Intended uses & limitations
+## Intended uses & limitations
 
 Use is limited by the Yi license.
-Known Issues
+
+## Known Issues
 
 I recommend setting the repetition penalty to around 1.05 to avoid repetition. So far I have had good results running this model at temperature 1.2. \
 Stories have ChatGPT-like paragraph spacing; I may work on this in the future, but it is not a high priority.
@@ -36,4 +37,6 @@ My next project is to attempt to de-contaminate base Yi-34B 4K and Yi-34B 200K u
 
 I was made aware of the frequent occurrence of the phrase "sending shivers down a spine" in generations during RP with v1, so I fixed those samples - it should be better now.
 I can hold up to 300000-500000 ctx with the 6bpw exl2 version and 8-bit cache - long context should work as well as in other models trained on the 200K version of Yi-6B.
-There is also some issue with handling long system messages for RP; I was planning to investigate it for v2 but didn't.
+There is also some issue with handling long system messages for RP; I was planning to investigate it for v2 but didn't.
+
+Samples of generations from this model are available here - https://huggingface.co/datasets/adamo1139/misc/tree/main/benchmarks
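The diff above shows the model's ChatML-style prompt template and recommends a repetition penalty around 1.05 with temperature 1.2. A minimal sketch of how those pieces fit together is below; `build_prompt` and `SAMPLING` are illustrative names invented here (not part of the model card), and the actual call into an inference backend such as exllamav2 is deliberately left out.

```python
# Sketch only: the <|im_start|>/<|im_end|> template and the sampling values
# come from the README; build_prompt and SAMPLING are hypothetical helpers.

def build_prompt(system: str, user: str) -> str:
    """Assemble a ChatML-style prompt matching the template in the README."""
    return (
        f"<|im_start|>system\n{system}<|im_end|>\n"
        f"<|im_start|>user\n{user}<|im_end|>\n"
        f"<|im_start|>assistant\n"
    )

# Sampling settings recommended in the card; pass these to your backend.
SAMPLING = {
    "temperature": 1.2,         # reported to work well for this model
    "repetition_penalty": 1.05, # helps avoid repetition
}

if __name__ == "__main__":
    prompt = build_prompt("A chat with uncensored assistant.", "Hello!")
    print(prompt)
```

Whichever loader you use, the key point is that generation stops cleanly when the model emits `<|im_end|>`, so that token should be configured as a stop sequence.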