Commit 8f99dff · Parent: 099085d
Update README.md

README.md CHANGED
@@ -8,7 +8,7 @@ See: https://huggingface.co/01-ai/Yi-34B-200K
  Yi-30B-200K quantized to 3.9bpw, which should allow for ~50K context on 24GB GPUs. Ask if you need another size.
 
- Quantized with 8K rows on a mix of wikitext and my own RP stories.
+ Quantized with 8K rows on a mix of wikitext, prompt formatting, and my own RP stories.
 
  Use with --enable-remote-code in text-gen-ui. Load with Exllamav2_HF, 8-bit cache, and *do not* use the `fast_tokenizer` option. The TFS preset seems to work well with Yi.