brucethemoose
commited on
Commit
•
abf1ecd
1
Parent(s):
8f99dff
Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ Yi-30B-200K quantized to 3.9bpw, which should allow for ~50K context on 24GB GPU
|
|
10 |
|
11 |
Quantized with 8K rows on a mix of wikitext, prompt formatting, and my own RP stories.
|
12 |
|
13 |
-
Use with --enable-remote-code in text-gen-ui. Load with Exllamav2_HF, 8-bit cache, and *do not* use the `fast_tokenizer` option. The TFS preset seems to work well with Yi.
|
14 |
|
15 |
|
16 |
|
|
|
10 |
|
11 |
Quantized with 8K rows on a mix of wikitext, prompt formatting, and my own RP stories.
|
12 |
|
13 |
+
Use with --enable-remote-code in text-gen-ui. Load with Exllamav2_HF, use 8-bit cache, and *do not* use the `fast_tokenizer` option. The TFS preset seems to work well with Yi.
|
14 |
|
15 |
|
16 |
|