brucethemoose
/

Capybara-Tess-Yi-34B-200K-exl2-31bpw-fiction

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

brucethemoose commited on Nov 19, 2023

Commit

871e0b3

•

1 Parent(s): eea669a

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -7,7 +7,7 @@ language:
 library_name: transformers
 pipeline_tag: text-generation
 ---
-Nous-Capybara-34B and Tess-M-Creative-v1.0` merged, then quantized with exllamav2 on 200 rows (400K tokens) on a long Vicuna format chat, a sci fi story and a fantasy story. This should hopefully yield better chat performance than the default wikitext quantization.
 Quantized to 3.1bpw, enough for **~75K context on a 24GB GPU.**

 library_name: transformers
 pipeline_tag: text-generation
 ---
+Nous-Capybara-34B and Tess-M-Creative-v1.0 merged, then quantized with exllamav2 on 200 rows (400K tokens) on a long Vicuna format chat, a sci fi story and a fantasy story. This should hopefully yield better chat performance than the default wikitext quantization.
 Quantized to 3.1bpw, enough for **~75K context on a 24GB GPU.**