dranger003/bagel-dpo-34b-v0.5-iMat.GGUF

dranger003 committed on Apr 2, 2024

Commit df13628 · verified · Parent: 46a1a74

Update README.md

Files changed (1)

README.md +4 -2


@@ -6,8 +6,10 @@ library_name: gguf
 pipeline_tag: text-generation
 base_model: jondurbin/bagel-dpo-34b-v0.5
 ---
-GGUF importance matrix (imatrix) quants for https://huggingface.co/jondurbin/bagel-dpo-34b-v0.5
-The importance matrix was trained for 100K tokens (200 batches of 512 tokens) using wiki.train.raw.
+* GGUF importance matrix (imatrix) quants for https://huggingface.co/jondurbin/bagel-dpo-34b-v0.5
+* The importance matrix was trained for 100K tokens (200 batches of 512 tokens) using wiki.train.raw.
+* The [imatrix is being used on the K-quants](https://github.com/ggerganov/llama.cpp/pull/4930) as well (below Q6_K).
+* Generated with llama.cpp commit `f87f7b89`
 | Layers | Context | [Template](https://huggingface.co/jondurbin/bagel-dpo-34b-v0.5#prompt-formatting) |
 | --- | --- | --- |
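
The README's "100K tokens" figure is a rounded product of the batch count and batch size it quotes. A minimal sketch of that arithmetic follows; the function and constant names are hypothetical, chosen only for illustration, and are not part of llama.cpp or this repository.

```python
# Figures quoted in the README: 200 batches of 512 tokens each.
BATCHES = 200
TOKENS_PER_BATCH = 512


def imatrix_training_tokens(batches: int, tokens_per_batch: int) -> int:
    """Return the total number of tokens covered by the given batches."""
    return batches * tokens_per_batch


total = imatrix_training_tokens(BATCHES, TOKENS_PER_BATCH)
print(f"{total:,} tokens")  # 102,400, which the README rounds to 100K
```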