Miqu 1 70b: a possible leak of Mistral Medium Alpha

---

Requantizations of a Q5_K_M quant of a trending 70b model, for which no better quant or FP16 is available, made through a Q8_0 intermediary step.

Miqudev provided Q5_K_M, Q4_K_M, and Q2_K quants from his probable FP16.

Here, you will find:
- Q3_K_M, Q3_K_S, Q3_K_XS, Q2_K_S, IQ3_XXS SOTA, and IQ2_XS SOTA, already available.
- Q3_K_L and Q4_K_S, quantizing tonight.
- IQ2_XXS SOTA, coming tomorrow.
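
For reference, the Q8_0 intermediary path mentioned above can be done with llama.cpp's `quantize` tool. The sketch below is only illustrative (file names are placeholders and this is not necessarily the exact invocation used for these files), written as a small Python wrapper:

```python
# Illustrative sketch of the Q8_0 intermediary path (placeholder file names,
# assumes llama.cpp's `quantize` binary is built in the current directory).
import subprocess

SRC = "miqu-1-70b.q5_K_M.gguf"   # source quant published by Miqudev (placeholder name)
MID = "miqu-1-70b.q8_0.gguf"     # near-lossless Q8_0 intermediary
OUT = "miqu-1-70b.IQ3_XXS.gguf"  # one of the smaller quants listed above

# Step 1: requantize the Q5_K_M up to Q8_0 (--allow-requantize permits quant -> quant).
subprocess.run(["./quantize", "--allow-requantize", SRC, MID, "Q8_0"], check=True)
# Step 2: quantize the Q8_0 intermediary down to the target type.
subprocess.run(["./quantize", "--allow-requantize", MID, OUT, "IQ3_XXS"], check=True)
```

Going through Q8_0 rather than quantizing Q5_K_M directly into the target type limits the compounding of quantization error, since Q8_0 is close to lossless.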

---

Bonus: a Kobold.CPP Frankenstein build which reads IQ3_XXS models and is not affected by the Kobold.CPP 1.56/1.57 slowdown, at the cost of lacking the Mixtral fix.
https://github.com/Nexesenex/kobold.cpp/releases/tag/v1.57_b2030

---

Miqu 70b has a RoPE theta (frequency base) of 1,000,000, like CodeLlama, and not 10,000, as Llama 2 models usually have.
To my knowledge, that sets it apart from ALL other Llama 2 models, besides the CodeLlamas, which also have a theta of 1,000,000.
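
If you want to check that value yourself, a minimal sketch with the `gguf` Python package (from the llama.cpp repo) could look like this; the file name is a placeholder, and the metadata key assumes a standard Llama-architecture GGUF:

```python
# Minimal sketch: read the RoPE frequency base ("theta") from a GGUF file's metadata.
# Assumes the `gguf` package from llama.cpp; the file name is a placeholder.
from gguf import GGUFReader

reader = GGUFReader("miqu-1-70b.q2_K.gguf")
field = reader.fields.get("llama.rope.freq_base")  # absent => llama.cpp falls back to 10,000
if field is not None:
    print(field.parts[field.data[0]])  # ~[1000000.] here, vs [10000.] for a stock Llama 2
```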

BUT Miqu is NOT a CodeLlama 70b (released only a few days after Miqu 70b), because...

So, CodeLlama 70b is nerfed like the other CodeLlamas in terms of general benchmarks, while Miqu matches the expectations for a FINETUNED Llama 2.

---

Benchmarks I made with the original Q2_K quant of Miqu 70b, made from the FP16 and published by Miqudev:

![image/png](https://cdn-uploads.huggingface.co/production/uploads/6451b24dc5d273f95482bfa4/wiDlIl1FMrVQo0fAcr3YO.png)