Update README.md
Browse files
README.md
CHANGED
@@ -28,8 +28,6 @@ Original model: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct
|
|
28 |
|
29 |
All quants made using imatrix option with dataset from [here](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8)
|
30 |
|
31 |
-
Experimental quants are made with `--output-tensor-type f16 --token-embedding-type f16` per [ZeroWw](https://huggingface.co/ZeroWw)'s suggestion, please provide any feedback on quality differences you spot.
|
32 |
-
|
33 |
## Prompt format
|
34 |
|
35 |
```
|
@@ -64,6 +62,12 @@ Experimental quants are made with `--output-tensor-type f16 --token-embedding-ty
|
|
64 |
| [Phi-3.1-mini-4k-instruct-IQ2_S.gguf](https://huggingface.co/bartowski/Phi-3.1-mini-4k-instruct-GGUF/blob/main/Phi-3.1-mini-4k-instruct-IQ2_S.gguf) | IQ2_S | 1.21GB | Very low quality, uses SOTA techniques to be usable. |
|
65 |
| [Phi-3.1-mini-4k-instruct-IQ2_XS.gguf](https://huggingface.co/bartowski/Phi-3.1-mini-4k-instruct-GGUF/blob/main/Phi-3.1-mini-4k-instruct-IQ2_XS.gguf) | IQ2_XS | 1.15GB | Very low quality, uses SOTA techniques to be usable. |
|
66 |
|
|
|
|
|
|
|
|
|
|
|
|
|
67 |
## Downloading using huggingface-cli
|
68 |
|
69 |
First, make sure you have hugginface-cli installed:
|
|
|
28 |
|
29 |
All quants made using imatrix option with dataset from [here](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8)
|
30 |
|
|
|
|
|
31 |
## Prompt format
|
32 |
|
33 |
```
|
|
|
62 |
| [Phi-3.1-mini-4k-instruct-IQ2_S.gguf](https://huggingface.co/bartowski/Phi-3.1-mini-4k-instruct-GGUF/blob/main/Phi-3.1-mini-4k-instruct-IQ2_S.gguf) | IQ2_S | 1.21GB | Very low quality, uses SOTA techniques to be usable. |
|
63 |
| [Phi-3.1-mini-4k-instruct-IQ2_XS.gguf](https://huggingface.co/bartowski/Phi-3.1-mini-4k-instruct-GGUF/blob/main/Phi-3.1-mini-4k-instruct-IQ2_XS.gguf) | IQ2_XS | 1.15GB | Very low quality, uses SOTA techniques to be usable. |
|
64 |
|
65 |
+
## Credits
|
66 |
+
|
67 |
+
Thank you kalomaze and Dampf for assistance in creating the imatrix calibration dataset
|
68 |
+
|
69 |
+
Thank you ZeroWw for the inspiration to experiment with embed/output
|
70 |
+
|
71 |
## Downloading using huggingface-cli
|
72 |
|
73 |
First, make sure you have hugginface-cli installed:
|