bartowski commited on
Commit
a6596c4
1 Parent(s): b25cb58

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -2
README.md CHANGED
@@ -28,8 +28,6 @@ Original model: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct
28
 
29
  All quants made using imatrix option with dataset from [here](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8)
30
 
31
- Experimental quants are made with `--output-tensor-type f16 --token-embedding-type f16` per [ZeroWw](https://huggingface.co/ZeroWw)'s suggestion, please provide any feedback on quality differences you spot.
32
-
33
  ## Prompt format
34
 
35
  ```
@@ -64,6 +62,12 @@ Experimental quants are made with `--output-tensor-type f16 --token-embedding-ty
64
  | [Phi-3.1-mini-4k-instruct-IQ2_S.gguf](https://huggingface.co/bartowski/Phi-3.1-mini-4k-instruct-GGUF/blob/main/Phi-3.1-mini-4k-instruct-IQ2_S.gguf) | IQ2_S | 1.21GB | Very low quality, uses SOTA techniques to be usable. |
65
  | [Phi-3.1-mini-4k-instruct-IQ2_XS.gguf](https://huggingface.co/bartowski/Phi-3.1-mini-4k-instruct-GGUF/blob/main/Phi-3.1-mini-4k-instruct-IQ2_XS.gguf) | IQ2_XS | 1.15GB | Very low quality, uses SOTA techniques to be usable. |
66
 
 
 
 
 
 
 
67
  ## Downloading using huggingface-cli
68
 
69
  First, make sure you have hugginface-cli installed:
 
28
 
29
  All quants made using imatrix option with dataset from [here](https://gist.github.com/bartowski1182/eb213dccb3571f863da82e99418f81e8)
30
 
 
 
31
  ## Prompt format
32
 
33
  ```
 
62
  | [Phi-3.1-mini-4k-instruct-IQ2_S.gguf](https://huggingface.co/bartowski/Phi-3.1-mini-4k-instruct-GGUF/blob/main/Phi-3.1-mini-4k-instruct-IQ2_S.gguf) | IQ2_S | 1.21GB | Very low quality, uses SOTA techniques to be usable. |
63
  | [Phi-3.1-mini-4k-instruct-IQ2_XS.gguf](https://huggingface.co/bartowski/Phi-3.1-mini-4k-instruct-GGUF/blob/main/Phi-3.1-mini-4k-instruct-IQ2_XS.gguf) | IQ2_XS | 1.15GB | Very low quality, uses SOTA techniques to be usable. |
64
 
65
+ ## Credits
66
+
67
+ Thank you kalomaze and Dampf for assistance in creating the imatrix calibration dataset
68
+
69
+ Thank you ZeroWw for the inspiration to experiment with embed/output
70
+
71
  ## Downloading using huggingface-cli
72
 
73
  First, make sure you have hugginface-cli installed: