InferenceIllusionist committed on
Commit 38f4356
1 Parent(s): 3fc53a8

Update README.md

Files changed (1)
  1. README.md +2 -1
README.md CHANGED
@@ -6,13 +6,14 @@ tags:
  - storywriting
  license: cc-by-nc-4.0
  ---
+ <img src="https://i.imgur.com/P68dXux.png" width="400"/>

  <h3> Model Card for Fimbulvetr-11B-v2-iMat-GGUF</h3>

  * Model creator: [Sao10K](https://huggingface.co/Sao10K/)
  * Original model: [Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)

- <b>Update 3/4/24: </b> Newest I-Quant format <b>[IQ4_XS](https://huggingface.co/InferenceIllusionist/Fimbulvetr-11B-v2-iMat-GGUF/blob/main/Fimbulvetr-11B-v2-iMat-IQ4_XS.gguf)</b> shows superior performance to previous I-quants @ a whopping 4.25 bpw in [benchmarks](https://github.com/ggerganov/llama.cpp/pull/5747)
+ <b>Update 4/15/24: Added a few missing quants to the list </b>

  Tested on latest llama.cpp & koboldcpp v.1.60.