Commit c87198c by BramVanroy
Parent: 5d7efec

Create README.md

Files changed (1)
  1. README.md +23 -0
README.md ADDED
@@ -0,0 +1,23 @@
+ ---
+ license: mit
+ language:
+ - nl
+ tags:
+ - gguf
+ ---
+
+ This repository contains quantized versions of [BramVanroy/fietje-2b-chat](https://huggingface.co/BramVanroy/fietje-2b-chat):
+
+ - `-f16` (5.6GB): best quality, but largest and slowest (recommended if you have the capacity, otherwise q8_0)
+ - `-q8_0` (3.0GB): minimal quality loss, smaller
+ - `-q5_k_m` (2.0GB): users have reported considerable quality loss in the chat `q5_k_m` version so you may want to avoid it
+
+ Also available on ollama:
+
+ ```sh
+ # defaults to f16
+ ollama run bramvanroy/fietje-2b-chat
+ ollama run bramvanroy/fietje-2b-chat:f16
+ ollama run bramvanroy/fietje-2b-chat:q8_0
+ ollama run bramvanroy/fietje-2b-chat:q5_k_m
+ ```
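
The size/quality trade-off described in the README could be turned into a small selection helper. The sketch below is not part of the commit: `pick_tag` is a hypothetical function name, and treating the listed file size as the minimum budget is an assumption (actual memory use at runtime will likely be somewhat higher). It prints the largest tag whose file fits a given budget in GB, and returns a non-zero status if even the smallest file does not fit.

```sh
# Hypothetical helper (not from the README): pick the largest quantization
# tag whose listed file size fits within a budget given in GB.
# Assumption: the listed file sizes (5.6, 3.0, 2.0 GB) are used as lower bounds.
pick_tag() {
  budget_gb="$1"
  awk -v b="$budget_gb" 'BEGIN {
    if (b >= 5.6)      print "f16"     # best quality, largest
    else if (b >= 3.0) print "q8_0"    # minimal quality loss
    else if (b >= 2.0) print "q5_k_m"  # reported quality loss, last resort
    else               exit 1          # nothing fits the budget
  }'
}
```

With a 4 GB budget, for instance, this would be used as `ollama run "bramvanroy/fietje-2b-chat:$(pick_tag 4)"`. Since the README reports considerable quality loss for `q5_k_m`, it may be reasonable to treat that result as a warning rather than a recommendation.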