mradermacher
commited on
Commit
•
d6c0ecf
1
Parent(s):
64051f3
auto-patch README.md
Browse files
README.md
CHANGED
@@ -36,6 +36,7 @@ more details, including on how to concatenate multi-part files.
|
|
36 |
| Link | Type | Size/GB | Notes |
|
37 |
|:-----|:-----|--------:|:------|
|
38 |
| [GGUF](https://huggingface.co/mradermacher/Aether-12b-GGUF/resolve/main/Aether-12b.IQ3_S.gguf) | IQ3_S | 5.7 | beats Q3_K* |
|
|
|
39 |
|
40 |
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
41 |
types (lower is better):
|
@@ -54,6 +55,6 @@ questions you might have and/or if you want some other model quantized.
|
|
54 |
|
55 |
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
|
56 |
me use its servers and providing upgrades to my workstation to enable
|
57 |
-
this work in my free time.
|
58 |
|
59 |
<!-- end -->
|
|
|
36 |
| Link | Type | Size/GB | Notes |
|
37 |
|:-----|:-----|--------:|:------|
|
38 |
| [GGUF](https://huggingface.co/mradermacher/Aether-12b-GGUF/resolve/main/Aether-12b.IQ3_S.gguf) | IQ3_S | 5.7 | beats Q3_K* |
|
39 |
+
| [GGUF](https://huggingface.co/mradermacher/Aether-12b-GGUF/resolve/main/Aether-12b.Q4_K_S.gguf) | Q4_K_S | 7.2 | fast, recommended |
|
40 |
|
41 |
Here is a handy graph by ikawrakow comparing some lower-quality quant
|
42 |
types (lower is better):
|
|
|
55 |
|
56 |
I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
|
57 |
me use its servers and providing upgrades to my workstation to enable
|
58 |
+
this work in my free time.
|
59 |
|
60 |
<!-- end -->
|