Not-For-All-Audiences

Inference Endpoints

imatrix

Model card Files Files and versions Community

mradermacher commited on 12 days ago

Commit

a14df41

•

1 Parent(s): e82011e

uploaded from nethype/db1

Browse files

Files changed (24) hide show

.gitattributes +22 -0
Psyfighter2-13B-vore.i1-IQ1_M.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ1_S.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ2_M.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ2_S.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ2_XS.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ2_XXS.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ3_M.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ3_S.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ3_XS.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ3_XXS.gguf +3 -0
Psyfighter2-13B-vore.i1-IQ4_XS.gguf +3 -0
Psyfighter2-13B-vore.i1-Q2_K.gguf +3 -0
Psyfighter2-13B-vore.i1-Q3_K_L.gguf +3 -0
Psyfighter2-13B-vore.i1-Q3_K_M.gguf +3 -0
Psyfighter2-13B-vore.i1-Q3_K_S.gguf +3 -0
Psyfighter2-13B-vore.i1-Q4_0.gguf +3 -0
Psyfighter2-13B-vore.i1-Q4_K_M.gguf +3 -0
Psyfighter2-13B-vore.i1-Q4_K_S.gguf +3 -0
Psyfighter2-13B-vore.i1-Q5_K_M.gguf +3 -0
Psyfighter2-13B-vore.i1-Q5_K_S.gguf +3 -0
Psyfighter2-13B-vore.i1-Q6_K.gguf +3 -0
README.md +82 -0
imatrix.dat +3 -0

.gitattributes CHANGED Viewed

@@ -33,3 +33,25 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text

 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ2_M.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ2_S.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ2_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ3_XXS.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
+Psyfighter2-13B-vore.i1-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
+imatrix.dat filter=lfs diff=lfs merge=lfs -text

Psyfighter2-13B-vore.i1-IQ1_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:61853f27fe33bebe4380a987268cff78df0ffa4e7df4aee681b62e00f793d0e8
+size 3138610688

Psyfighter2-13B-vore.i1-IQ1_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:9c0c35698d4e1fb296222d1e53dd80763d19f2ea96c6728a642f5aba218b33ca
+size 2898687488

Psyfighter2-13B-vore.i1-IQ2_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:e21b469e9b3b4b664eb754b32a66063888d48088ff9bfc2736f2596bf79fd5d0
+size 4517580288

Psyfighter2-13B-vore.i1-IQ2_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b27ed3203aa765b8b4d04b047fac8925364fa5965dfe18a0e22379cab9412eaa
+size 4197682688

Psyfighter2-13B-vore.i1-IQ2_XS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:1dd8a5be8b0c7e4806f664d468f7a8e244edf526caf32f231f1e54c5b46671cc
+size 3891148288

Psyfighter2-13B-vore.i1-IQ2_XXS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7bcbd6625c4ea530e0783452dd1b65ef68b02cf46c329b20b967a8133eaad2e9
+size 3538482688

Psyfighter2-13B-vore.i1-IQ3_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:83f2d8dd807a4477e3945598249eb0b1752c99015ffd5a0b1dca8644ed316f80
+size 5984511488

Psyfighter2-13B-vore.i1-IQ3_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:c96c44a557585369323867e0fb4800e7f07bfad5eeb9934f428266f8d999924e
+size 5658981888

Psyfighter2-13B-vore.i1-IQ3_XS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:926fe129acaeb8072c97cb84da9945c7dae0cbc39996097e89c95984785335ad
+size 5361612288

Psyfighter2-13B-vore.i1-IQ3_XXS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:d6828bfffdd1472c2cb4635ced86230fa5ac352c81643a25320e433773a2e7ef
+size 4960562688

Psyfighter2-13B-vore.i1-IQ4_XS.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:faa49dcf46f4b0d4801d2fce5c2628ca69129af053f78293b928fa4b1eb6a9de
+size 6964223488

Psyfighter2-13B-vore.i1-Q2_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:dd236315649eb60ff5f6f76a8003654e7cdcca5db81187af05861668f4ede89c
+size 4854271488

Psyfighter2-13B-vore.i1-Q3_K_L.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:fdb68dbc1190a0cfabddbf057f64ad1cbd13bbe617f801b909f89a04b4c7352b
+size 6929561088

Psyfighter2-13B-vore.i1-Q3_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7e092a1c3383b8f93a678525415fd87d714623248877ce5d484843182c90e227
+size 6337771008

Psyfighter2-13B-vore.i1-Q3_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:b026d9d6630a7984e5daba41c83234e7223b80b43400aaf35d233e64708a2453
+size 5658981888

Psyfighter2-13B-vore.i1-Q4_0.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a685de521549c53d34581d7e01d047250c5073b60c070461b014e377804384b5
+size 7387954688

Psyfighter2-13B-vore.i1-Q4_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3d5fce9e0001b8b8a06da9da2e4e6ac360deab7706fe6cbe00f5d3e68253a552
+size 7865957888

Psyfighter2-13B-vore.i1-Q4_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:965bd0b7598d19be925f03814cba2f8f16d3b9bcb0204c8fd3423fed3e608f03
+size 7423180288

Psyfighter2-13B-vore.i1-Q5_K_M.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:ff1fb5a25b18d5859a53dcd02800718518f77a9b66a42e13836ee3110f386607
+size 9229925888

Psyfighter2-13B-vore.i1-Q5_K_S.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:4a56ab86cbb6259075f2d83dc8589c1c85b1b294efb3c3bacab40105aa60e6ff
+size 8972287488

Psyfighter2-13B-vore.i1-Q6_K.gguf ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:5c25173ca204edae407a9e2726cf93c6d7204efb0c035fe783ade912b3a38429
+size 10679141888

README.md ADDED Viewed

	@@ -0,0 +1,82 @@

+---
+base_model: SnakyMcSnekFace/Psyfighter2-13B-vore
+language:
+- en
+library_name: transformers
+license: llama2
+model_type: llama
+prompt_template: "### Instruction: \nBelow is an instruction that describes a task.
+  Write a response that appropriately completes the request.\n### Input:\n{prompt}\n###
+  Response:\n"
+quantized_by: mradermacher
+tags:
+- pytorch
+- storywriting
+- finetuned
+- not-for-all-audiences
+---
+## About
+<!-- ### quantize_version: 2 -->
+<!-- ### output_tensor_quantised: 1 -->
+<!-- ### convert_type: hf -->
+<!-- ### vocab_type:  -->
+<!-- ### tags: nicoboss -->
+weighted/imatrix quants of https://huggingface.co/SnakyMcSnekFace/Psyfighter2-13B-vore
+<!-- provided-files -->
+static quants are available at https://huggingface.co/mradermacher/Psyfighter2-13B-vore-GGUF
+## Usage
+If you are unsure how to use GGUF files, refer to one of [TheBloke's
+READMEs](https://huggingface.co/TheBloke/KafkaLM-70B-German-V0.1-GGUF) for
+more details, including on how to concatenate multi-part files.
+## Provided Quants
+(sorted by size, not necessarily quality. IQ-quants are often preferable over similar sized non-IQ quants)
+| Link | Type | Size/GB | Notes |
+|:-----|:-----|--------:|:------|
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ1_S.gguf) | i1-IQ1_S | 3.0 | for the desperate |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ1_M.gguf) | i1-IQ1_M | 3.2 | mostly desperate |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ2_XXS.gguf) | i1-IQ2_XXS | 3.6 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ2_XS.gguf) | i1-IQ2_XS | 4.0 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ2_S.gguf) | i1-IQ2_S | 4.3 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ2_M.gguf) | i1-IQ2_M | 4.6 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q2_K.gguf) | i1-Q2_K | 5.0 | IQ3_XXS probably better |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ3_XXS.gguf) | i1-IQ3_XXS | 5.1 | lower quality |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ3_XS.gguf) | i1-IQ3_XS | 5.5 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ3_S.gguf) | i1-IQ3_S | 5.8 | beats Q3_K* |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q3_K_S.gguf) | i1-Q3_K_S | 5.8 | IQ3_XS probably better |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ3_M.gguf) | i1-IQ3_M | 6.1 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q3_K_M.gguf) | i1-Q3_K_M | 6.4 | IQ3_S probably better |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q3_K_L.gguf) | i1-Q3_K_L | 7.0 | IQ3_M probably better |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-IQ4_XS.gguf) | i1-IQ4_XS | 7.1 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q4_0.gguf) | i1-Q4_0 | 7.5 | fast, low quality |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q4_K_S.gguf) | i1-Q4_K_S | 7.5 | optimal size/speed/quality |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q4_K_M.gguf) | i1-Q4_K_M | 8.0 | fast, recommended |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q5_K_S.gguf) | i1-Q5_K_S | 9.1 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q5_K_M.gguf) | i1-Q5_K_M | 9.3 |  |
+| [GGUF](https://huggingface.co/mradermacher/Psyfighter2-13B-vore-i1-GGUF/resolve/main/Psyfighter2-13B-vore.i1-Q6_K.gguf) | i1-Q6_K | 10.8 | practically like static Q6_K |
+Here is a handy graph by ikawrakow comparing some lower-quality quant
+types (lower is better):
+![image.png](https://www.nethype.de/huggingface_embed/quantpplgraph.png)
+And here are Artefact2's thoughts on the matter:
+https://gist.github.com/Artefact2/b5f810600771265fc1e39442288e8ec9
+## FAQ / Model Request
+See https://huggingface.co/mradermacher/model_requests for some answers to
+questions you might have and/or if you want some other model quantized.
+## Thanks
+I thank my company, [nethype GmbH](https://www.nethype.de/), for letting
+me use its servers and providing upgrades to my workstation to enable
+this work in my free time. Additional thanks to [@nicoboss](https://huggingface.co/nicoboss) for giving me access to his private supercomputer, enabling me to provide many more imatrix quants, at much higher quality, than I would otherwise be able to.
+<!-- end -->

imatrix.dat ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:a18fbf332a81fc91cb5fbb1b4a1d511cbcad5fcca6024ed2fffd752cffd4f122
+size 7136325