bartowski committed on
Commit
ea5fec4
1 Parent(s): c92baaf

Llamacpp quants

.gitattributes CHANGED
@@ -33,3 +33,19 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ NeuralSirKrishna-7b-IQ3_M.gguf filter=lfs diff=lfs merge=lfs -text
37
+ NeuralSirKrishna-7b-IQ3_S.gguf filter=lfs diff=lfs merge=lfs -text
38
+ NeuralSirKrishna-7b-IQ4_NL.gguf filter=lfs diff=lfs merge=lfs -text
39
+ NeuralSirKrishna-7b-IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
40
+ NeuralSirKrishna-7b-Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
41
+ NeuralSirKrishna-7b-Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
42
+ NeuralSirKrishna-7b-Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
43
+ NeuralSirKrishna-7b-Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
44
+ NeuralSirKrishna-7b-Q4_0.gguf filter=lfs diff=lfs merge=lfs -text
45
+ NeuralSirKrishna-7b-Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
46
+ NeuralSirKrishna-7b-Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
47
+ NeuralSirKrishna-7b-Q5_0.gguf filter=lfs diff=lfs merge=lfs -text
48
+ NeuralSirKrishna-7b-Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
49
+ NeuralSirKrishna-7b-Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
50
+ NeuralSirKrishna-7b-Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
51
+ NeuralSirKrishna-7b-Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
NeuralSirKrishna-7b-IQ3_M.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:46e9e0c90c776acf092064479fcd43b956eac74f4454e5bfc83cb69d7fd26690
3
+ size 3284891360
NeuralSirKrishna-7b-IQ3_S.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:02ce5f66d03f48b61a627b28326f57a70535d60b49bd46100573825b703328e4
3
+ size 3182393056
NeuralSirKrishna-7b-IQ4_NL.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f1aff009a765015a543d361263c62d279391dbbf6b11547c46f7565477027cd7
3
+ size 4155053792
NeuralSirKrishna-7b-IQ4_XS.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d6c0cbe3ee14529cdf601e4b2a04d788cbaeb1edf92910783be0b651f2623197
3
+ size 3944388320
NeuralSirKrishna-7b-Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0ca1ff63e9e8a8ba7be6db0c9f90f3cefe361995b2b60c0bbede8c8d02d81880
3
+ size 2719241952
NeuralSirKrishna-7b-Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:82f07177a80f4a454d706419005f0478ddfa271fbdbe2fc229c99eb876c72019
3
+ size 3822024416
NeuralSirKrishna-7b-Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:94061274a63f2885197d539a028296e76920f9c68da933f206f03395052285f0
3
+ size 3518985952
NeuralSirKrishna-7b-Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5034fcb64e91f7a12518d8a018cb544ca1b6de53f3424c4bb7f650c882c88ad9
3
+ size 3164567264
NeuralSirKrishna-7b-Q4_0.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1ce976fb22c62f589a0e528db6e6773b31b55eef81645bd01f0cc8f0e2954e26
3
+ size 4108916448
NeuralSirKrishna-7b-Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ae6a36c5d72a150bdc7057bf12f66ebc037814927a3874554b6aca50357d8bd3
3
+ size 4368439008
NeuralSirKrishna-7b-Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:4fc02b2c9434eab49a4eb7e99151129d42669d04270148f2aa7dac197cf63059
3
+ size 4140373728
NeuralSirKrishna-7b-Q5_0.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a79f0913fd92b52e155b5151d153d0a6e37318cc60c1e9f789ea55e425925b35
3
+ size 4997715680
NeuralSirKrishna-7b-Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e164ce4f6b9f3aee0825ea592ea0a029dd8fb68957bd5f908b1456d69fa5bb29
3
+ size 5131409120
NeuralSirKrishna-7b-Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e7fea383b3ec2e3d0f219b1916b86b3664056d21b0804efb8c5c59d1518db935
3
+ size 4997715680
NeuralSirKrishna-7b-Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f375afb88d7d72150065caa1cb70c9f409d249b0c475a5d7070eeb3bfa66b74b
3
+ size 5942064864
NeuralSirKrishna-7b-Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f3ef7bf7c6908ceeed55e102fba71aa68e032f079b99c9dddc8e160a92731c3f
3
+ size 7695857376
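
Each `.gguf` entry above is a Git LFS pointer: three `key value` lines giving the spec version, a SHA-256 object ID, and the size in bytes. A minimal sketch of checking a downloaded file against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of Git LFS or this repository; only the pointer text itself is taken from this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid has the form '<algorithm>:<hex digest>', e.g. 'sha256:abcd...'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file's size and hash both match the pointer."""
    # Cheap check first: a size mismatch avoids hashing several gigabytes.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Stream in chunks so large model files are never fully in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer for NeuralSirKrishna-7b-Q8_0.gguf, copied from this commit.
Q8_0_POINTER = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:f3ef7bf7c6908ceeed55e102fba71aa68e032f079b99c9dddc8e160a92731c3f
size 7695857376
""")
```

Assuming the Q8_0 file has been downloaded into the working directory, `matches_pointer(Path("NeuralSirKrishna-7b-Q8_0.gguf"), Q8_0_POINTER)` should return `True` for an intact copy.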
README.md ADDED
@@ -0,0 +1,144 @@
1
+ ---
2
+ license: apache-2.0
3
+ tags:
4
+ - merge
5
+ - mergekit
6
+ - lazymergekit
7
+ - Kukedlc/NeuralKrishna-7B-v3
8
+ - Kukedlc/NeuralMarioMonarch-7B-slerp
9
+ - liminerity/M7-7b
10
+ base_model:
11
+ - Kukedlc/NeuralKrishna-7B-v3
12
+ - Kukedlc/NeuralMarioMonarch-7B-slerp
13
+ - liminerity/M7-7b
14
+ model-index:
15
+ - name: NeuralSirKrishna-7b
16
+ results:
17
+ - task:
18
+ type: text-generation
19
+ name: Text Generation
20
+ dataset:
21
+ name: AI2 Reasoning Challenge (25-Shot)
22
+ type: ai2_arc
23
+ config: ARC-Challenge
24
+ split: test
25
+ args:
26
+ num_few_shot: 25
27
+ metrics:
28
+ - type: acc_norm
29
+ value: 73.72
30
+ name: normalized accuracy
31
+ source:
32
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Kukedlc/NeuralSirKrishna-7b
33
+ name: Open LLM Leaderboard
34
+ - task:
35
+ type: text-generation
36
+ name: Text Generation
37
+ dataset:
38
+ name: HellaSwag (10-Shot)
39
+ type: hellaswag
40
+ split: validation
41
+ args:
42
+ num_few_shot: 10
43
+ metrics:
44
+ - type: acc_norm
45
+ value: 89.05
46
+ name: normalized accuracy
47
+ source:
48
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Kukedlc/NeuralSirKrishna-7b
49
+ name: Open LLM Leaderboard
50
+ - task:
51
+ type: text-generation
52
+ name: Text Generation
53
+ dataset:
54
+ name: MMLU (5-Shot)
55
+ type: cais/mmlu
56
+ config: all
57
+ split: test
58
+ args:
59
+ num_few_shot: 5
60
+ metrics:
61
+ - type: acc
62
+ value: 64.63
63
+ name: accuracy
64
+ source:
65
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Kukedlc/NeuralSirKrishna-7b
66
+ name: Open LLM Leaderboard
67
+ - task:
68
+ type: text-generation
69
+ name: Text Generation
70
+ dataset:
71
+ name: TruthfulQA (0-shot)
72
+ type: truthful_qa
73
+ config: multiple_choice
74
+ split: validation
75
+ args:
76
+ num_few_shot: 0
77
+ metrics:
78
+ - type: mc2
79
+ value: 75.6
80
+ source:
81
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Kukedlc/NeuralSirKrishna-7b
82
+ name: Open LLM Leaderboard
83
+ - task:
84
+ type: text-generation
85
+ name: Text Generation
86
+ dataset:
87
+ name: Winogrande (5-shot)
88
+ type: winogrande
89
+ config: winogrande_xl
90
+ split: validation
91
+ args:
92
+ num_few_shot: 5
93
+ metrics:
94
+ - type: acc
95
+ value: 85.32
96
+ name: accuracy
97
+ source:
98
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Kukedlc/NeuralSirKrishna-7b
99
+ name: Open LLM Leaderboard
100
+ - task:
101
+ type: text-generation
102
+ name: Text Generation
103
+ dataset:
104
+ name: GSM8k (5-shot)
105
+ type: gsm8k
106
+ config: main
107
+ split: test
108
+ args:
109
+ num_few_shot: 5
110
+ metrics:
111
+ - type: acc
112
+ value: 71.27
113
+ name: accuracy
114
+ source:
115
+ url: https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard?query=Kukedlc/NeuralSirKrishna-7b
116
+ name: Open LLM Leaderboard
117
+ quantized_by: bartowski
118
+ pipeline_tag: text-generation
119
+ ---
120
+
121
+ ## Llamacpp Quantizations of NeuralSirKrishna-7b
122
+
123
+ Using <a href="https://github.com/ggerganov/llama.cpp/">llama.cpp</a> release <a href="https://github.com/ggerganov/llama.cpp/releases/tag/b2440">b2440</a> for quantization.
124
+
125
+ Original model: https://huggingface.co/Kukedlc/NeuralSirKrishna-7b/
126
+
127
+ Download a file (not the whole branch) from below:
128
+
129
+ | Filename | Quant type | File Size | Description |
130
+ | -------- | ---------- | --------- | ----------- |
131
+ | [NeuralSirKrishna-7b-Q8_0.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q8_0.gguf) | Q8_0 | 7.69GB | Extremely high quality, generally unneeded but max available quant. |
132
+ | [NeuralSirKrishna-7b-Q6_K.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q6_K.gguf) | Q6_K | 5.94GB | Very high quality, near perfect, *recommended*. |
133
+ | [NeuralSirKrishna-7b-Q5_K_M.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q5_K_M.gguf) | Q5_K_M | 5.13GB | High quality, very usable. |
134
+ | [NeuralSirKrishna-7b-Q5_K_S.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q5_K_S.gguf) | Q5_K_S | 4.99GB | High quality, very usable. |
135
+ | [NeuralSirKrishna-7b-Q5_0.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q5_0.gguf) | Q5_0 | 4.99GB | High quality, older format, generally not recommended. |
136
+ | [NeuralSirKrishna-7b-Q4_K_M.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q4_K_M.gguf) | Q4_K_M | 4.36GB | Good quality, similar to 4.25 bpw. |
137
+ | [NeuralSirKrishna-7b-Q4_K_S.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q4_K_S.gguf) | Q4_K_S | 4.14GB | Slightly lower quality with small space savings. |
138
+ | [NeuralSirKrishna-7b-Q4_0.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q4_0.gguf) | Q4_0 | 4.10GB | Decent quality, older format, generally not recommended. |
139
+ | [NeuralSirKrishna-7b-Q3_K_L.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q3_K_L.gguf) | Q3_K_L | 3.82GB | Lower quality but usable, good for low RAM availability. |
140
+ | [NeuralSirKrishna-7b-Q3_K_M.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q3_K_M.gguf) | Q3_K_M | 3.51GB | Even lower quality. |
141
+ | [NeuralSirKrishna-7b-Q3_K_S.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q3_K_S.gguf) | Q3_K_S | 3.16GB | Low quality, not recommended. |
142
+ | [NeuralSirKrishna-7b-Q2_K.gguf](https://huggingface.co/bartowski/NeuralSirKrishna-7b-GGUF/blob/main/NeuralSirKrishna-7b-Q2_K.gguf) | Q2_K | 2.71GB | Extremely low quality, *not* recommended. |
143
+
144
+ Want to support my work? Visit my ko-fi page here: https://ko-fi.com/bartowski
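
The README asks readers to download a single file rather than the whole branch. One way to do that is sketched below, assuming the third-party `huggingface_hub` package is installed. The repository ID and file naming pattern come from the table links in the README; the helper names `quant_filename`, `resolve_url`, and `download_quant` are hypothetical.

```python
REPO_ID = "bartowski/NeuralSirKrishna-7b-GGUF"  # repository named in the table links
MODEL_STEM = "NeuralSirKrishna-7b"


def quant_filename(quant: str) -> str:
    """Build the file name for a quant type such as 'Q6_K'."""
    return f"{MODEL_STEM}-{quant}.gguf"


def resolve_url(quant: str, revision: str = "main") -> str:
    """Direct-download URL: the table links use /blob/, downloads use /resolve/."""
    return f"https://huggingface.co/{REPO_ID}/resolve/{revision}/{quant_filename(quant)}"


def download_quant(quant: str, local_dir: str = ".") -> str:
    """Fetch one quant file and return its local path."""
    # Imported lazily so the pure helpers above work without the package.
    from huggingface_hub import hf_hub_download

    return hf_hub_download(
        repo_id=REPO_ID,
        filename=quant_filename(quant),
        local_dir=local_dir,
    )


# Example (several gigabytes, so not run here):
# path = download_quant("Q6_K")
```

Only the requested file is transferred, so picking `Q6_K` from the table costs about 5.94GB of disk rather than the size of every quant combined.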