Upload folder using huggingface_hub
- Phi-3-mini-4k-instruct-Q2_K.gguf +2 -2
- Phi-3-mini-4k-instruct-Q3_K_L.gguf +2 -2
- Phi-3-mini-4k-instruct-Q3_K_M.gguf +2 -2
- Phi-3-mini-4k-instruct-Q3_K_S.gguf +2 -2
- Phi-3-mini-4k-instruct-Q4_0.gguf +2 -2
- Phi-3-mini-4k-instruct-Q4_K_M.gguf +2 -2
- Phi-3-mini-4k-instruct-Q4_K_S.gguf +2 -2
- Phi-3-mini-4k-instruct-Q5_0.gguf +2 -2
- Phi-3-mini-4k-instruct-Q5_K_M.gguf +2 -2
- Phi-3-mini-4k-instruct-Q5_K_S.gguf +2 -2
- Phi-3-mini-4k-instruct-Q6_K.gguf +2 -2
- Phi-3-mini-4k-instruct-Q8_0.gguf +2 -2
- README.md +15 -22
Phi-3-mini-4k-instruct-Q2_K.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:9b82044047ddec6ab4137d06651361840a7f1008a0eae8eea597e27759fbadec
+size 1446880320
Phi-3-mini-4k-instruct-Q3_K_L.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:44808ba99c26ca5c89ee29d1ff1c294675d06c07b9cebda5d78841cd6830288c
+size 2045135424
Phi-3-mini-4k-instruct-Q3_K_M.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:80f28d845dc4c6d0fef784362655f364c7a1ed196d9f858af06ca662e99065a4
+size 1877625408
Phi-3-mini-4k-instruct-Q3_K_S.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:32dbfe5c6000c4c6bb4e3bc2f679f37329da4ccdb893948252ff225c28bff9cb
+size 1681803840
Phi-3-mini-4k-instruct-Q4_0.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:a2cd87ccae8eb2b0836ffd7a7a3bc122ca6a62d0f5cd93dc983c0859f6e1e7b9
+size 2176182336
Phi-3-mini-4k-instruct-Q4_K_M.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:f83f14c7bbfd894a9a7502cfbd9a6759ce8286aa9799924624f529c647a8efe5
+size 2318919744
Phi-3-mini-4k-instruct-Q4_K_S.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:9e6ac67b3ed7929d3b63c1e00220340c19295c1d278ab11b7289c88fa7b187ec
+size 2193483840
Phi-3-mini-4k-instruct-Q5_0.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:ce547afba1d0927c083583b851ff27aaaaf9dbab2064ef92485fc6b9fd70fd35
+size 2641479744
Phi-3-mini-4k-instruct-Q5_K_M.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:ea219f963f6eee55169060fc1a54185dd308a7cac14061a4653d7ed9d06a3412
+size 2715011136
Phi-3-mini-4k-instruct-Q5_K_S.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:a692c07003686dfa5bd7a39826217e45ca9db89021762c6ca4d0cdc769115b8d
+size 2641479744
Phi-3-mini-4k-instruct-Q6_K.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:9c7e9e8bad768b2e4badcef1ec0d809fa6f81fb84a9353c22e31bfa0d5d4d1ab
+size 3135858240
Phi-3-mini-4k-instruct-Q8_0.gguf
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:863a8f851c2108ab1fed787f26deba4eeff4c5fa3e59ff7413363124e9493f35
+size 4061227584
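Each `.gguf` entry above is a Git LFS pointer: three `key value` lines giving the spec version, the SHA-256 object ID, and the size in bytes. As a minimal sketch (the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of this commit), a downloaded file could be checked against its pointer like this:

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the `key value` lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and hash the pointer records."""
    # Cheap check first: a size mismatch avoids hashing a multi-gigabyte file.
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Stream in chunks so the whole model never has to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# Pointer contents for the Q2_K file, copied from the diff above.
POINTER_Q2_K = """\
version https://git-lfs.github.com/spec/v1
oid sha256:9b82044047ddec6ab4137d06651361840a7f1008a0eae8eea597e27759fbadec
size 1446880320
"""
```

Assuming the Q2_K file has been downloaded into the working directory, `matches_pointer(Path("Phi-3-mini-4k-instruct-Q2_K.gguf"), parse_lfs_pointer(POINTER_Q2_K))` would report whether the local copy corresponds to this commit.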
README.md
CHANGED
@@ -1,23 +1,16 @@
 ---
-license: mit
-license_link: https://huggingface.co/microsoft/Phi-3-mini-4k-instruct/resolve/main/LICENSE
 language:
 - en
-
-
+library_name: transformers
+license: mit
 tags:
--
--
+- unsloth
+- transformers
+- phi3
+- phi
 - TensorBlock
 - GGUF
-
-parameters:
-temperature: 0
-widget:
-- messages:
-- role: user
-content: Can you provide ways to eat combinations of bananas and dragonfruits?
-base_model: microsoft/Phi-3-mini-4k-instruct
+base_model: unsloth/Phi-3-mini-4k-instruct
 ---

 <div style="width: auto; margin-left: auto; margin-right: auto">
@@ -31,9 +24,9 @@ base_model: microsoft/Phi-3-mini-4k-instruct
 </div>
 </div>

-##
+## unsloth/Phi-3-mini-4k-instruct - GGUF

-This repo contains GGUF format model files for [
+This repo contains GGUF format model files for [unsloth/Phi-3-mini-4k-instruct](https://huggingface.co/unsloth/Phi-3-mini-4k-instruct).

 The files were quantized using machines provided by [TensorBlock](https://tensorblock.co/), and they are compatible with llama.cpp as of [commit b4011](https://github.com/ggerganov/llama.cpp/commit/a6744e43e80f4be6398fc7733a01642c846dce1d).

@@ -51,16 +44,16 @@ The files were quantized using machines provided by [TensorBlock](https://tensor

 | Filename | Quant type | File Size | Description |
 | -------- | ---------- | --------- | ----------- |
-| [Phi-3-mini-4k-instruct-Q2_K.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q2_K.gguf) | Q2_K | 1.
+| [Phi-3-mini-4k-instruct-Q2_K.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q2_K.gguf) | Q2_K | 1.348 GB | smallest, significant quality loss - not recommended for most purposes |
 | [Phi-3-mini-4k-instruct-Q3_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_S.gguf) | Q3_K_S | 1.566 GB | very small, high quality loss |
-| [Phi-3-mini-4k-instruct-Q3_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_M.gguf) | Q3_K_M | 1.
-| [Phi-3-mini-4k-instruct-Q3_K_L.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_L.gguf) | Q3_K_L | 1.
+| [Phi-3-mini-4k-instruct-Q3_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_M.gguf) | Q3_K_M | 1.749 GB | very small, high quality loss |
+| [Phi-3-mini-4k-instruct-Q3_K_L.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q3_K_L.gguf) | Q3_K_L | 1.905 GB | small, substantial quality loss |
 | [Phi-3-mini-4k-instruct-Q4_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_0.gguf) | Q4_0 | 2.027 GB | legacy; small, very high quality loss - prefer using Q3_K_M |
-| [Phi-3-mini-4k-instruct-Q4_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_K_S.gguf) | Q4_K_S | 2.
-| [Phi-3-mini-4k-instruct-Q4_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_K_M.gguf) | Q4_K_M | 2.
+| [Phi-3-mini-4k-instruct-Q4_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_K_S.gguf) | Q4_K_S | 2.043 GB | small, greater quality loss |
+| [Phi-3-mini-4k-instruct-Q4_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q4_K_M.gguf) | Q4_K_M | 2.160 GB | medium, balanced quality - recommended |
 | [Phi-3-mini-4k-instruct-Q5_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_0.gguf) | Q5_0 | 2.460 GB | legacy; medium, balanced quality - prefer using Q4_K_M |
 | [Phi-3-mini-4k-instruct-Q5_K_S.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_K_S.gguf) | Q5_K_S | 2.460 GB | large, low quality loss - recommended |
-| [Phi-3-mini-4k-instruct-Q5_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_K_M.gguf) | Q5_K_M | 2.
+| [Phi-3-mini-4k-instruct-Q5_K_M.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q5_K_M.gguf) | Q5_K_M | 2.529 GB | large, very low quality loss - recommended |
 | [Phi-3-mini-4k-instruct-Q6_K.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q6_K.gguf) | Q6_K | 2.920 GB | very large, extremely low quality loss |
 | [Phi-3-mini-4k-instruct-Q8_0.gguf](https://huggingface.co/tensorblock/Phi-3-mini-4k-instruct-GGUF/tree/main/Phi-3-mini-4k-instruct-Q8_0.gguf) | Q8_0 | 3.782 GB | very large, extremely low quality loss - not recommended |

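The "File Size" column appears to be derived from the LFS pointer sizes in this same commit, with "GB" meaning 1024³ bytes rather than 10⁹. That reading is an inference from the numbers, not something the README states. A small sketch (the names `LFS_SIZES` and `size_label` are illustrative only) reproduces the column from the byte counts:

```python
GIB = 1024 ** 3  # assumption: the README's "GB" is 2**30 bytes

# Byte sizes taken from the Git LFS pointers in this commit.
LFS_SIZES = {
    "Q2_K": 1446880320,
    "Q3_K_S": 1681803840,
    "Q3_K_M": 1877625408,
    "Q3_K_L": 2045135424,
    "Q4_0": 2176182336,
    "Q4_K_S": 2193483840,
    "Q4_K_M": 2318919744,
    "Q5_0": 2641479744,
    "Q5_K_S": 2641479744,
    "Q5_K_M": 2715011136,
    "Q6_K": 3135858240,
    "Q8_0": 4061227584,
}


def size_label(num_bytes: int) -> str:
    """Format a byte count the way the README table does, to three decimals."""
    return f"{num_bytes / GIB:.3f} GB"


for quant, num_bytes in LFS_SIZES.items():
    print(f"{quant:7s} {size_label(num_bytes)}")
```

Under that assumption, Q2_K comes out as `1.348 GB` and Q8_0 as `3.782 GB`, matching the table rows added in this commit.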