MaziyarPanahi commited on
Commit
d83f33e
·
verified ·
1 Parent(s): 2e215e1

Upload folder using huggingface_hub (#1)

Browse files

- 58921dce9808db69c818cd7ef2253c92511b450bfb753a7429c463ced1ab6ad1 (6cbf41c510f8feec59a99f7c8d241610741f165a)
- 0fc28181d54f0c7a46c386019e0d243ca13e6c1a8cfa1176e31fb88c0bc0c66e (fa161dd40c19a7932a341c21de5d98249d2a2a57)
- 440e7af9efadc0cae9aaea3a2b2d2437d81f0da04f89c9daea4fa9c4ff0768f7 (6b440335a4a11fba31d8fc8c52abb64a6d24bc7d)
- 7c190992f9dfd49b8c22e7fbe87faadf151c9c81a6a55d6d04c3cd6c9c1e272d (1fc0ebada6fd00d52b5ee37f94cb66fd826b7b8f)
- 08d5eb66889a053a86a0e0fc5d4502660cfbbc0536a0e9c46e8fb409dbe26f34 (97554279b21884d34728bf0ed974a2c1139f2ac8)
- bd8d0650ee7da4375206c40ed1d9c545eb0628a84e5e67d5a3cc2c47ed338aa5 (2107450851a5a0e132390a196a9456cb5e761047)
- 5445187ee8d87cb1a5bcabba4803f0d5d07612275651abcab1f204cb1c0155e3 (4a26fb3a395cf801f584ac456e88834ea78dcf44)
- df55af329e2942fc594678c12825bf9b10f89fc71dfe7976df0d65bd7a777c12 (263198e67e8cfbf76bd33ad1fb6c558414491170)
- 761d84ba2805fb01959fc61401bc125f9fcc1c50dfb81e95f102baffbcc21208 (122ddc4b7ca3ffc6ff7f2d5c9be495cece257b14)
- ef1aee639c192b832c0e596f416082d3e1e942d26fbae89df3b85ebc8896a691 (b27cb8f9e7c73b4d14debd9c323707e17f053ccc)
- bbb61fb1b9abf4297b0f1210caa30b467ba6ead6e735c6b81c129d714ea34eee (0ea602442014843a6beee8621adbd6d4a3ddaadb)
- aa10533090e7cbb3596037b9ebd1fb8b714edd625e31fce094a7b04cc91fa24b (b8bbfa43c78394610eba96293a535ad9bb9db96b)
- e89a01b26463adff4fdfb12196d54324d69d1b82b5ef01fdd93c7d3137a85273 (9daf4e6a3546d97fbed276a74318b67d6c208fb7)
- a495a4b1f1cc828c434e6d5cf19b7745f200368646364da4c7d5fbe64b9bde4f (0943178755022e74fd41c37823528932afaca64e)
- 1d1150edbb106c67fad07beb8d37e9579fbdb8926bac5ac2db9250af34f9adbf (95024d398f37b4e6ef949816087f18c0a65758cc)
- deeec3669f2150aa7cb829a89ee020ec61b209a21c1929407e35cbfb5bb31471 (15f38168eb8f01bfdf508254acd3bc789ff27762)
- dc68d01f5fb9887742d14e8589c90379a0f5ce95fe4ce8f0acd11d2e861efbd4 (ab05d06673bbcc22ce754f8f5b046302636b2177)

.gitattributes CHANGED
@@ -33,3 +33,20 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
33
  *.zip filter=lfs diff=lfs merge=lfs -text
34
  *.zst filter=lfs diff=lfs merge=lfs -text
35
  *tfevents* filter=lfs diff=lfs merge=lfs -text
36
+ Qwen2.5-14B-Instruct.IQ1_M.gguf filter=lfs diff=lfs merge=lfs -text
37
+ Qwen2.5-14B-Instruct.IQ1_S.gguf filter=lfs diff=lfs merge=lfs -text
38
+ Qwen2.5-14B-Instruct.IQ2_XS.gguf filter=lfs diff=lfs merge=lfs -text
39
+ Qwen2.5-14B-Instruct.IQ3_XS.gguf filter=lfs diff=lfs merge=lfs -text
40
+ Qwen2.5-14B-Instruct.IQ4_XS.gguf filter=lfs diff=lfs merge=lfs -text
41
+ Qwen2.5-14B-Instruct.Q2_K.gguf filter=lfs diff=lfs merge=lfs -text
42
+ Qwen2.5-14B-Instruct.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
43
+ Qwen2.5-14B-Instruct.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
44
+ Qwen2.5-14B-Instruct.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
45
+ Qwen2.5-14B-Instruct.Q4_K_M.gguf filter=lfs diff=lfs merge=lfs -text
46
+ Qwen2.5-14B-Instruct.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
47
+ Qwen2.5-14B-Instruct.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
48
+ Qwen2.5-14B-Instruct.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
49
+ Qwen2.5-14B-Instruct.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
50
+ Qwen2.5-14B-Instruct.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
51
+ Qwen2.5-14B-Instruct.fp16.gguf filter=lfs diff=lfs merge=lfs -text
52
+ Qwen2.5-14B-Instruct-GGUF_imatrix.dat filter=lfs diff=lfs merge=lfs -text
Qwen2.5-14B-Instruct-GGUF_imatrix.dat ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d3c49663e84d0de5639c43658816d7723b25b9ffb8339f891a78316e16d97be2
3
+ size 8563586
Qwen2.5-14B-Instruct.IQ1_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bf0a1f39f566b7ceecb04f6992de1d164b0ee9d89a32c70dda6b84c227286679
3
+ size 3872309344
Qwen2.5-14B-Instruct.IQ1_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1c6433848c1743e10781d154781eafbaf38bbdc12c531f84b67cf7de77a2cd9c
3
+ size 3607994464
Qwen2.5-14B-Instruct.IQ2_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:62406e612609ef432ac844cd20b31a91e4b136b248aef519f052f09f02f250ec
3
+ size 4704575584
Qwen2.5-14B-Instruct.IQ3_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a1bb47187017fd0571224dadc603bc2dda54862c8bcdd8fcf83c72ef963e2cb6
3
+ size 6383362144
Qwen2.5-14B-Instruct.IQ4_XS.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f7fd975d4a158f63bf9d76972f6c84ad59d90f6b68437e019fe576149ad41ea1
3
+ size 8119840864
Qwen2.5-14B-Instruct.Q2_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:58a079a9fd9e30538b16ddc7665c37ee015e65a39aa56d15415a023f8959210e
3
+ size 5770498144
Qwen2.5-14B-Instruct.Q3_K_L.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:935b595b2d471a22599c5ea3b002646278db2554381e367ee89f6e2fe8fca6a8
3
+ size 7924768864
Qwen2.5-14B-Instruct.Q3_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:be8bb081beba39d66107a00b113c4bfcee1949bf902a0cf64b6b034e6fdce750
3
+ size 7339204704
Qwen2.5-14B-Instruct.Q3_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3cb7f2afb40b09ffcd57e67515def3f598b3ad7aeb7b936f7b53ce94fb256d3e
3
+ size 6659596384
Qwen2.5-14B-Instruct.Q4_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5f3df2255a00c6f1c64f35246e0cf3c37d06fa277c28b4d3fec45a151996afea
3
+ size 8988110944
Qwen2.5-14B-Instruct.Q4_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3f2cc5c0c06b77f14d63f2cf14fe253c789b24442fb79b862244787a95af6cd0
3
+ size 8573431904
Qwen2.5-14B-Instruct.Q5_K_M.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a562b34eaf0913b7f6b88434fa650f4942611acef5c1edba7ad16ec8edeb8a62
3
+ size 10508873824
Qwen2.5-14B-Instruct.Q5_K_S.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3bf7f7a8b103c787e1467bf4fc4a153e55b922c20534426143b8e95200344934
3
+ size 10266554464
Qwen2.5-14B-Instruct.Q6_K.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3977f10ff64edb13eb01eeda28ba5bf98604424f39e20ab63fbf2b4680f945eb
3
+ size 12124684384
Qwen2.5-14B-Instruct.Q8_0.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:f3b2fe6cf51878d3ebbd80a173c8e5472327fd9633b287251c0d5cf203fd913f
3
+ size 15701598304
Qwen2.5-14B-Instruct.fp16.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5fba8bdd3c6c3c454e79209c04de2fbf144acd7b0c023834f2c91c050f1ffb13
3
+ size 29547716480
README.md ADDED
@@ -0,0 +1,46 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ tags:
3
+ - quantized
4
+ - 2-bit
5
+ - 3-bit
6
+ - 4-bit
7
+ - 5-bit
8
+ - 6-bit
9
+ - 8-bit
10
+ - GGUF
11
+ - text-generation
12
+ - text-generation
13
+ model_name: Qwen2.5-14B-Instruct-GGUF
14
+ base_model: Qwen/Qwen2.5-14B-Instruct
15
+ inference: false
16
+ model_creator: Qwen
17
+ pipeline_tag: text-generation
18
+ quantized_by: MaziyarPanahi
19
+ ---
20
+ # [MaziyarPanahi/Qwen2.5-14B-Instruct-GGUF](https://huggingface.co/MaziyarPanahi/Qwen2.5-14B-Instruct-GGUF)
21
+ - Model creator: [Qwen](https://huggingface.co/Qwen)
22
+ - Original model: [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
23
+
24
+ ## Description
25
+ [MaziyarPanahi/Qwen2.5-14B-Instruct-GGUF](https://huggingface.co/MaziyarPanahi/Qwen2.5-14B-Instruct-GGUF) contains GGUF format model files for [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct).
26
+
27
+ ### About GGUF
28
+
29
+ GGUF is a new format introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp.
30
+
31
+ Here is an incomplete list of clients and libraries that are known to support GGUF:
32
+
33
+ * [llama.cpp](https://github.com/ggerganov/llama.cpp). The source project for GGUF. Offers a CLI and a server option.
34
+ * [llama-cpp-python](https://github.com/abetlen/llama-cpp-python), a Python library with GPU accel, LangChain support, and OpenAI-compatible API server.
35
+ * [LM Studio](https://lmstudio.ai/), an easy-to-use and powerful local GUI for Windows and macOS (Silicon), with GPU acceleration. Linux available, in beta as of 27/11/2023.
36
+ * [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI, with many features and powerful extensions. Supports GPU acceleration.
37
+ * [KoboldCpp](https://github.com/LostRuins/koboldcpp), a fully featured web UI, with GPU accel across all platforms and GPU architectures. Especially good for story telling.
38
+ * [GPT4All](https://gpt4all.io/index.html), a free and open source local running GUI, supporting Windows, Linux and macOS with full GPU accel.
39
+ * [LoLLMS Web UI](https://github.com/ParisNeo/lollms-webui), a great web UI with many interesting and unique features, including a full model library for easy model selection.
40
+ * [Faraday.dev](https://faraday.dev/), an attractive and easy to use character-based chat GUI for Windows and macOS (both Silicon and Intel), with GPU acceleration.
41
+ * [candle](https://github.com/huggingface/candle), a Rust ML framework with a focus on performance, including GPU support, and ease of use.
42
+ * [ctransformers](https://github.com/marella/ctransformers), a Python library with GPU accel, LangChain support, and OpenAI-compatible AI server. Note, as of time of writing (November 27th 2023), ctransformers has not been updated in a long time and does not support many recent models.
43
+
44
+ ## Special thanks
45
+
46
+ 🙏 Special thanks to [Georgi Gerganov](https://github.com/ggerganov) and the whole team working on [llama.cpp](https://github.com/ggerganov/llama.cpp/) for making all of this possible.