MaziyarPanahi
commited on
Commit
•
e38cf90
1
Parent(s):
575baed
Upload folder using huggingface_hub (#1)
Browse files- 4a9c8a0b21f6421a7929f15fee93b6f44c1b617e391cfb8e6c4def30608a5792 (961c1e20426ce86549f09cd3e72c01eb098cfeaa)
- 8ea69a7f8802f81dd3d9fc57fa5c45b88b55fd97ee0cfc7d880e28ed313b5c2f (e6e2c1930bc257605bae9d9d85388a1d94b6d191)
- 86795af66871ea3a07c996273bcc31b24840cdcfdef3373ebdc8f95dcb124bc9 (a00cddd16cf090af82cfbf204b8541308f0f3c0b)
- 5ce468a8673de93764128258acade475b5cf8a67e0644845defef1caedd830c3 (c2d3f9f54cb8bcf8bf14bebdd618cadbef9461a5)
- ca3ca1b5bb3bcb92bd1724f71969e5becdd1c9e4ca1e6a1eaf6c59b7dd68ad55 (c640b1c004c171206c0dcc7006d336ad11552c7f)
- 76efe4b6f4b027064354d797229ccbd792ca94cb70167d2ba87fe9f0883e4e78 (9fc62327bab69abef18099269914c622e1fd1de7)
- .gitattributes +6 -0
- EZO-Llama-3.2-3B-Instruct-dpoE-GGUF_imatrix.dat +3 -0
- EZO-Llama-3.2-3B-Instruct-dpoE.Q5_K_M.gguf +3 -0
- EZO-Llama-3.2-3B-Instruct-dpoE.Q5_K_S.gguf +3 -0
- EZO-Llama-3.2-3B-Instruct-dpoE.Q6_K.gguf +3 -0
- EZO-Llama-3.2-3B-Instruct-dpoE.Q8_0.gguf +3 -0
- EZO-Llama-3.2-3B-Instruct-dpoE.fp16.gguf +3 -0
- README.md +46 -0
.gitattributes
CHANGED
@@ -33,3 +33,9 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
EZO-Llama-3.2-3B-Instruct-dpoE.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
EZO-Llama-3.2-3B-Instruct-dpoE.Q5_K_S.gguf filter=lfs diff=lfs merge=lfs -text
|
38 |
+
EZO-Llama-3.2-3B-Instruct-dpoE.Q6_K.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
EZO-Llama-3.2-3B-Instruct-dpoE.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
EZO-Llama-3.2-3B-Instruct-dpoE.fp16.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
EZO-Llama-3.2-3B-Instruct-dpoE-GGUF_imatrix.dat filter=lfs diff=lfs merge=lfs -text
|
EZO-Llama-3.2-3B-Instruct-dpoE-GGUF_imatrix.dat
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5ccb1778af63b34ed5a32a7843973c43c7b6aa2b4a19bb8cb907b303bd30ab2b
|
3 |
+
size 2988366
|
EZO-Llama-3.2-3B-Instruct-dpoE.Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3a91a9ee40b5eb4c998284cc837d18080ee15c42da645940b82dd4e3bab10393
|
3 |
+
size 2322667008
|
EZO-Llama-3.2-3B-Instruct-dpoE.Q5_K_S.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7f28fb6addaef5fe5bc04a806eaf728075f5b6fe4cb3ab1fd2f86b0185a1f337
|
3 |
+
size 2270025216
|
EZO-Llama-3.2-3B-Instruct-dpoE.Q6_K.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ba2e68ed88709c2bad52eb31b7151a59d25044ac3d359b1174f25ce220ce3cfc
|
3 |
+
size 2644366848
|
EZO-Llama-3.2-3B-Instruct-dpoE.Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:2ec76df50836d0153b063b8bc2750741c5745a5373e803d6e3a4be6d9ab9d6c9
|
3 |
+
size 3422412288
|
EZO-Llama-3.2-3B-Instruct-dpoE.fp16.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:358189411a322ee4dc9445fd42ddd3f983d2f87ce14e7dcb66c75d54720da370
|
3 |
+
size 6434200832
|
README.md
ADDED
@@ -0,0 +1,46 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
tags:
|
3 |
+
- quantized
|
4 |
+
- 2-bit
|
5 |
+
- 3-bit
|
6 |
+
- 4-bit
|
7 |
+
- 5-bit
|
8 |
+
- 6-bit
|
9 |
+
- 8-bit
|
10 |
+
- GGUF
|
11 |
+
- text-generation
|
12 |
+
- text-generation
|
13 |
+
model_name: EZO-Llama-3.2-3B-Instruct-dpoE-GGUF
|
14 |
+
base_model: AXCXEPT/EZO-Llama-3.2-3B-Instruct-dpoE
|
15 |
+
inference: false
|
16 |
+
model_creator: AXCXEPT
|
17 |
+
pipeline_tag: text-generation
|
18 |
+
quantized_by: MaziyarPanahi
|
19 |
+
---
|
20 |
+
# [MaziyarPanahi/EZO-Llama-3.2-3B-Instruct-dpoE-GGUF](https://huggingface.co/MaziyarPanahi/EZO-Llama-3.2-3B-Instruct-dpoE-GGUF)
|
21 |
+
- Model creator: [AXCXEPT](https://huggingface.co/AXCXEPT)
|
22 |
+
- Original model: [AXCXEPT/EZO-Llama-3.2-3B-Instruct-dpoE](https://huggingface.co/AXCXEPT/EZO-Llama-3.2-3B-Instruct-dpoE)
|
23 |
+
|
24 |
+
## Description
|
25 |
+
[MaziyarPanahi/EZO-Llama-3.2-3B-Instruct-dpoE-GGUF](https://huggingface.co/MaziyarPanahi/EZO-Llama-3.2-3B-Instruct-dpoE-GGUF) contains GGUF format model files for [AXCXEPT/EZO-Llama-3.2-3B-Instruct-dpoE](https://huggingface.co/AXCXEPT/EZO-Llama-3.2-3B-Instruct-dpoE).
|
26 |
+
|
27 |
+
### About GGUF
|
28 |
+
|
29 |
+
GGUF is a new format introduced by the llama.cpp team on August 21st 2023. It is a replacement for GGML, which is no longer supported by llama.cpp.
|
30 |
+
|
31 |
+
Here is an incomplete list of clients and libraries that are known to support GGUF:
|
32 |
+
|
33 |
+
* [llama.cpp](https://github.com/ggerganov/llama.cpp). The source project for GGUF. Offers a CLI and a server option.
|
34 |
+
* [llama-cpp-python](https://github.com/abetlen/llama-cpp-python), a Python library with GPU accel, LangChain support, and OpenAI-compatible API server.
|
35 |
+
* [LM Studio](https://lmstudio.ai/), an easy-to-use and powerful local GUI for Windows and macOS (Silicon), with GPU acceleration. Linux available, in beta as of 27/11/2023.
|
36 |
+
* [text-generation-webui](https://github.com/oobabooga/text-generation-webui), the most widely used web UI, with many features and powerful extensions. Supports GPU acceleration.
|
37 |
+
* [KoboldCpp](https://github.com/LostRuins/koboldcpp), a fully featured web UI, with GPU accel across all platforms and GPU architectures. Especially good for story telling.
|
38 |
+
* [GPT4All](https://gpt4all.io/index.html), a free and open source local running GUI, supporting Windows, Linux and macOS with full GPU accel.
|
39 |
+
* [LoLLMS Web UI](https://github.com/ParisNeo/lollms-webui), a great web UI with many interesting and unique features, including a full model library for easy model selection.
|
40 |
+
* [Faraday.dev](https://faraday.dev/), an attractive and easy to use character-based chat GUI for Windows and macOS (both Silicon and Intel), with GPU acceleration.
|
41 |
+
* [candle](https://github.com/huggingface/candle), a Rust ML framework with a focus on performance, including GPU support, and ease of use.
|
42 |
+
* [ctransformers](https://github.com/marella/ctransformers), a Python library with GPU accel, LangChain support, and OpenAI-compatible AI server. Note, as of time of writing (November 27th 2023), ctransformers has not been updated in a long time and does not support many recent models.
|
43 |
+
|
44 |
+
## Special thanks
|
45 |
+
|
46 |
+
🙏 Special thanks to [Georgi Gerganov](https://github.com/ggerganov) and the whole team working on [llama.cpp](https://github.com/ggerganov/llama.cpp/) for making all of this possible.
|