Upload folder using huggingface_hub
Browse files- .gitattributes +14 -0
- README.md +2 -56
- model.shard-00001-of-00014.gguf +3 -0
- model.shard-00002-of-00014.gguf +3 -0
- model.shard-00003-of-00014.gguf +3 -0
- model.shard-00004-of-00014.gguf +3 -0
- model.shard-00005-of-00014.gguf +3 -0
- model.shard-00006-of-00014.gguf +3 -0
- model.shard-00007-of-00014.gguf +3 -0
- model.shard-00008-of-00014.gguf +3 -0
- model.shard-00009-of-00014.gguf +3 -0
- model.shard-00010-of-00014.gguf +3 -0
- model.shard-00011-of-00014.gguf +3 -0
- model.shard-00012-of-00014.gguf +3 -0
- model.shard-00013-of-00014.gguf +3 -0
- model.shard-00014-of-00014.gguf +3 -0
.gitattributes
CHANGED
@@ -35,3 +35,17 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
phi-1_5-instruct-v0.1-q5_k_m-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
imatrix.dat filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
phi-1_5-instruct-v0.1-q5_k_m-imat.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
imatrix.dat filter=lfs diff=lfs merge=lfs -text
|
38 |
+
model.shard-00001-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
39 |
+
model.shard-00002-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
40 |
+
model.shard-00003-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
41 |
+
model.shard-00004-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
42 |
+
model.shard-00005-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
43 |
+
model.shard-00006-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
44 |
+
model.shard-00007-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
45 |
+
model.shard-00008-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
46 |
+
model.shard-00009-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
47 |
+
model.shard-00010-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
48 |
+
model.shard-00011-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
49 |
+
model.shard-00012-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
50 |
+
model.shard-00013-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
51 |
+
model.shard-00014-of-00014.gguf filter=lfs diff=lfs merge=lfs -text
|
README.md
CHANGED
@@ -1,59 +1,5 @@
|
|
1 |
---
|
2 |
-
base_model:
|
3 |
-
datasets:
|
4 |
-
- teknium/OpenHermes-2.5
|
5 |
-
- HuggingFaceH4/ultrafeedback_binarized
|
6 |
-
- argilla/distilabel-intel-orca-dpo-pairs
|
7 |
-
- jondurbin/py-dpo-v0.1
|
8 |
-
- argilla/distilabel-math-preference-dpo
|
9 |
-
library_name: transformers
|
10 |
-
license: mit
|
11 |
-
pipeline_tag: text-generation
|
12 |
-
tags:
|
13 |
-
- llama-cpp
|
14 |
-
- gguf-my-repo
|
15 |
---
|
16 |
|
17 |
-
|
18 |
-
This model was converted to GGUF format from [`rasyosef/Phi-1_5-Instruct-v0.1`](https://huggingface.co/rasyosef/Phi-1_5-Instruct-v0.1) using llama.cpp via the ggml.ai's [GGUF-my-repo](https://huggingface.co/spaces/ggml-org/gguf-my-repo) space.
|
19 |
-
Refer to the [original model card](https://huggingface.co/rasyosef/Phi-1_5-Instruct-v0.1) for more details on the model.
|
20 |
-
|
21 |
-
## Use with llama.cpp
|
22 |
-
Install llama.cpp through brew (works on Mac and Linux)
|
23 |
-
|
24 |
-
```bash
|
25 |
-
brew install llama.cpp
|
26 |
-
|
27 |
-
```
|
28 |
-
Invoke the llama.cpp server or the CLI.
|
29 |
-
|
30 |
-
### CLI:
|
31 |
-
```bash
|
32 |
-
llama-cli --hf-repo Felladrin/Phi-1_5-Instruct-v0.1-Q5_K_M-GGUF --hf-file phi-1_5-instruct-v0.1-q5_k_m-imat.gguf -p "The meaning to life and the universe is"
|
33 |
-
```
|
34 |
-
|
35 |
-
### Server:
|
36 |
-
```bash
|
37 |
-
llama-server --hf-repo Felladrin/Phi-1_5-Instruct-v0.1-Q5_K_M-GGUF --hf-file phi-1_5-instruct-v0.1-q5_k_m-imat.gguf -c 2048
|
38 |
-
```
|
39 |
-
|
40 |
-
Note: You can also use this checkpoint directly through the [usage steps](https://github.com/ggerganov/llama.cpp?tab=readme-ov-file#usage) listed in the Llama.cpp repo as well.
|
41 |
-
|
42 |
-
Step 1: Clone llama.cpp from GitHub.
|
43 |
-
```
|
44 |
-
git clone https://github.com/ggerganov/llama.cpp
|
45 |
-
```
|
46 |
-
|
47 |
-
Step 2: Move into the llama.cpp folder and build it with `LLAMA_CURL=1` flag along with other hardware-specific flags (for ex: LLAMA_CUDA=1 for Nvidia GPUs on Linux).
|
48 |
-
```
|
49 |
-
cd llama.cpp && LLAMA_CURL=1 make
|
50 |
-
```
|
51 |
-
|
52 |
-
Step 3: Run inference through the main binary.
|
53 |
-
```
|
54 |
-
./llama-cli --hf-repo Felladrin/Phi-1_5-Instruct-v0.1-Q5_K_M-GGUF --hf-file phi-1_5-instruct-v0.1-q5_k_m-imat.gguf -p "The meaning to life and the universe is"
|
55 |
-
```
|
56 |
-
or
|
57 |
-
```
|
58 |
-
./llama-server --hf-repo Felladrin/Phi-1_5-Instruct-v0.1-Q5_K_M-GGUF --hf-file phi-1_5-instruct-v0.1-q5_k_m-imat.gguf -c 2048
|
59 |
-
```
|
|
|
1 |
---
|
2 |
+
base_model: Felladrin/gguf-Q5_K_M-Phi-1_5-Instruct-v0.1
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
3 |
---
|
4 |
|
5 |
+
Sharded GGUF version of [Felladrin/gguf-Q5_K_M-Phi-1_5-Instruct-v0.1](https://huggingface.co/Felladrin/gguf-Q5_K_M-Phi-1_5-Instruct-v0.1).
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
model.shard-00001-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ad1cb027e6b61935e94ea2994a24d0d9c078b157d2b77820e6460beda5d32d9f
|
3 |
+
size 1970880
|
model.shard-00002-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:62896ca1f8bd6e76ed462854c448612b4f2c25de9b41bc21fe13145c15797d24
|
3 |
+
size 84499136
|
model.shard-00003-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bc3da0992d220445af9805f8c9dbdd2a03fee8229f0b8f07365f645dd6c33aab
|
3 |
+
size 82426784
|
model.shard-00004-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ef60c770e9382b88c0814db29392fed4f71e06db0f355e85385390e535e47294
|
3 |
+
size 74958464
|
model.shard-00005-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:67a20330a77b6d61d55fe321f70a7ccfa0795a35fc8b8fae948ee1ee951adf0c
|
3 |
+
size 83715840
|
model.shard-00006-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d1cbef4897923d1fb93652fb0444b4e22a8c6d147d2562bee6de74b975b250d1
|
3 |
+
size 83781824
|
model.shard-00007-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:1837d6ad5285f77b9d14921334bc944a9efcbb32d2c6359b2b0926068f14828d
|
3 |
+
size 83715840
|
model.shard-00008-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:aa24e485930be424dc818838270dac986e47b370467f040c1c95dc6e300b6c06
|
3 |
+
size 83158784
|
model.shard-00009-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d740cfb7f861b28694fd91a45b0f0486be003d5f3e61404352cb4e9b8e988c87
|
3 |
+
size 81553600
|
model.shard-00010-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:94d62fea35a120d2958c2d918367d43d4ea3e8fad836f9ae4633a419d68fc772
|
3 |
+
size 83715840
|
model.shard-00011-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:4941bd74f421a2abc5878394ff85c085b1e18c1340e2e240837f50cf134ca71d
|
3 |
+
size 83715840
|
model.shard-00012-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6a60ffc5f34b7604246b5be88c920cf1efadbd606594e9e1e6174732aecbefdc
|
3 |
+
size 83781792
|
model.shard-00013-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:af92251f28b38b246d148f5e318cfc48efb07f53b6a0116e4ab07c04dbe6679e
|
3 |
+
size 74958464
|
model.shard-00014-of-00014.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:3f1f8d8d6e4ebcbe737ba0a4fd6f975f316e7b6891983c16a9a346a71936de74
|
3 |
+
size 37429984
|