paul-stansifer
commited on
Commit
•
d84b02e
1
Parent(s):
722ee79
Upload README.md with huggingface_hub
Browse files
README.md
ADDED
@@ -0,0 +1,33 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
base_model: paul-stansifer/qw-us-gemma2-9b
|
3 |
+
language:
|
4 |
+
- en
|
5 |
+
license: apache-2.0
|
6 |
+
tags:
|
7 |
+
- text-generation-inference
|
8 |
+
- transformers
|
9 |
+
- unsloth
|
10 |
+
- gemma2
|
11 |
+
- trl
|
12 |
+
- llama-cpp
|
13 |
+
- gguf-my-lora
|
14 |
+
datasets:
|
15 |
+
- paul-stansifer/qwantz-strips
|
16 |
+
pipeline_tag: text-generation
|
17 |
+
---
|
18 |
+
|
19 |
+
# paul-stansifer/qw-us-gemma2-9b-Q8_0-GGUF
|
20 |
+
This LoRA adapter was converted to GGUF format from [`paul-stansifer/qw-us-gemma2-9b`](https://huggingface.co/paul-stansifer/qw-us-gemma2-9b) via the ggml.ai's [GGUF-my-lora](https://huggingface.co/spaces/ggml-org/gguf-my-lora) space.
|
21 |
+
Refer to the [original adapter repository](https://huggingface.co/paul-stansifer/qw-us-gemma2-9b) for more details.
|
22 |
+
|
23 |
+
## Use with llama.cpp
|
24 |
+
|
25 |
+
```bash
|
26 |
+
# with cli
|
27 |
+
llama-cli -m base_model.gguf --lora qw-us-gemma2-9b-q8_0.gguf (...other args)
|
28 |
+
|
29 |
+
# with server
|
30 |
+
llama-server -m base_model.gguf --lora qw-us-gemma2-9b-q8_0.gguf (...other args)
|
31 |
+
```
|
32 |
+
|
33 |
+
To know more about LoRA usage with llama.cpp server, refer to the [llama.cpp server documentation](https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md).
|