paul-stansifer committed
Commit d84b02e
1 Parent(s): 722ee79

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +33 -0
README.md ADDED
@@ -0,0 +1,33 @@
+ ---
+ base_model: paul-stansifer/qw-us-gemma2-9b
+ language:
+ - en
+ license: apache-2.0
+ tags:
+ - text-generation-inference
+ - transformers
+ - unsloth
+ - gemma2
+ - trl
+ - llama-cpp
+ - gguf-my-lora
+ datasets:
+ - paul-stansifer/qwantz-strips
+ pipeline_tag: text-generation
+ ---
+
+ # paul-stansifer/qw-us-gemma2-9b-Q8_0-GGUF
+ This LoRA adapter was converted to GGUF format from [`paul-stansifer/qw-us-gemma2-9b`](https://huggingface.co/paul-stansifer/qw-us-gemma2-9b) using ggml.ai's [GGUF-my-lora](https://huggingface.co/spaces/ggml-org/gguf-my-lora) space.
+ Refer to the [original adapter repository](https://huggingface.co/paul-stansifer/qw-us-gemma2-9b) for more details.
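+
+ As a minimal sketch of getting the files locally (assuming the adapter file is named `qw-us-gemma2-9b-q8_0.gguf`, as in the usage examples below), you can fetch the adapter with `huggingface-cli`; a GGUF build of the base Gemma 2 9B model is not included here and must be supplied separately as `base_model.gguf`:
+
+ ```bash
+ # Download the converted LoRA adapter GGUF from this repository
+ huggingface-cli download paul-stansifer/qw-us-gemma2-9b-Q8_0-GGUF \
+   qw-us-gemma2-9b-q8_0.gguf --local-dir .
+
+ # Note: a GGUF of the base Gemma 2 9B model is also required;
+ # the commands below refer to it as base_model.gguf.
+ ```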
+
+ ## Use with llama.cpp
+
+ ```bash
+ # with cli
+ llama-cli -m base_model.gguf --lora qw-us-gemma2-9b-q8_0.gguf (...other args)
+
+ # with server
+ llama-server -m base_model.gguf --lora qw-us-gemma2-9b-q8_0.gguf (...other args)
+ ```
+
+ To learn more about LoRA usage with the llama.cpp server, refer to the [llama.cpp server documentation](https://github.com/ggerganov/llama.cpp/blob/master/examples/server/README.md).
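+
+ As a rough sketch of querying the server once it is running with the adapter applied (assuming the default host and port, `127.0.0.1:8080`, and a placeholder prompt), you can call the `/completion` endpoint directly:
+
+ ```bash
+ # Start the server with the base model and the LoRA adapter applied
+ llama-server -m base_model.gguf --lora qw-us-gemma2-9b-q8_0.gguf --port 8080
+
+ # In another shell, request a completion from the running server
+ # (the prompt here is only a placeholder)
+ curl http://127.0.0.1:8080/completion \
+   -H "Content-Type: application/json" \
+   -d '{"prompt": "Once upon a time", "n_predict": 64}'
+ ```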