Update README.md
Browse files
README.md
CHANGED
@@ -29,6 +29,30 @@ Simple Colab notebook for testing: https://colab.research.google.com/drive/1Al9g
|
|
29 |
* **q6_k:** Uses Q8_K for all tensors
|
30 |
* **q8_0:** Almost indistinguishable from float16. High resource use and slow. Not recommended for most users.
|
31 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
32 |
### Model description:
|
33 |
|
34 |
* **Developed by:** [SpeakLeash](https://speakleash.org/) & [ACK Cyfronet AGH](https://www.cyfronet.pl/)
|
|
|
29 |
* **q6_k:** Uses Q8_K for all tensors
|
30 |
* **q8_0:** Almost indistinguishable from float16. High resource use and slow. Not recommended for most users.
|
31 |
|
32 |
+
### Ollama Modfile
|
33 |
+
The GGUF file can be used with [Ollama](https://ollama.com/). To do this, you need to import the model using the configuration defined in the Modfile. For model eg. Bielik-11B-v2.2-Instruct.Q4_K_M.gguf (full path to model location) Modfile looks like:
|
34 |
+
|
35 |
+
```
|
36 |
+
FROM ./Bielik-11B-v2.2-Instruct.Q4_K_M.gguf
|
37 |
+
|
38 |
+
TEMPLATE """<s>{{ if .System }}<|start_header_id|>system<|end_header_id|>
|
39 |
+
|
40 |
+
{{ .System }}<|eot_id|>{{ end }}{{ if .Prompt }}<|start_header_id|>user<|end_header_id|>
|
41 |
+
|
42 |
+
{{ .Prompt }}<|eot_id|>{{ end }}<|start_header_id|>assistant<|end_header_id|>
|
43 |
+
|
44 |
+
{{ .Response }}<|eot_id|>"""
|
45 |
+
|
46 |
+
PARAMETER stop "<|start_header_id|>"
|
47 |
+
PARAMETER stop "<|end_header_id|>"
|
48 |
+
PARAMETER stop "<|eot_id|>"
|
49 |
+
|
50 |
+
# Remeber to set low temperature for experimental models (1-3bits)
|
51 |
+
PARAMETER temperature 0.1
|
52 |
+
|
53 |
+
```
|
54 |
+
|
55 |
+
|
56 |
### Model description:
|
57 |
|
58 |
* **Developed by:** [SpeakLeash](https://speakleash.org/) & [ACK Cyfronet AGH](https://www.cyfronet.pl/)
|