M. Lemoyne
commited on
Commit
•
19a09fa
1
Parent(s):
01edf45
Update README.md
Browse files
README.md
CHANGED
@@ -13,19 +13,4 @@ datasets:
|
|
13 |
|
14 |
## Overview
|
15 |
|
16 |
-
This is
|
17 |
-
|
18 |
-
**This is a (merged) QLoRA fine-tune (rank 64)**.
|
19 |
-
|
20 |
-
The finetune was performed with 1x RTX 6000 Ada (~80 hours to this checkpoint). Prompts were truncated to 4096 tokens (for speed and VRAM headroom).
|
21 |
-
|
22 |
-
I have done very little testing with this model, so feedback on real world performance is appreciated!
|
23 |
-
|
24 |
-
## How to Use
|
25 |
-
|
26 |
-
Use as you would any other Hugging Face fp16 llama-2 model.
|
27 |
-
|
28 |
-
## Prompting:
|
29 |
-
|
30 |
-
Model was trained with llama-2 chat prompt format. See [jondurbin/airoboros-l2-13b-3.1.1](https://huggingface.co/jondurbin/airoboros-l2-13b-3.1.1) model card for details.
|
31 |
-
|
|
|
13 |
|
14 |
## Overview
|
15 |
|
16 |
+
This is a 4.5bpw quantized version of [bhenrym14/airoboros-3_1-yi-34b-200k](https://huggingface.co/bhenrym14/airoboros-3_1-yi-34b-200k),
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|