chrisociepa committed on
Commit 5c4b3db
1 Parent(s): d2c192d

Update README.md

Files changed (1)
  1. README.md +28 -0
README.md CHANGED
@@ -1,3 +1,31 @@
  ---
  license: cc-by-nc-4.0
+ language:
+ - pl
+ tags:
+ - llama
+ - qlora
+ - instruct
  ---
+
+ This repo contains a QLoRA adapter for [Llama-2-7b](https://huggingface.co/meta-llama/Llama-2-7b-hf). The adapter was first trained on 1B tokens (that version is available [here](https://huggingface.co/Azurro/llama-2-7b-qlora-polish)) and subsequently fine-tuned on a private instruction dataset, exclusively in Polish.
+
+ The fine-tuning took 1 hour on a single RTX 4090 with the following hyperparameters:
+
+ * context length: 2048
+ * batch_size: 16
+ * learning_rate: 0.0001
+ * lora_r: 64
+ * lora_alpha: 16
+ * lora_modules: all
+ * lora_dropout: 0.0
+ * weight_decay: 0.1
+ * max_grad_norm: 0.3
+ * double_quant, nf4
+ * optimizer: paged_adamw_32bit (beta2: 0.999)
+
+ This adapter lets the model generate Polish text more accurately than vanilla [Llama-2-7b](https://huggingface.co/meta-llama/Llama-2-7b-hf).
+
+ <p align="center">
+ <img src="https://huggingface.co/Azurro/llama-2-7b-qlora-polish-instruct/raw/main/llama-2-7b-qlora-polish-instruct.jpg">
+ </p>
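
To put the `lora_r` and `lora_alpha` values from the README in context, here is a minimal sketch of the arithmetic they imply. It is not part of the commit. The layer shapes are the published Llama-2-7b dimensions, and reading `lora_modules: all` as "every linear projection in each decoder block" is an assumption, as are the names `LoraHyperparams`, `LINEAR_SHAPES`, and `lora_param_count`.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class LoraHyperparams:
    """LoRA values copied from the hyperparameter list in the README."""

    r: int = 64
    alpha: int = 16

    @property
    def scaling(self) -> float:
        # Standard LoRA multiplies the low-rank update by alpha / r.
        return self.alpha / self.r


# ASSUMPTION: published Llama-2-7b shapes; the README does not state them.
HIDDEN = 4096
INTERMEDIATE = 11008
NUM_LAYERS = 32

# ASSUMPTION: "lora_modules: all" covers these seven projections per block.
# Each entry is (in_features, out_features).
LINEAR_SHAPES = {
    "q_proj": (HIDDEN, HIDDEN),
    "k_proj": (HIDDEN, HIDDEN),
    "v_proj": (HIDDEN, HIDDEN),
    "o_proj": (HIDDEN, HIDDEN),
    "gate_proj": (HIDDEN, INTERMEDIATE),
    "up_proj": (HIDDEN, INTERMEDIATE),
    "down_proj": (INTERMEDIATE, HIDDEN),
}


def lora_param_count(r: int) -> int:
    """Count adapter weights: each wrapped linear adds A (r x in) and B (out x r)."""
    per_layer = sum(r * (n_in + n_out) for n_in, n_out in LINEAR_SHAPES.values())
    return per_layer * NUM_LAYERS


hp = LoraHyperparams()
print(f"LoRA scaling (alpha / r): {hp.scaling}")
print(f"Adapter parameters: {lora_param_count(hp.r):,}")
```

Under these assumptions the adapter holds roughly 160M trainable weights, and the low-rank update is scaled by 0.25 before being added to the frozen 4-bit base weights.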