End of training
Browse files- README.md +15 -1
- adapter_model.bin +3 -0
README.md
CHANGED
@@ -110,7 +110,9 @@ tokens: # these are delimiters
|
|
110 |
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ritualnah/ppml/runs/3q9smy0v)
|
111 |
# answer-emojis
|
112 |
|
113 |
-
This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on
|
|
|
|
|
114 |
|
115 |
## Model description
|
116 |
|
@@ -140,6 +142,18 @@ The following hyperparameters were used during training:
|
|
140 |
- lr_scheduler_warmup_steps: 10
|
141 |
- num_epochs: 3
|
142 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
143 |
### Framework versions
|
144 |
|
145 |
- PEFT 0.11.1
|
|
|
110 |
[<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ritualnah/ppml/runs/3q9smy0v)
|
111 |
# answer-emojis
|
112 |
|
113 |
+
This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the None dataset.
|
114 |
+
It achieves the following results on the evaluation set:
|
115 |
+
- Loss: 0.5239
|
116 |
|
117 |
## Model description
|
118 |
|
|
|
142 |
- lr_scheduler_warmup_steps: 10
|
143 |
- num_epochs: 3
|
144 |
|
145 |
+
### Training results
|
146 |
+
|
147 |
+
| Training Loss | Epoch | Step | Validation Loss |
|
148 |
+
|:-------------:|:------:|:----:|:---------------:|
|
149 |
+
| 1.0155 | 0.0082 | 1 | 1.2302 |
|
150 |
+
| 0.5161 | 0.5031 | 61 | 0.5744 |
|
151 |
+
| 0.5398 | 1.0062 | 122 | 0.5379 |
|
152 |
+
| 0.4614 | 1.4990 | 183 | 0.5295 |
|
153 |
+
| 0.4323 | 2.0021 | 244 | 0.5178 |
|
154 |
+
| 0.3823 | 2.4948 | 305 | 0.5239 |
|
155 |
+
|
156 |
+
|
157 |
### Framework versions
|
158 |
|
159 |
- PEFT 0.11.1
|
adapter_model.bin
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:85c4faae61bbc825e0cc2cb4fffec4970bbf8e5b964b2234454c6447ab66e1fa
|
3 |
+
size 1368620762
|