evayzh committed on
Commit
95db18c
·
verified ·
1 Parent(s): d7b1554

End of training

Files changed (2)
  1. README.md +15 -1
  2. adapter_model.bin +3 -0
README.md CHANGED
@@ -110,7 +110,9 @@ tokens: # these are delimiters
  [<img src="https://raw.githubusercontent.com/wandb/assets/main/wandb-github-badge-28.svg" alt="Visualize in Weights & Biases" width="200" height="32"/>](https://wandb.ai/ritualnah/ppml/runs/3q9smy0v)
  # answer-emojis

- This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on an unknown dataset.
+ This model is a fine-tuned version of [NousResearch/Llama-2-7b-hf](https://huggingface.co/NousResearch/Llama-2-7b-hf) on the None dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 0.5239

  ## Model description

@@ -140,6 +142,18 @@ The following hyperparameters were used during training:
  - lr_scheduler_warmup_steps: 10
  - num_epochs: 3

+ ### Training results
+
+ | Training Loss | Epoch | Step | Validation Loss |
+ |:-------------:|:------:|:----:|:---------------:|
+ | 1.0155 | 0.0082 | 1 | 1.2302 |
+ | 0.5161 | 0.5031 | 61 | 0.5744 |
+ | 0.5398 | 1.0062 | 122 | 0.5379 |
+ | 0.4614 | 1.4990 | 183 | 0.5295 |
+ | 0.4323 | 2.0021 | 244 | 0.5178 |
+ | 0.3823 | 2.4948 | 305 | 0.5239 |
+
+
  ### Framework versions

  - PEFT 0.11.1
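
Reviewer note on the training results added in this commit: the evaluation loss reported in the card (0.5239) matches the last evaluated row of the table, not the lowest validation loss in it. The sketch below copies the six rows from the table and compares the two. The helper names `best_checkpoint` and `final_checkpoint` are hypothetical, introduced only for illustration, and the script makes no claim about which checkpoint `adapter_model.bin` actually contains.

```python
# Rows copied from the "Training results" table in the README diff:
# (training_loss, epoch, step, validation_loss)
ROWS = [
    (1.0155, 0.0082, 1, 1.2302),
    (0.5161, 0.5031, 61, 0.5744),
    (0.5398, 1.0062, 122, 0.5379),
    (0.4614, 1.4990, 183, 0.5295),
    (0.4323, 2.0021, 244, 0.5178),
    (0.3823, 2.4948, 305, 0.5239),
]


def best_checkpoint(rows):
    """Return the row with the lowest validation loss (hypothetical helper)."""
    return min(rows, key=lambda row: row[3])


def final_checkpoint(rows):
    """Return the row with the highest step count (hypothetical helper)."""
    return max(rows, key=lambda row: row[2])


best = best_checkpoint(ROWS)
last = final_checkpoint(ROWS)
gap = last[3] - best[3]

print(f"best: step {best[2]}, validation loss {best[3]:.4f}")
print(f"last: step {last[2]}, validation loss {last[3]:.4f}")
print(f"difference (last - best): {gap:+.4f}")
```

On these rows the lowest validation loss is at step 244 (0.5178), while the training loss keeps falling through step 305. That pattern may indicate the start of mild overfitting, although six evaluation points are too few to say so with confidence.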
adapter_model.bin ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:85c4faae61bbc825e0cc2cb4fffec4970bbf8e5b964b2234454c6447ab66e1fa
+ size 1368620762
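
Reviewer note on `adapter_model.bin`: the three added lines are a Git LFS pointer, not the weights themselves. The pointer records the spec version, the SHA-256 of the real file, and its size in bytes. A minimal sketch for reading those fields follows. `parse_lfs_pointer` is a hypothetical helper written for this note, not part of any Git LFS tooling, and it assumes the simple `key value` layout shown in the diff.

```python
# Pointer text copied from the adapter_model.bin diff.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:85c4faae61bbc825e0cc2cb4fffec4970bbf8e5b964b2234454c6447ab66e1fa
size 1368620762
"""


def parse_lfs_pointer(text):
    """Parse 'key value' lines of an LFS pointer into a dict (hypothetical helper)."""
    fields = {}
    for line in text.splitlines():
        if not line.strip():
            continue  # skip blank lines
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size_bytes": int(fields["size"]),
    }


pointer = parse_lfs_pointer(POINTER_TEXT)
size_gib = pointer["size_bytes"] / 2**30

print(f"hash: {pointer['hash_algo']} ({len(pointer['digest'])} hex chars)")
print(f"size: {pointer['size_bytes']} bytes, about {size_gib:.2f} GiB")
```

The recorded size works out to roughly 1.27 GiB. After downloading the real file, its SHA-256 can be compared against the `oid` digest to confirm the transfer was intact.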