NowaBwagel0 commited on
Commit
98f6fa3
1 Parent(s): 4b5acea

End of training

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: other
3
- base_model: JackFram/llama-68m
4
  tags:
5
  - generated_from_trainer
6
  model-index:
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # llama-68m-oasst
15
 
16
- This model is a fine-tuned version of [JackFram/llama-68m](https://huggingface.co/JackFram/llama-68m) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 2.8892
19
 
20
  ## Model description
21
 
@@ -42,15 +42,21 @@ The following hyperparameters were used during training:
42
  - total_train_batch_size: 8
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
- - num_epochs: 3
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | 3.219 | 1.0 | 382 | 3.0836 |
52
- | 2.959 | 2.0 | 765 | 2.9271 |
53
- | 2.9323 | 3.0 | 1146 | 2.8892 |
 
 
 
 
 
 
54
 
55
 
56
  ### Framework versions
 
1
  ---
2
  license: other
3
+ base_model: NowaBwagel0/llama-68m-oasst
4
  tags:
5
  - generated_from_trainer
6
  model-index:
 
13
 
14
  # llama-68m-oasst
15
 
16
+ This model is a fine-tuned version of [NowaBwagel0/llama-68m-oasst](https://huggingface.co/NowaBwagel0/llama-68m-oasst) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 2.5908
19
 
20
  ## Model description
21
 
 
42
  - total_train_batch_size: 8
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: linear
45
+ - num_epochs: 9
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
+ | 2.8024 | 1.0 | 382 | 2.7493 |
52
+ | 2.6581 | 2.0 | 765 | 2.6798 |
53
+ | 2.6276 | 3.0 | 1147 | 2.6429 |
54
+ | 2.5111 | 4.0 | 1530 | 2.6212 |
55
+ | 2.4614 | 5.0 | 1912 | 2.6069 |
56
+ | 2.4789 | 6.0 | 2295 | 2.5985 |
57
+ | 2.4288 | 7.0 | 2677 | 2.5942 |
58
+ | 2.4184 | 8.0 | 3060 | 2.5909 |
59
+ | 2.2978 | 8.99 | 3438 | 2.5908 |
60
 
61
 
62
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bc1a310b439c4f7a47c8c0266fd5405e5c9f47b3a55f62f8e2f4d9457f5f2163
3
  size 272123144
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bff1921572492702eca1a6dcab105fdf064d92b5de5ef4a1ffaed388b2d7b1f9
3
  size 272123144
runs/Jan03_15-20-03_Noah-Desktop/events.out.tfevents.1704316806.Noah-Desktop.14824.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e38aa9f49ec1f2a205086f56dc832d0e8e2b22c3c100c5e7cf0aad23901b583f
3
- size 65148
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b5cf335bcada1453902ebf3bffd48d7898f3c1497effc830e8a806db9e5aed18
3
+ size 74522
runs/Jan03_15-20-03_Noah-Desktop/events.out.tfevents.1704321160.Noah-Desktop.14824.1 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:da3ffe8700f8f06fe9b1f217b509e0a10232e0d30bf1adb77c2b7c4e4f670917
3
+ size 359