OpOp1 committed on
Commit
2398d42
1 Parent(s): ced26a2

OpOp1/TI-GPT-2B-HWA

README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [google/gemma-2b-it](https://huggingface.co/google/gemma-2b-it) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Loss: 5.0192
+ - Loss: 1.7817

  ## Model description

@@ -51,14 +51,16 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 7.1896 | 0.8 | 1 | 5.7603 |
- | 7.1651 | 1.6 | 2 | 5.7046 |
- | 7.0451 | 2.4 | 3 | 5.5801 |
- | 3.4235 | 4.0 | 5 | 5.3421 |
- | 6.6436 | 4.8 | 6 | 5.2394 |
- | 6.5021 | 5.6 | 7 | 5.1538 |
- | 6.3848 | 6.4 | 8 | 5.0876 |
- | 3.1579 | 8.0 | 10 | 5.0192 |
+ | 4.063 | 1.0 | 10 | 3.5780 |
+ | 3.31 | 2.0 | 20 | 3.0259 |
+ | 2.8245 | 3.0 | 30 | 2.5810 |
+ | 2.4092 | 4.0 | 40 | 2.2151 |
+ | 2.1057 | 5.0 | 50 | 1.9864 |
+ | 1.9341 | 6.0 | 60 | 1.8753 |
+ | 1.8583 | 7.0 | 70 | 1.8140 |
+ | 1.7906 | 8.0 | 80 | 1.7611 |
+ | 1.7858 | 9.0 | 90 | 1.7852 |
+ | 1.7948 | 10.0 | 100 | 1.7817 |


  ### Framework versions
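Reviewer note: in the updated table the validation loss is lowest at epoch 8 (1.7611), while the headline "Loss: 1.7817" is the value from the final epoch. The sketch below finds the lowest-loss epoch from the numbers in the table. The names `val_loss` and `best_epoch` are illustrative and not part of the repository, and the commit does not say which checkpoint the uploaded adapter corresponds to.

```python
# Validation losses copied from the updated README table (epoch -> loss).
val_loss = {
    1: 3.5780, 2: 3.0259, 3: 2.5810, 4: 2.2151, 5: 1.9864,
    6: 1.8753, 7: 1.8140, 8: 1.7611, 9: 1.7852, 10: 1.7817,
}


def best_epoch(losses):
    """Return (epoch, loss) for the lowest validation loss in the mapping."""
    epoch = min(losses, key=losses.get)
    return epoch, losses[epoch]


epoch, loss = best_epoch(val_loss)
final = val_loss[max(val_loss)]  # loss at the last recorded epoch

print(f"lowest validation loss: epoch {epoch} ({loss:.4f})")
print(f"final epoch minus lowest: {final - loss:+.4f}")
```

The gap between the final and lowest values is small here, so this is a sanity check on the reported metric rather than evidence of overfitting.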
adapter_model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:24a0e70dd39b493fb6520802f934ad0445682e461f763d84ddd0a0c7291cce1e
+ oid sha256:a093ecf135bad9fd1e1ebb7c3ea83e0e0af4f097b473775303233f1539a943d3
  size 2364032
runs/Apr02_20-22-08_c750c11bcf60/events.out.tfevents.1712089329.c750c11bcf60.3526.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:dd04a7876de012ae9aafa622562410ad9fe1ab72001049534caccb62b461d1f7
+ size 10092
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:5872591beedd8a8ad47681cd61239875fa31563b5f21f4b627aabcb71933e635
+ oid sha256:cc24de6a7360e87caadd0feaf93abc73aaa0d2eacf9acbb786807cc86a0012f4
  size 4856
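Reviewer note: the binary files in this commit are stored as Git LFS pointers, so the diff only shows a changed `oid` (a SHA-256 digest of the file contents) with the byte size unchanged. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the pointer text is the new `training_args.bin` entry from this commit.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text ("key value" lines) into a dict."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the raw bytes match the pointer's size and SHA-256 digest."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


pointer = parse_lfs_pointer(
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:cc24de6a7360e87caadd0feaf93abc73aaa0d2eacf9acbb786807cc86a0012f4\n"
    "size 4856\n"
)
print(pointer["algo"], pointer["size"])
```

To verify a real download, read the file in binary mode and pass its bytes to `matches_pointer` together with the parsed pointer.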