Model save
- README.md +18 -18
- all_results.json +4 -4
- model-00001-of-00003.safetensors +1 -1
- model-00002-of-00003.safetensors +1 -1
- model-00003-of-00003.safetensors +1 -1
- runs/May10_12-22-14_n136-082-130/events.out.tfevents.1715315025.n136-082-130.295839.0 +2 -2
- train_results.json +4 -4
- trainer_state.json +0 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -15,15 +15,15 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model was trained from scratch on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.
-- Rewards/chosen: -
-- Rewards/rejected: -
-- Rewards/accuracies: 0.
-- Rewards/margins:
-- Logps/rejected: -
-- Logps/chosen: -
-- Logits/rejected:
-- Logits/chosen:
+- Loss: 1.4971
+- Rewards/chosen: -4.5102
+- Rewards/rejected: -4.6591
+- Rewards/accuracies: 0.5156
+- Rewards/margins: 0.1490
+- Logps/rejected: -753.6738
+- Logps/chosen: -732.6489
+- Logits/rejected: 1.5926
+- Logits/chosen: 1.5057
 
 ## Model description
 
@@ -60,15 +60,15 @@ The following hyperparameters were used during training:
 
 | Training Loss | Epoch | Step | Logits/chosen | Logits/rejected | Logps/chosen | Logps/rejected | Validation Loss | Rewards/accuracies | Rewards/chosen | Rewards/margins | Rewards/rejected |
 |:-------------:|:-----:|:----:|:-------------:|:---------------:|:------------:|:--------------:|:---------------:|:------------------:|:--------------:|:---------------:|:----------------:|
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
-| 0.
+| 0.2876 | 0.1 | 100 | -2.3965 | -2.3559 | -391.6134 | -394.6287 | 0.8317 | 0.4883 | -1.0998 | -0.0311 | -1.0687 |
+| 0.1728 | 0.21 | 200 | -0.2344 | -0.1269 | -464.6779 | -471.8403 | 1.0232 | 0.4766 | -1.8304 | 0.0103 | -1.8408 |
+| 0.1485 | 0.31 | 300 | -0.3320 | -0.2139 | -506.0840 | -508.1475 | 1.1085 | 0.4883 | -2.2445 | -0.0406 | -2.2039 |
+| 0.1363 | 0.42 | 400 | -0.2901 | -0.1728 | -477.3530 | -486.5422 | 1.1616 | 0.4961 | -1.9572 | 0.0306 | -1.9878 |
+| 0.1192 | 0.52 | 500 | 0.8077 | 0.8821 | -553.1240 | -562.3370 | 1.2602 | 0.4961 | -2.7149 | 0.0308 | -2.7458 |
+| 0.1061 | 0.63 | 600 | 1.3499 | 1.4624 | -636.7335 | -655.7740 | 1.3570 | 0.5078 | -3.5510 | 0.1291 | -3.6801 |
+| 0.0916 | 0.73 | 700 | 1.7244 | 1.8370 | -780.9122 | -803.1144 | 1.5923 | 0.5195 | -4.9928 | 0.1607 | -5.1535 |
+| 0.0982 | 0.84 | 800 | 1.4960 | 1.5885 | -701.4560 | -722.2228 | 1.4367 | 0.5117 | -4.1982 | 0.1464 | -4.3446 |
+| 0.0798 | 0.94 | 900 | 1.5057 | 1.5926 | -732.6489 | -753.6738 | 1.4971 | 0.5156 | -4.5102 | 0.1490 | -4.6591 |
 
 
 ### Framework versions
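The reward metrics in the README can be sanity-checked against each other. The sketch below assumes that Rewards/margins is Rewards/chosen minus Rewards/rejected; the README does not state this definition, although the reported numbers are consistent with it. The helper name `margin_mismatches` and the tolerance are illustrative choices, not part of the commit.

```python
# Reward metrics copied from the README diff above.
# Each tuple: (label, rewards_chosen, rewards_rejected, reported_margin)
ROWS = [
    ("step 100", -1.0998, -1.0687, -0.0311),
    ("step 200", -1.8304, -1.8408, 0.0103),
    ("step 300", -2.2445, -2.2039, -0.0406),
    ("step 400", -1.9572, -1.9878, 0.0306),
    ("step 500", -2.7149, -2.7458, 0.0308),
    ("final eval", -4.5102, -4.6591, 0.1490),
]

# The README rounds every value to four decimals, so allow for rounding error.
TOLERANCE = 2e-4


def margin_mismatches(rows, tolerance=TOLERANCE):
    """Return the labels whose reported margin differs from chosen - rejected."""
    bad = []
    for label, chosen, rejected, reported in rows:
        if abs((chosen - rejected) - reported) > tolerance:
            bad.append(label)
    return bad


print(margin_mismatches(ROWS))
```

An empty list means every listed row agrees with the assumed definition to within rounding.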
all_results.json
CHANGED
@@ -1,8 +1,8 @@
 {
     "epoch": 1.0,
-    "train_loss": 0.
-    "train_runtime":
+    "train_loss": 0.04742721384732511,
+    "train_runtime": 7350.5302,
     "train_samples": 122268,
-    "train_samples_per_second": 16.
-    "train_steps_per_second": 0.
+    "train_samples_per_second": 16.634,
+    "train_steps_per_second": 0.13
 }
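The throughput fields in all_results.json are related by simple arithmetic. The following sketch recomputes them, assuming `train_runtime` is in seconds and that one epoch covers all `train_samples` exactly once. Because `train_steps_per_second` is rounded to two decimals, the samples-per-step figure is only a rough estimate. The function names are illustrative.

```python
# Values copied from the "+" lines of all_results.json above.
results = {
    "epoch": 1.0,
    "train_runtime": 7350.5302,  # assumed to be seconds
    "train_samples": 122268,
    "train_samples_per_second": 16.634,
    "train_steps_per_second": 0.13,
}


def samples_per_second(r):
    """Recompute throughput from the sample count and the runtime."""
    return r["train_samples"] / r["train_runtime"]


def approx_samples_per_step(r):
    """Estimate samples per optimizer step from the two reported rates."""
    return r["train_samples_per_second"] / r["train_steps_per_second"]


print(round(samples_per_second(results), 3))
print(round(approx_samples_per_step(results)))
```

The first value reproduces the reported `train_samples_per_second`; the second suggests roughly 128 samples per step, subject to the rounding caveat above.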
model-00001-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:cd0d1884a8b45d0de1e90c92985f8203fb2783935e71bdd96bd796f278e035d6
 size 4943178720
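Each weight shard is stored as a Git LFS pointer that records only a SHA-256 digest and a byte size. A minimal sketch of how a downloaded shard could be checked against such a pointer follows; `parse_lfs_pointer` and `matches_pointer` are hypothetical helper names, and the local file path passed to `matches_pointer` is whatever location the shard was downloaded to.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into version, hash algorithm, digest, size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in `pointer`."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    h = hashlib.new(pointer["algo"])
    with open(path, "rb") as f:
        # Hash in chunks so multi-gigabyte shards never sit fully in memory.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# Pointer contents copied from the new side of the diff above.
POINTER_TEXT = """\
version https://git-lfs.github.com/spec/v1
oid sha256:cd0d1884a8b45d0de1e90c92985f8203fb2783935e71bdd96bd796f278e035d6
size 4943178720
"""

pointer = parse_lfs_pointer(POINTER_TEXT)
print(pointer["algo"], pointer["size"])
```

Checking the size first is a cheap way to reject a truncated download before hashing several gigabytes.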
model-00002-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:558503735ec0abd267f67456472c8c72174e08e0fd01a475d9d1eb983c4283b8
 size 4999819336
model-00003-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:95eef01440ee3fb063a713d4575231456ee2001b5ff7505be8157f14ddf6e624
 size 4540532728
runs/May10_12-22-14_n136-082-130/events.out.tfevents.1715315025.n136-082-130.295839.0
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
-size
+oid sha256:c13d588efda433465f94acfdafa33e3d9868bfd9011d3edcc521950ee2b60339
+size 39189
train_results.json
CHANGED
@@ -1,8 +1,8 @@
 {
     "epoch": 1.0,
-    "train_loss": 0.
-    "train_runtime":
+    "train_loss": 0.04742721384732511,
+    "train_runtime": 7350.5302,
     "train_samples": 122268,
-    "train_samples_per_second": 16.
-    "train_steps_per_second": 0.
+    "train_samples_per_second": 16.634,
+    "train_steps_per_second": 0.13
 }
trainer_state.json
CHANGED
The diff for this file is too large to render.
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:
+oid sha256:291902e71a2a4585a72bed7e38f885273985e04787086b5e9c772897150379c6
 size 6264