mehdie/fine_tuned_mBERT
Browse files- README.md +15 -15
- model.safetensors +1 -1
- runs/Apr30_11-25-28_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469129.a256-a40-06.srv.aau.dk.1194321.0 +3 -0
- runs/Apr30_11-25-28_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469190.a256-a40-06.srv.aau.dk.1194321.1 +3 -0
- runs/Apr30_11-27-27_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469248.a256-a40-06.srv.aau.dk.1194904.0 +3 -0
- runs/Apr30_11-27-27_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469298.a256-a40-06.srv.aau.dk.1194904.1 +3 -0
- runs/Apr30_11-28-57_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469338.a256-a40-06.srv.aau.dk.1195214.0 +3 -0
- runs/Apr30_11-28-57_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469376.a256-a40-06.srv.aau.dk.1195214.1 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -19,11 +19,11 @@ should probably proofread and complete it, then remove this comment. -->
|
|
19 |
|
20 |
This model is a fine-tuned version of [google-bert/bert-base-multilingual-cased](https://huggingface.co/google-bert/bert-base-multilingual-cased) on an unknown dataset.
|
21 |
It achieves the following results on the evaluation set:
|
22 |
-
- Loss: 0.
|
23 |
-
- F1: 0.
|
24 |
-
- F5: 0.
|
25 |
-
- Precision: 0.
|
26 |
-
- Recall: 0
|
27 |
|
28 |
## Model description
|
29 |
|
@@ -42,7 +42,7 @@ More information needed
|
|
42 |
### Training hyperparameters
|
43 |
|
44 |
The following hyperparameters were used during training:
|
45 |
-
- learning_rate: 5e-05
|
46 |
- train_batch_size: 128
|
47 |
- eval_batch_size: 128
|
48 |
- seed: 42
|
@@ -55,15 +55,15 @@ The following hyperparameters were used during training:
|
|
55 |
|
56 |
| Training Loss | Epoch | Step | Validation Loss | F1 | F5 | Precision | Recall |
|
57 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:---------:|:------:|
|
58 |
-
| No log | 1.0 |
|
59 |
-
| No log | 2.0 |
|
60 |
-
| No log | 3.0 |
|
61 |
-
| No log | 4.0 |
|
62 |
-
| No log | 5.0 |
|
63 |
-
| No log | 6.0 |
|
64 |
-
| No log | 7.0 |
|
65 |
-
| No log | 8.0 |
|
66 |
-
| No log | 9.0 |
|
67 |
|
68 |
|
69 |
### Framework versions
|
|
|
19 |
|
20 |
This model is a fine-tuned version of [google-bert/bert-base-multilingual-cased](https://huggingface.co/google-bert/bert-base-multilingual-cased) on an unknown dataset.
|
21 |
It achieves the following results on the evaluation set:
|
22 |
+
- Loss: 0.0431
|
23 |
+
- F1: 0.8182
|
24 |
+
- F5: 0.8792
|
25 |
+
- Precision: 0.6923
|
26 |
+
- Recall: 1.0
|
27 |
|
28 |
## Model description
|
29 |
|
|
|
42 |
### Training hyperparameters
|
43 |
|
44 |
The following hyperparameters were used during training:
|
45 |
+
- learning_rate: 2.5e-05
|
46 |
- train_batch_size: 128
|
47 |
- eval_batch_size: 128
|
48 |
- seed: 42
|
|
|
55 |
|
56 |
| Training Loss | Epoch | Step | Validation Loss | F1 | F5 | Precision | Recall |
|
57 |
|:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:---------:|:------:|
|
58 |
+
| No log | 1.0 | 16 | 0.2406 | 0.0 | 0.0 | 0.0 | 0.0 |
|
59 |
+
| No log | 2.0 | 32 | 0.2933 | 0.6471 | 0.6062 | 0.7857 | 0.55 |
|
60 |
+
| No log | 3.0 | 48 | 0.1965 | 0.5000 | 0.4297 | 0.875 | 0.35 |
|
61 |
+
| No log | 4.0 | 64 | 0.1349 | 0.6842 | 0.6707 | 0.7222 | 0.65 |
|
62 |
+
| No log | 5.0 | 80 | 0.1065 | 0.7027 | 0.6816 | 0.7647 | 0.65 |
|
63 |
+
| No log | 6.0 | 96 | 0.1104 | 0.7727 | 0.8005 | 0.7083 | 0.85 |
|
64 |
+
| No log | 7.0 | 112 | 0.1160 | 0.7273 | 0.7534 | 0.6667 | 0.8 |
|
65 |
+
| No log | 8.0 | 128 | 0.1049 | 0.7647 | 0.7164 | 0.9286 | 0.65 |
|
66 |
+
| No log | 9.0 | 144 | 0.0975 | 0.7778 | 0.7461 | 0.875 | 0.7 |
|
67 |
|
68 |
|
69 |
### Framework versions
|
model.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 711443456
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:02af24f8cb3357f39b2dc0f08945141b588bfcadbc3544e6931cdd7f2d3fce25
|
3 |
size 711443456
|
runs/Apr30_11-25-28_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469129.a256-a40-06.srv.aau.dk.1194321.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:e4572ca7634f189a20b2695306489715c705d62a8f3879063dda02f680ceb008
|
3 |
+
size 12093
|
runs/Apr30_11-25-28_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469190.a256-a40-06.srv.aau.dk.1194321.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:adccb31793a98f77c9c14ec3dc4106cf8ccc7463187d289fc9c0200cae2fa4ed
|
3 |
+
size 506
|
runs/Apr30_11-27-27_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469248.a256-a40-06.srv.aau.dk.1194904.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:29e46f3fd5c77e2f4e943536faf6ac8b29ba2f671b765bd3dd573dd528a73510
|
3 |
+
size 10695
|
runs/Apr30_11-27-27_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469298.a256-a40-06.srv.aau.dk.1194904.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:d12beb3326cb0bce90a2bf1aa2e7b3bf791e891c59eebd86774cd274e133448e
|
3 |
+
size 506
|
runs/Apr30_11-28-57_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469338.a256-a40-06.srv.aau.dk.1195214.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a4f9085b6d438b4d3f9edf6c4fdbc4e63b9fec5d5406793ce1eabaf783206898
|
3 |
+
size 9296
|
runs/Apr30_11-28-57_a256-a40-06.srv.aau.dk/events.out.tfevents.1714469376.a256-a40-06.srv.aau.dk.1195214.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:713442e3bd4ee48bd945577a330500d707409e00c126005a6a9f517874878e80
|
3 |
+
size 506
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4920
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ab980b1abcf7e240f9433a8c677a2d39e58cbba2282336155fa69ce0142fcec1
|
3 |
size 4920
|