yemen2016 committed on
Commit
dfaab70
1 Parent(s): 15bcf19

End of training

README.md CHANGED
@@ -1,13 +1,13 @@
  ---
- base_model: KennethEnevoldsen/dfm-sentence-encoder-large-exp2-no-lang-align
  library_name: transformers
+ base_model: KennethEnevoldsen/dfm-sentence-encoder-large-exp2-no-lang-align
+ tags:
+ - generated_from_trainer
  metrics:
  - accuracy
  - precision
  - recall
  - f1
- tags:
- - generated_from_trainer
  model-index:
  - name: dfm
  results: []
@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [KennethEnevoldsen/dfm-sentence-encoder-large-exp2-no-lang-align](https://huggingface.co/KennethEnevoldsen/dfm-sentence-encoder-large-exp2-no-lang-align) on the None dataset.
  It achieves the following results on the evaluation set:
- - Accuracy: 0.9421
- - Precision: 0.9470
- - Recall: 0.9421
- - F1: 0.9422
- - Loss: 0.5839
+ - Accuracy: 0.9417
+ - Precision: 0.9468
+ - Recall: 0.9417
+ - F1: 0.9418
+ - Loss: 0.4894

  ## Model description

@@ -58,25 +58,25 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Accuracy | Precision | Recall | F1 | Validation Loss |
  |:-------------:|:-------:|:----:|:--------:|:---------:|:------:|:------:|:---------------:|
- | No log | 0.9412 | 8 | 0.8711 | 0.8341 | 0.8711 | 0.8507 | 0.4719 |
- | No log | 2.0 | 17 | 0.9237 | 0.9242 | 0.9237 | 0.9217 | 0.3301 |
- | No log | 2.9412 | 25 | 0.9225 | 0.9301 | 0.9225 | 0.9232 | 0.3470 |
- | No log | 4.0 | 34 | 0.9317 | 0.9315 | 0.9317 | 0.9299 | 0.2004 |
- | No log | 4.9412 | 42 | 0.9379 | 0.9443 | 0.9379 | 0.9383 | 0.4529 |
- | No log | 6.0 | 51 | 0.9394 | 0.9454 | 0.9394 | 0.9399 | 0.4719 |
- | No log | 6.9412 | 59 | 0.9425 | 0.9458 | 0.9425 | 0.9419 | 0.4498 |
- | No log | 8.0 | 68 | 0.9421 | 0.9471 | 0.9421 | 0.9423 | 0.4921 |
- | No log | 8.9412 | 76 | 0.9440 | 0.9486 | 0.9440 | 0.9440 | 0.5242 |
- | No log | 10.0 | 85 | 0.9444 | 0.9488 | 0.9444 | 0.9443 | 0.5476 |
- | No log | 10.9412 | 93 | 0.9421 | 0.9471 | 0.9421 | 0.9422 | 0.5733 |
- | No log | 12.0 | 102 | 0.9432 | 0.9479 | 0.9432 | 0.9433 | 0.5725 |
- | No log | 12.9412 | 110 | 0.9432 | 0.9478 | 0.9432 | 0.9432 | 0.5677 |
- | No log | 14.0 | 119 | 0.9432 | 0.9478 | 0.9432 | 0.9432 | 0.5714 |
- | No log | 14.9412 | 127 | 0.9425 | 0.9473 | 0.9425 | 0.9425 | 0.5802 |
- | No log | 16.0 | 136 | 0.9417 | 0.9468 | 0.9417 | 0.9418 | 0.5838 |
- | No log | 16.9412 | 144 | 0.9421 | 0.9470 | 0.9421 | 0.9422 | 0.5857 |
- | No log | 18.0 | 153 | 0.9421 | 0.9470 | 0.9421 | 0.9422 | 0.5840 |
- | No log | 18.8235 | 160 | 0.9421 | 0.9470 | 0.9421 | 0.9422 | 0.5839 |
+ | No log | 0.9412 | 8 | 0.7223 | 0.7770 | 0.7223 | 0.7069 | 0.8079 |
+ | No log | 2.0 | 17 | 0.7821 | 0.8280 | 0.7821 | 0.7670 | 0.7157 |
+ | No log | 2.9412 | 25 | 0.9217 | 0.9243 | 0.9217 | 0.9174 | 0.3617 |
+ | No log | 4.0 | 34 | 0.9283 | 0.9331 | 0.9283 | 0.9272 | 0.3444 |
+ | No log | 4.9412 | 42 | 0.9156 | 0.9274 | 0.9156 | 0.9168 | 0.4618 |
+ | No log | 6.0 | 51 | 0.9271 | 0.9316 | 0.9271 | 0.9277 | 0.3164 |
+ | No log | 6.9412 | 59 | 0.9356 | 0.9387 | 0.9356 | 0.9349 | 0.3228 |
+ | No log | 8.0 | 68 | 0.9329 | 0.9398 | 0.9329 | 0.9334 | 0.4814 |
+ | No log | 8.9412 | 76 | 0.9402 | 0.9450 | 0.9402 | 0.9400 | 0.4819 |
+ | No log | 10.0 | 85 | 0.9409 | 0.9459 | 0.9409 | 0.9409 | 0.4952 |
+ | No log | 10.9412 | 93 | 0.9367 | 0.9428 | 0.9367 | 0.9370 | 0.5182 |
+ | No log | 12.0 | 102 | 0.9409 | 0.9462 | 0.9409 | 0.9411 | 0.4947 |
+ | No log | 12.9412 | 110 | 0.9405 | 0.9457 | 0.9405 | 0.9406 | 0.4927 |
+ | No log | 14.0 | 119 | 0.9409 | 0.9462 | 0.9409 | 0.9411 | 0.4912 |
+ | No log | 14.9412 | 127 | 0.9413 | 0.9465 | 0.9413 | 0.9414 | 0.4917 |
+ | No log | 16.0 | 136 | 0.9413 | 0.9464 | 0.9413 | 0.9415 | 0.4893 |
+ | No log | 16.9412 | 144 | 0.9413 | 0.9464 | 0.9413 | 0.9415 | 0.4890 |
+ | No log | 18.0 | 153 | 0.9417 | 0.9468 | 0.9417 | 0.9418 | 0.4893 |
+ | No log | 18.8235 | 160 | 0.9417 | 0.9468 | 0.9417 | 0.9418 | 0.4894 |


  ### Framework versions
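
Note on the README metrics: in every row of the evaluation table, Recall equals Accuracy to four decimals. That pattern is what support-weighted averaging of per-class recall would produce, because weighting each class's recall by its share of the examples sums to the overall fraction of correct predictions. The README does not state the averaging mode, so this is a likely explanation rather than a confirmed one. A minimal pure-Python sketch of the identity follows; the label sequences are made up for illustration and are not data from this run, and the class names are borrowed from the removed lines of config.json.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of predictions that match the reference labels."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def weighted_recall(y_true, y_pred):
    """Per-class recall, averaged with weights equal to each class's support."""
    support = Counter(y_true)
    hits = Counter(t for t, p in zip(y_true, y_pred) if t == p)
    total = len(y_true)
    return sum((support[c] / total) * (hits[c] / support[c]) for c in support)


# Hypothetical labels, for illustration only.
y_true = ["O", "O", "O", "ST", "ST", "SM", "SP", "_"]
y_pred = ["O", "O", "ST", "ST", "O", "SM", "SP", "SP"]

# Both values agree up to floating-point rounding.
print(accuracy(y_true, y_pred), weighted_recall(y_true, y_pred))
```

Macro-averaged recall would generally differ from accuracy on imbalanced data, so the equality seen in the table is only consistent with weighted (or micro) averaging.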
config.json CHANGED
@@ -10,20 +10,20 @@
  "hidden_dropout_prob": 0.1,
  "hidden_size": 1024,
  "id2label": {
- "0": "O",
- "1": "ST",
- "2": "SM",
- "3": "SP",
- "4": "_"
+ "0": "LABEL_0",
+ "1": "LABEL_1",
+ "2": "LABEL_2",
+ "3": "LABEL_3",
+ "4": "LABEL_4"
  },
  "initializer_range": 0.02,
  "intermediate_size": 4096,
  "label2id": {
- "O": 0,
- "ST": 1,
- "SM": 2,
- "SP": 3,
- "_": 4
+ "LABEL_0": 0,
+ "LABEL_1": 1,
+ "LABEL_2": 2,
+ "LABEL_3": 3,
+ "LABEL_4": 4
  },
  "layer_norm_eps": 1e-12,
  "max_position_embeddings": 512,
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:003d33f76a410852ef6d48ebe2ff12b18fe2ee0b259b8a6ea78be82c83e89c61
+ oid sha256:0bf7ffc5a38d4ad76a8778993472d2c46beabeb3ff1a1be46ab4593999ba4681
  size 1416218404
runs/Oct23_12-07-26_08287b92a2e8/events.out.tfevents.1729685247.08287b92a2e8.40887.7 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:93365fdc740a55ec10abd3532d3b188afd2c88d158108a4ba053080cdae11a23
+ size 5403
runs/Oct23_12-18-21_08287b92a2e8/events.out.tfevents.1729685902.08287b92a2e8.40887.11 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:a7e4e5014e0fcb1b7f542bac98ecc7ca9c85a3b4d65bf37f462da01b0657ee77
+ size 14512
runs/Oct23_12-18-21_08287b92a2e8/events.out.tfevents.1729685999.08287b92a2e8.40887.12 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:af0899f9b57d2dbf559ee3b21f1dcc7573d1ee2d1a2adb93d6ba84c470dc6a92
+ size 560
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:c16198da7ff65e77dab30111090fab65628f4b4c7c09910503f3502722bd53b1
+ oid sha256:074b9f52bb57b9c68f0f2260d2e40fcb63a34ab6b6314af5c5f4f91facaf7c02
  size 5240
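
Note on the config.json change in this commit: the named classes (O, ST, SM, SP, _) are replaced by the generic placeholders LABEL_0 through LABEL_4, so predictions from this checkpoint will presumably be reported under the generic names. A minimal sketch of rebuilding a consistent pair of mappings and translating a generic name back to a class name is given here. It assumes the class order of the previous commit still applies, which the diff does not confirm; the helper names `build_label_maps` and `rename_generic` are hypothetical.

```python
def build_label_maps(labels):
    """Return (id2label, label2id) dicts that are exact inverses of each other."""
    if len(set(labels)) != len(labels):
        raise ValueError("label names must be unique")
    id2label = {i: name for i, name in enumerate(labels)}
    label2id = {name: i for i, name in id2label.items()}
    return id2label, label2id


def rename_generic(generic_name, id2label):
    """Map a placeholder such as 'LABEL_2' to the class name at that index."""
    index = int(generic_name.rsplit("_", 1)[1])
    return id2label[index]


# Assumed order, copied from the removed lines of config.json.
labels = ["O", "ST", "SM", "SP", "_"]
id2label, label2id = build_label_maps(labels)

print(rename_generic("LABEL_2", id2label))  # prints SM
```

If the assumed order holds, mappings like these could be supplied as `id2label` and `label2id` when the model config is loaded or the model is retrained, so that the saved config keeps the named classes.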