Aleksandar committed on
Commit
139e999
1 Parent(s): 24fd3f5
Files changed (4)
  1. README.md +13 -18
  2. config.json +10 -10
  3. pytorch_model.bin +1 -1
  4. training_args.bin +1 -1
README.md CHANGED
@@ -17,7 +17,7 @@ model_index:
  metric:
  name: Accuracy
  type: accuracy
- value: 0.951370041268543
+ value: 0.9411086738297951
  ---

  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -27,11 +27,11 @@ should probably proofread and complete it, then remove this comment. -->

  This model was trained from scratch on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.2619
- - Precision: 0.8157
- - Recall: 0.7934
- - F1: 0.8044
- - Accuracy: 0.9514
+ - Loss: 0.2071
+ - Precision: 0.7502
+ - Recall: 0.7385
+ - F1: 0.7443
+ - Accuracy: 0.9411

  ## Model description

@@ -51,27 +51,22 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 16
+ - train_batch_size: 32
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 10
+ - num_epochs: 5

  ### Training results

  | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
- | No log | 1.0 | 207 | 0.2845 | 0.7431 | 0.6314 | 0.6827 | 0.9225 |
- | No log | 2.0 | 414 | 0.2082 | 0.7766 | 0.7134 | 0.7436 | 0.9396 |
- | 0.2949 | 3.0 | 621 | 0.1992 | 0.7699 | 0.7596 | 0.7647 | 0.9439 |
- | 0.2949 | 4.0 | 828 | 0.2044 | 0.7485 | 0.7908 | 0.7691 | 0.9456 |
- | 0.0896 | 5.0 | 1035 | 0.2129 | 0.7827 | 0.7778 | 0.7802 | 0.9476 |
- | 0.0896 | 6.0 | 1242 | 0.2330 | 0.7893 | 0.7882 | 0.7887 | 0.9485 |
- | 0.0896 | 7.0 | 1449 | 0.2337 | 0.8026 | 0.7947 | 0.7986 | 0.9504 |
- | 0.0334 | 8.0 | 1656 | 0.2579 | 0.8111 | 0.7850 | 0.7978 | 0.9503 |
- | 0.0334 | 9.0 | 1863 | 0.2792 | 0.8263 | 0.7830 | 0.8041 | 0.9510 |
- | 0.0152 | 10.0 | 2070 | 0.2619 | 0.8157 | 0.7934 | 0.8044 | 0.9514 |
+ | No log | 1.0 | 104 | 0.3002 | 0.6859 | 0.5930 | 0.6361 | 0.9171 |
+ | No log | 2.0 | 208 | 0.2449 | 0.7509 | 0.6422 | 0.6923 | 0.9287 |
+ | No log | 3.0 | 312 | 0.2165 | 0.7557 | 0.7062 | 0.7301 | 0.9378 |
+ | No log | 4.0 | 416 | 0.2148 | 0.7402 | 0.7398 | 0.7400 | 0.9388 |
+ | 0.2565 | 5.0 | 520 | 0.2071 | 0.7502 | 0.7385 | 0.7443 | 0.9411 |


  ### Framework versions
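The numbers in the README.md diff can be cross-checked with a little arithmetic. The sketch below recomputes F1 as the harmonic mean of the reported precision and recall for the final row of each run, and then derives which training-set sizes are consistent with the steps-per-epoch values (207 at batch size 16, 104 at batch size 32). The second part rests on assumptions the diff does not confirm: a single device, no gradient accumulation, and a final partial batch that is kept rather than dropped. The helper names `f1_score` and `example_count_range` are illustrative only.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) for the last row of each run,
# copied from the training-results tables in the diff.
final_rows = {
    "before (batch 16, 10 epochs)": (0.8157, 0.7934, 0.8044),
    "after (batch 32, 5 epochs)": (0.7502, 0.7385, 0.7443),
}

for name, (p, r, reported) in final_rows.items():
    recomputed = f1_score(p, r)
    # Inputs are rounded to 4 decimals, so compare at the same precision.
    print(f"{name}: recomputed F1 = {recomputed:.4f}, reported = {reported:.4f}")


def example_count_range(steps_per_epoch: int, batch_size: int) -> range:
    """Training-set sizes N for which ceil(N / batch_size) == steps_per_epoch.

    Assumes one optimizer step per batch and that the last partial batch is kept.
    """
    low = (steps_per_epoch - 1) * batch_size + 1
    high = steps_per_epoch * batch_size
    return range(low, high + 1)


old = example_count_range(207, 16)
new = example_count_range(104, 32)
overlap = range(max(old.start, new.start), min(old.stop, new.stop))
print(f"sizes consistent with both runs: {overlap.start}..{overlap.stop - 1}")
```

Under those assumptions both runs are consistent with a training set of between 3297 and 3312 examples, which suggests the data did not change between the two commits and only the batch size and epoch count did.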
config.json CHANGED
@@ -9,16 +9,16 @@
  "hidden_dropout_prob": 0.1,
  "hidden_size": 768,
  "id2label": {
- "0": "LABEL_0",
- "1": "LABEL_1",
- "2": "LABEL_2",
- "3": "LABEL_3",
- "4": "LABEL_4",
- "5": "LABEL_5",
- "6": "LABEL_6",
- "7": "LABEL_7",
- "8": "LABEL_8",
- "9": "LABEL_9"
+ "0": "O",
+ "1": "B-per",
+ "2": "I-per",
+ "3": "B-org",
+ "4": "I-org",
+ "5": "B-loc",
+ "6": "I-loc",
+ "7": "B-misc",
+ "8": "I-misc",
+ "9": "B-deriv-per"
  },
  "initializer_range": 0.02,
  "intermediate_size": 3072,
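The practical effect of the id2label change is that predicted class ids decode to BIO entity tags instead of generic `LABEL_n` names. A minimal sketch of that decoding, using only the ten entries visible in the diff: the JSON keys are strings, so they are converted to integers first. The `decode` helper and the example prediction ids are hypothetical, and the diff does not show whether a matching `label2id` entry was updated in the same commit.

```python
import json

# The new "id2label" block, copied from the config.json diff.
raw = json.loads("""
{
  "0": "O",
  "1": "B-per",
  "2": "I-per",
  "3": "B-org",
  "4": "I-org",
  "5": "B-loc",
  "6": "I-loc",
  "7": "B-misc",
  "8": "I-misc",
  "9": "B-deriv-per"
}
""")

# JSON object keys are always strings; class ids are integers.
id2label = {int(k): v for k, v in raw.items()}

# Inverse mapping, useful when encoding gold tags for training.
label2id = {label: idx for idx, label in id2label.items()}


def decode(pred_ids):
    """Map a sequence of predicted class ids to tag strings."""
    return [id2label[i] for i in pred_ids]


# Hypothetical per-token predictions, for illustration only.
example_ids = [1, 2, 0, 5]
print(decode(example_ids))
```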
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:408ab60347d2161cf875856c1d42954cece5df1493039a9b4a441cafda01ea18
+ oid sha256:ec7b9a76d0659ac4fa7dfd90789ea31addd81861c9cb485282eb9607256138da
  size 435681969
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:efb6091d29981239ada696d08d7ffe35b489163a49f6c6e0462f88e10f9294eb
+ oid sha256:6b35accb70711230c2bc8838c7735dc30698cdca306d100123e49278684e1394
  size 2671
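The two `.bin` entries are Git LFS pointer files rather than the binaries themselves, so each diff shows only a changed `oid` with an unchanged `size`. A minimal sketch of reading such a pointer and comparing the two versions follows. It assumes the three-line `key value` layout shown in the diff, and `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling.

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file made of 'key value' lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    # The oid has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Old and new pointers for pytorch_model.bin, copied from the diff.
old_ptr = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:408ab60347d2161cf875856c1d42954cece5df1493039a9b4a441cafda01ea18
size 435681969
""")
new_ptr = parse_lfs_pointer("""\
version https://git-lfs.github.com/spec/v1
oid sha256:ec7b9a76d0659ac4fa7dfd90789ea31addd81861c9cb485282eb9607256138da
size 435681969
""")

same_size = old_ptr["size"] == new_ptr["size"]
same_content = old_ptr["digest"] == new_ptr["digest"]
print(f"same size: {same_size}, same content: {same_content}")
```

An identical byte size with a different digest is what one would expect when the same architecture is retrained: the tensor shapes are unchanged, so the file length stays the same while the weights differ.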