Aleksandar committed on
Commit
66caa2f
1 Parent(s): 3128e5f
Files changed (4)
  1. README.md +14 -20
  2. config.json +7 -7
  3. pytorch_model.bin +1 -1
  4. training_args.bin +1 -1
README.md CHANGED
@@ -21,7 +21,7 @@ model_index:
  metric:
  name: Accuracy
  type: accuracy
- value: 0.95641898994996
+ value: 0.9500777931962491
  ---
 
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -31,11 +31,11 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model was trained from scratch on the wikiann dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.3017
- - Precision: 0.8911
- - Recall: 0.9081
- - F1: 0.8995
- - Accuracy: 0.9564
+ - Loss: 0.1859
+ - Precision: 0.8742
+ - Recall: 0.8907
+ - F1: 0.8824
+ - Accuracy: 0.9501
 
  ## Model description
 
@@ -55,27 +55,21 @@ More information needed
 
  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 16
+ - train_batch_size: 32
  - eval_batch_size: 8
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 10
+ - num_epochs: 4
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
- |:-------------:|:-----:|:-----:|:---------------:|:---------:|:------:|:------:|:--------:|
- | 0.2535 | 1.0 | 1250 | 0.2015 | 0.8494 | 0.8605 | 0.8549 | 0.9376 |
- | 0.1461 | 2.0 | 2500 | 0.1853 | 0.8800 | 0.8681 | 0.8740 | 0.9464 |
- | 0.0914 | 3.0 | 3750 | 0.2022 | 0.8695 | 0.8912 | 0.8802 | 0.9485 |
- | 0.0545 | 4.0 | 5000 | 0.2214 | 0.8758 | 0.8975 | 0.8865 | 0.9514 |
- | 0.0385 | 5.0 | 6250 | 0.2536 | 0.8806 | 0.9010 | 0.8907 | 0.9523 |
- | 0.0266 | 6.0 | 7500 | 0.2506 | 0.8834 | 0.9020 | 0.8926 | 0.9539 |
- | 0.0133 | 7.0 | 8750 | 0.2745 | 0.8910 | 0.9057 | 0.8983 | 0.9562 |
- | 0.0077 | 8.0 | 10000 | 0.2946 | 0.8872 | 0.9065 | 0.8968 | 0.9559 |
- | 0.0043 | 9.0 | 11250 | 0.2931 | 0.8902 | 0.9094 | 0.8997 | 0.9567 |
- | 0.0022 | 10.0 | 12500 | 0.3017 | 0.8911 | 0.9081 | 0.8995 | 0.9564 |
+ | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
+ |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+ | 0.3627 | 1.0 | 625 | 0.2077 | 0.8382 | 0.8545 | 0.8463 | 0.9349 |
+ | 0.1894 | 2.0 | 1250 | 0.1764 | 0.8640 | 0.8760 | 0.8700 | 0.9453 |
+ | 0.1326 | 3.0 | 1875 | 0.1848 | 0.8618 | 0.8873 | 0.8744 | 0.9473 |
+ | 0.0712 | 4.0 | 2500 | 0.1859 | 0.8742 | 0.8907 | 0.8824 | 0.9501 |
 
 
  ### Framework versions
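The step counts and F1 values in the README.md diff can be sanity-checked with a few lines of arithmetic. The sketch below assumes 20,000 training examples per epoch, a single device, and no gradient accumulation; none of these is stated in the model card, they are simply the values that reproduce 1250 steps per epoch at batch size 16 and 625 at batch size 32. The function names are illustrative only.

```python
import math


def steps_per_epoch(num_examples: int, batch_size: int) -> int:
    """Optimizer steps needed to see every example once (last batch may be partial)."""
    return math.ceil(num_examples / batch_size)


def f1_score(precision: float, recall: float) -> float:
    """F1 is the harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


# Assumption: not stated in the card, chosen because it matches both tables.
ASSUMED_TRAIN_EXAMPLES = 20_000

# Before the commit: batch size 16, 10 epochs.
old_steps = steps_per_epoch(ASSUMED_TRAIN_EXAMPLES, 16)
old_total_steps = old_steps * 10

# After the commit: batch size 32, 4 epochs.
new_steps = steps_per_epoch(ASSUMED_TRAIN_EXAMPLES, 32)
new_total_steps = new_steps * 4

# Final-epoch precision and recall copied from the two tables.
old_f1 = round(f1_score(0.8911, 0.9081), 4)
new_f1 = round(f1_score(0.8742, 0.8907), 4)

print(old_steps, old_total_steps, old_f1)
print(new_steps, new_total_steps, new_f1)
```

Under those assumptions the totals agree with the last rows of the old and new tables (steps 12500 and 2500), and the recomputed F1 values agree with the reported 0.8995 and 0.8824.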
config.json CHANGED
@@ -9,13 +9,13 @@
  "hidden_dropout_prob": 0.1,
  "hidden_size": 768,
  "id2label": {
- "0": "LABEL_0",
- "1": "LABEL_1",
- "2": "LABEL_2",
- "3": "LABEL_3",
- "4": "LABEL_4",
- "5": "LABEL_5",
- "6": "LABEL_6"
+ "0": "O",
+ "1": "B-per",
+ "2": "I-per",
+ "3": "B-org",
+ "4": "I-org",
+ "5": "B-loc",
+ "6": "I-loc"
  },
  "initializer_range": 0.02,
  "intermediate_size": 3072,
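The config.json change replaces the placeholder `LABEL_n` names with named entity tags, so predicted class indices can be read directly as tags. A minimal sketch of how such a mapping might be used follows; it parses only the `id2label` fragment from the diff, converts the JSON string keys to integers, and decodes a made-up sequence of predicted ids. The `decode` helper and the example ids are hypothetical and not part of the repository.

```python
import json

# The id2label block as it appears after the commit (JSON keys are strings).
CONFIG_FRAGMENT = """
{
  "id2label": {
    "0": "O",
    "1": "B-per",
    "2": "I-per",
    "3": "B-org",
    "4": "I-org",
    "5": "B-loc",
    "6": "I-loc"
  }
}
"""

raw = json.loads(CONFIG_FRAGMENT)["id2label"]

# Model outputs are integer class indices, so use integer keys for lookup.
id2label = {int(idx): label for idx, label in raw.items()}

# Inverse mapping, useful when encoding gold tags for training or evaluation.
label2id = {label: idx for idx, label in id2label.items()}


def decode(pred_ids):
    """Map a sequence of predicted class indices to their tag names."""
    return [id2label[i] for i in pred_ids]


# Hypothetical per-token predictions for a four-token sentence.
example_ids = [1, 2, 0, 5]
example_tags = decode(example_ids)
print(example_tags)
```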
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:b7d8a1f36e88c5b01779347014202b894d16581de832b8ffd92fbe47819a491c
+ oid sha256:e704d485234a893ecf737c6e250580e9dcf0ef3e5a1e200e500e673215c275a4
  size 435672753
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8f81507c71bf9458f8c2acc88833a1e325129d1e0909b54f2b6d8c75619cc011
+ oid sha256:b46a59921cb24bd5481f53740647d36b6e2bdc137016ef8859eb3e9045cdee26
  size 2671
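Both binary files are stored as Git LFS pointers, so the diff shows only a new `oid` while `size` is unchanged: the content differs but the byte count is the same. The sketch below parses a pointer of this shape and checks a downloaded file against it. The helper names and any local path passed to `matches_pointer` are hypothetical; the pointer text is copied from the new side of the pytorch_model.bin diff.

```python
import hashlib
import os


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: str, pointer: dict, chunk_size: int = 1 << 20) -> bool:
    """Return True if the file at `path` has the size and hash the pointer records."""
    if os.path.getsize(path) != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with open(path, "rb") as handle:
        # Read in chunks so large weight files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:e704d485234a893ecf737c6e250580e9dcf0ef3e5a1e200e500e673215c275a4
size 435672753
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

A call such as `matches_pointer("pytorch_model.bin", pointer)` would then confirm that a local copy corresponds to this commit rather than its parent.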