graphcore-rahult committed on
Commit b7ac2c2 (1 parent: b400a1d)

update model card README.md

Files changed (1)
  1. README.md +10 -10
README.md CHANGED
@@ -21,11 +21,11 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [bert-base-uncased](https://huggingface.co/bert-base-uncased) on the conll2003 dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.0714
- - Precision: 0.8968
- - Recall: 0.9138
- - F1: 0.9052
- - Accuracy: 0.9796
+ - Loss: 0.0626
+ - Precision: 0.9201
+ - Recall: 0.9350
+ - F1: 0.9275
+ - Accuracy: 0.9832

  ## Model description

@@ -50,8 +50,8 @@ The following hyperparameters were used during training:
  - seed: 42
  - distributed_type: IPU
  - gradient_accumulation_steps: 16
- - total_train_batch_size: 64
- - total_eval_batch_size: 20
+ - total_train_batch_size: 16
+ - total_eval_batch_size: 5
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
  - num_epochs: 3
@@ -61,9 +61,9 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1 | Accuracy |
  |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
- | 0.1048 | 1.0 | 219 | 0.0948 | 0.8606 | 0.8813 | 0.8708 | 0.9734 |
- | 0.1062 | 2.0 | 438 | 0.0742 | 0.8964 | 0.9072 | 0.9018 | 0.9789 |
- | 0.0458 | 3.0 | 657 | 0.0714 | 0.8968 | 0.9138 | 0.9052 | 0.9796 |
+ | 0.0943 | 1.0 | 877 | 0.0687 | 0.9019 | 0.9149 | 0.9084 | 0.9801 |
+ | 0.2395 | 2.0 | 1754 | 0.0623 | 0.9221 | 0.9298 | 0.9259 | 0.9829 |
+ | 0.0241 | 3.0 | 2631 | 0.0626 | 0.9201 | 0.9350 | 0.9275 | 0.9832 |


  ### Framework versions
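
The figures changed in this commit can be cross-checked with a little arithmetic. The sketch below recomputes F1 as the harmonic mean of the reported precision and recall, and estimates the optimizer steps per epoch from `total_train_batch_size`. It assumes the conll2003 training split has 14,041 sentences and that an incomplete final batch is dropped. Neither assumption is stated in the model card, and the helper names are illustrative only. Because the inputs are rounded to four decimals, the recomputed F1 can only be expected to match to within rounding.

```python
# Sanity checks for the figures reported in the model card diff.
# Assumptions (not stated in the card): the conll2003 train split has
# 14,041 sentences, and an incomplete final batch is dropped.

CONLL2003_TRAIN_SIZE = 14_041  # assumed dataset size


def f1_from(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * precision * recall / (precision + recall)


def steps_per_epoch(num_examples: int, total_batch_size: int) -> int:
    """Optimizer steps per epoch when the last partial batch is dropped."""
    return num_examples // total_batch_size


# (precision, recall, reported F1) at the final epoch, before and after.
final_metrics = {
    "before": (0.8968, 0.9138, 0.9052),
    "after": (0.9201, 0.9350, 0.9275),
}

# (total_train_batch_size, step count logged at epoch 1.0).
batch_configs = {
    "before": (64, 219),
    "after": (16, 877),
}

for label, (p, r, reported) in final_metrics.items():
    print(f"{label}: recomputed F1 = {f1_from(p, r):.4f}, reported = {reported}")

for label, (batch_size, logged_steps) in batch_configs.items():
    estimate = steps_per_epoch(CONLL2003_TRAIN_SIZE, batch_size)
    print(f"{label}: estimated steps/epoch = {estimate}, logged = {logged_steps}")
```

Under these assumptions, both estimates agree with the Step column at epoch 1.0 (219 and 877). This is consistent with the step count rising roughly fourfold because the total training batch size dropped from 64 to 16.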