graphcore-rahult commited on
Commit
8544c7f
1 Parent(s): c25637e

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the xsum dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 2.5273
20
 
21
  ## Model description
22
 
@@ -41,8 +41,8 @@ The following hyperparameters were used during training:
41
  - seed: 42
42
  - distributed_type: IPU
43
  - gradient_accumulation_steps: 16
44
- - total_train_batch_size: 64
45
- - total_eval_batch_size: 20
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
  - num_epochs: 1
@@ -50,9 +50,9 @@ The following hyperparameters were used during training:
50
 
51
  ### Training results
52
 
53
- | Training Loss | Epoch | Step | Validation Loss |
54
- |:-------------:|:-----:|:----:|:---------------:|
55
- | 2.6962 | 1.0 | 3188 | 2.5273 |
56
 
57
 
58
  ### Framework versions
 
16
 
17
  This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the xsum dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 2.4688
20
 
21
  ## Model description
22
 
 
41
  - seed: 42
42
  - distributed_type: IPU
43
  - gradient_accumulation_steps: 16
44
+ - total_train_batch_size: 16
45
+ - total_eval_batch_size: 5
46
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
47
  - lr_scheduler_type: linear
48
  - num_epochs: 1
 
50
 
51
  ### Training results
52
 
53
+ | Training Loss | Epoch | Step | Validation Loss |
54
+ |:-------------:|:-----:|:-----:|:---------------:|
55
+ | 2.7197 | 1.0 | 12752 | 2.4688 |
56
 
57
 
58
  ### Framework versions