milyiyo committed on
Commit
02df826
1 Parent(s): 25bfeb1

update model card README.md

  1. README.md +9 -25
README.md CHANGED
@@ -14,7 +14,12 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [flax-community/spanish-t5-small](https://huggingface.co/flax-community/spanish-t5-small) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 3.1909
+ - eval_loss: 2.9602
+ - eval_runtime: 124.6439
+ - eval_samples_per_second: 219.369
+ - eval_steps_per_second: 54.844
+ - epoch: 1.1
+ - step: 67373

  ## Model description

@@ -34,33 +39,12 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 2e-05
- - train_batch_size: 12
- - eval_batch_size: 12
+ - train_batch_size: 4
+ - eval_batch_size: 4
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 15
-
- ### Training results
-
- | Training Loss | Epoch | Step | Validation Loss |
- |:-------------:|:-----:|:-----:|:---------------:|
- | 3.5448 | 1.0 | 3347 | 3.4082 |
- | 3.4232 | 2.0 | 6694 | 3.3411 |
- | 3.3338 | 3.0 | 10041 | 3.2953 |
- | 3.2769 | 4.0 | 13388 | 3.2608 |
- | 3.2347 | 5.0 | 16735 | 3.2417 |
- | 3.1921 | 6.0 | 20082 | 3.2319 |
- | 3.1512 | 7.0 | 23429 | 3.2162 |
- | 3.1188 | 8.0 | 26776 | 3.2117 |
- | 3.1045 | 9.0 | 30123 | 3.2022 |
- | 3.0718 | 10.0 | 33470 | 3.1961 |
- | 3.0644 | 11.0 | 36817 | 3.1928 |
- | 3.0403 | 12.0 | 40164 | 3.1922 |
- | 3.0332 | 13.0 | 43511 | 3.1915 |
- | 3.0225 | 14.0 | 46858 | 3.1899 |
- | 3.0066 | 15.0 | 50205 | 3.1909 |
-
+ - num_epochs: 5

  ### Framework versions
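
The evaluation figures added in this commit can be cross-checked against one another. The following is a minimal sketch, assuming that `eval_samples_per_second` and `eval_steps_per_second` are averages over the full `eval_runtime` and that evaluation ran on a single device. The variable names are illustrative and do not come from the commit.

```python
import math

# Values copied from the updated model card in this commit.
eval_runtime = 124.6439            # seconds
eval_samples_per_second = 219.369
eval_steps_per_second = 54.844
eval_batch_size = 4

# Assumption: both rates are averages over the whole evaluation run.
approx_eval_samples = round(eval_runtime * eval_samples_per_second)
approx_eval_steps = round(eval_runtime * eval_steps_per_second)

# Samples processed per step, which should be close to the eval batch size.
implied_batch_size = eval_samples_per_second / eval_steps_per_second

# Steps needed to cover the estimated sample count at the stated batch size.
steps_from_samples = math.ceil(approx_eval_samples / eval_batch_size)

print(approx_eval_samples, approx_eval_steps, steps_from_samples)
print(round(implied_batch_size, 2))
```

Under these assumptions, the implied samples per step comes out very close to the `eval_batch_size: 4` recorded in the hyperparameter list, and the step count derived from the sample estimate agrees with the one derived from the runtime. The evaluation set size is an estimate only; the commit does not state it.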