MahdiSUST commited on
Commit
7bc2cc7
1 Parent(s): 8299998

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -6
README.md CHANGED
@@ -3,7 +3,7 @@ license: bigscience-openrail-m
3
  tags:
4
  - generated_from_trainer
5
  model-index:
6
- - name: mt5_large_riju_data
7
  results: []
8
  ---
9
 
@@ -20,8 +20,7 @@ It achieves the following results on the evaluation set:
20
  - eval_runtime: 229.14
21
  - eval_samples_per_second: 4.159
22
  - eval_steps_per_second: 2.082
23
- - epoch: 2.0
24
- - step: 17156
25
 
26
  ## Model description
27
 
@@ -41,9 +40,8 @@ More information needed
41
 
42
  The following hyperparameters were used during training:
43
  - learning_rate: 5.6e-05
44
- - train_batch_size: 2
45
- - eval_batch_size: 2
46
- - seed: 42
47
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
48
  - lr_scheduler_type: linear
49
  - num_epochs: 10
3
  tags:
4
  - generated_from_trainer
5
  model-index:
6
+ - name: Bangla summarization using mt5-base
7
  results: []
8
  ---
9
 
20
  - eval_runtime: 229.14
21
  - eval_samples_per_second: 4.159
22
  - eval_steps_per_second: 2.082
23
+ - epoch: 9
 
24
 
25
  ## Model description
26
 
40
 
41
  The following hyperparameters were used during training:
42
  - learning_rate: 5.6e-05
43
+ - train_batch_size: 9
44
+ - eval_batch_size: 9
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
  - num_epochs: 10