TasmiaAzmi commited on
Commit
909adbd
·
1 Parent(s): 584e687

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +18 -12
README.md CHANGED
@@ -12,9 +12,9 @@ should probably proofread and complete it, then remove this comment. -->
12
 
13
  # masked-sentence-generation
14
 
15
- This model is a fine-tuned version of [google/flan-t5-large](https://huggingface.co/google/flan-t5-large) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: nan
18
 
19
  ## Model description
20
 
@@ -34,23 +34,29 @@ More information needed
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0001
37
- - train_batch_size: 1
38
- - eval_batch_size: 1
39
  - seed: 42
40
  - gradient_accumulation_steps: 16
41
- - total_train_batch_size: 16
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
- - num_epochs: 7
45
 
46
  ### Training results
47
 
48
- | Training Loss | Epoch | Step | Validation Loss |
49
- |:-------------------------------:|:-----:|:----:|:---------------:|
50
- | 84911378280078883749363712.0000 | 1.5 | 100 | nan |
51
- | 0.0 | 2.99 | 200 | nan |
52
- | 0.0 | 4.49 | 300 | nan |
53
- | 0.0 | 5.98 | 400 | nan |
 
 
 
 
 
 
54
 
55
 
56
  ### Framework versions
 
12
 
13
  # masked-sentence-generation
14
 
15
+ This model is a fine-tuned version of [t5-base](https://huggingface.co/t5-base) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
+ - Loss: 2.6669
18
 
19
  ## Model description
20
 
 
34
 
35
  The following hyperparameters were used during training:
36
  - learning_rate: 0.0001
37
+ - train_batch_size: 4
38
+ - eval_batch_size: 4
39
  - seed: 42
40
  - gradient_accumulation_steps: 16
41
+ - total_train_batch_size: 64
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
+ - num_epochs: 10
45
 
46
  ### Training results
47
 
48
+ | Training Loss | Epoch | Step | Validation Loss |
49
+ |:-------------:|:-----:|:----:|:---------------:|
50
+ | 2.9063 | 0.99 | 100 | 2.6335 |
51
+ | 2.76 | 1.99 | 200 | 2.6237 |
52
+ | 2.6728 | 2.98 | 300 | 2.6239 |
53
+ | 2.5965 | 3.97 | 400 | 2.6301 |
54
+ | 2.535 | 4.96 | 500 | 2.6386 |
55
+ | 2.483 | 5.96 | 600 | 2.6485 |
56
+ | 2.4399 | 6.95 | 700 | 2.6544 |
57
+ | 2.41 | 7.94 | 800 | 2.6620 |
58
+ | 2.3816 | 8.93 | 900 | 2.6658 |
59
+ | 2.3743 | 9.93 | 1000 | 2.6669 |
60
 
61
 
62
  ### Framework versions