rujengelal commited on
Commit
f8fe839
1 Parent(s): 9a8da27

End of training

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: apache-2.0
3
- base_model: t5-small
4
  tags:
5
  - generated_from_trainer
6
  metrics:
@@ -15,11 +15,11 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  # nepali_t5
17
 
18
- This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 3.4274
21
- - Bleu: 4.4727
22
- - Gen Len: 16.0017
23
 
24
  ## Model description
25
 
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
  |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
54
- | 5.0 | 1.0 | 17734 | 4.7335 | 2.2286 | 15.5907 |
55
- | 4.4395 | 2.0 | 35468 | 4.2401 | 2.9281 | 15.7406 |
56
- | 4.1509 | 3.0 | 53202 | 3.9709 | 3.206 | 16.1203 |
57
- | 3.9609 | 4.0 | 70936 | 3.7968 | 3.6191 | 15.8338 |
58
- | 3.8746 | 5.0 | 88670 | 3.6712 | 3.8795 | 16.0679 |
59
- | 3.7316 | 6.0 | 106404 | 3.5811 | 3.9517 | 15.9977 |
60
- | 3.7038 | 7.0 | 124138 | 3.5185 | 4.2873 | 16.0255 |
61
- | 3.5782 | 8.0 | 141872 | 3.4695 | 4.3817 | 16.0927 |
62
- | 3.5957 | 9.0 | 159606 | 3.4387 | 4.4197 | 16.0783 |
63
- | 3.564 | 10.0 | 177340 | 3.4274 | 4.4727 | 16.0017 |
64
 
65
 
66
  ### Framework versions
 
1
  ---
2
  license: apache-2.0
3
+ base_model: rujengelal/nepali_t5
4
  tags:
5
  - generated_from_trainer
6
  metrics:
 
15
 
16
  # nepali_t5
17
 
18
+ This model is a fine-tuned version of [rujengelal/nepali_t5](https://huggingface.co/rujengelal/nepali_t5) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 2.9595
21
+ - Bleu: 5.529
22
+ - Gen Len: 15.9474
23
 
24
  ## Model description
25
 
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
  |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
54
+ | 3.5866 | 1.0 | 17734 | 3.3249 | 4.351 | 16.0208 |
55
+ | 3.4502 | 2.0 | 35468 | 3.2435 | 4.6878 | 16.0742 |
56
+ | 3.4103 | 3.0 | 53202 | 3.1736 | 4.9997 | 16.024 |
57
+ | 3.306 | 4.0 | 70936 | 3.1176 | 5.0627 | 16.1368 |
58
+ | 3.1849 | 5.0 | 88670 | 3.0681 | 5.2125 | 16.0203 |
59
+ | 3.1681 | 6.0 | 106404 | 3.0311 | 5.2869 | 15.7844 |
60
+ | 3.1283 | 7.0 | 124138 | 3.0028 | 5.3816 | 16.1057 |
61
+ | 3.0484 | 8.0 | 141872 | 2.9804 | 5.3871 | 15.9089 |
62
+ | 3.0153 | 9.0 | 159606 | 2.9638 | 5.5117 | 15.8761 |
63
+ | 3.0429 | 10.0 | 177340 | 2.9595 | 5.529 | 15.9474 |
64
 
65
 
66
  ### Framework versions
config.json CHANGED
@@ -1,5 +1,5 @@
1
  {
2
- "_name_or_path": "t5-small",
3
  "architectures": [
4
  "T5ForConditionalGeneration"
5
  ],
 
1
  {
2
+ "_name_or_path": "rujengelal/nepali_t5",
3
  "architectures": [
4
  "T5ForConditionalGeneration"
5
  ],
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:14d93818b254100cb7503c2e4353164b9d45ae107bdab3e7aa64b3376840ad3e
3
  size 191081512
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:46975bcd2e7480b26f221ba860538bc8dbe64cbb666e22db081bed1b6aaad404
3
  size 191081512
runs/Apr28_02-23-46_f05d94549f92/events.out.tfevents.1714271027.f05d94549f92.24.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d1c3d5db294217a20f8f1364e2f13feaa66985323ab519b635d19323a7f658d1
3
+ size 85706
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:75f7ea09b70d0e93b7fe6de0bab2000d62989352d1636f4a12be1cb2f817986f
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:bb2562c62a13d7b89c2992c3a3efcdddf69de2aad771394b18d65a9f2865a0a2
3
  size 5048