rujengelal commited on
Commit
dff17d2
1 Parent(s): f8fe839

End of training

Browse files
README.md CHANGED
@@ -17,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [rujengelal/nepali_t5](https://huggingface.co/rujengelal/nepali_t5) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 2.9595
21
- - Bleu: 5.529
22
- - Gen Len: 15.9474
23
 
24
  ## Model description
25
 
@@ -51,16 +51,16 @@ The following hyperparameters were used during training:
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
  |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
54
- | 3.5866 | 1.0 | 17734 | 3.3249 | 4.351 | 16.0208 |
55
- | 3.4502 | 2.0 | 35468 | 3.2435 | 4.6878 | 16.0742 |
56
- | 3.4103 | 3.0 | 53202 | 3.1736 | 4.9997 | 16.024 |
57
- | 3.306 | 4.0 | 70936 | 3.1176 | 5.0627 | 16.1368 |
58
- | 3.1849 | 5.0 | 88670 | 3.0681 | 5.2125 | 16.0203 |
59
- | 3.1681 | 6.0 | 106404 | 3.0311 | 5.2869 | 15.7844 |
60
- | 3.1283 | 7.0 | 124138 | 3.0028 | 5.3816 | 16.1057 |
61
- | 3.0484 | 8.0 | 141872 | 2.9804 | 5.3871 | 15.9089 |
62
- | 3.0153 | 9.0 | 159606 | 2.9638 | 5.5117 | 15.8761 |
63
- | 3.0429 | 10.0 | 177340 | 2.9595 | 5.529 | 15.9474 |
64
 
65
 
66
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [rujengelal/nepali_t5](https://huggingface.co/rujengelal/nepali_t5) on an unknown dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 2.6633
21
+ - Bleu: 6.3134
22
+ - Gen Len: 15.9835
23
 
24
  ## Model description
25
 
 
51
 
52
  | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
53
  |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
54
+ | 3.0928 | 1.0 | 17734 | 2.8330 | 5.4935 | 15.9053 |
55
+ | 3.101 | 2.0 | 35468 | 2.8127 | 5.5409 | 15.8787 |
56
+ | 3.0165 | 3.0 | 53202 | 2.7814 | 5.6622 | 15.9238 |
57
+ | 2.9973 | 4.0 | 70936 | 2.7532 | 5.8108 | 15.8996 |
58
+ | 2.8885 | 5.0 | 88670 | 2.7294 | 5.9077 | 15.8805 |
59
+ | 2.8114 | 6.0 | 106404 | 2.7074 | 6.1401 | 15.9749 |
60
+ | 2.7791 | 7.0 | 124138 | 2.6905 | 6.1567 | 15.9531 |
61
+ | 2.7729 | 8.0 | 141872 | 2.6782 | 6.1865 | 15.9688 |
62
+ | 2.7128 | 9.0 | 159606 | 2.6699 | 6.2233 | 16.063 |
63
+ | 2.7398 | 10.0 | 177340 | 2.6633 | 6.3134 | 15.9835 |
64
 
65
 
66
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:46975bcd2e7480b26f221ba860538bc8dbe64cbb666e22db081bed1b6aaad404
3
  size 191081512
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:af6ada7568eb6e75f298a8b60f42ce7052b3948ff38bd952975c9194922a19f0
3
  size 191081512
runs/Apr28_13-43-18_fb32f442c803/events.out.tfevents.1714311799.fb32f442c803.25.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:8c2c60a8acd44cbfb9a7290bce9d606a6a85f22714ba2d2418e26daab8026496
3
+ size 85706
tokenizer_config.json CHANGED
@@ -930,8 +930,12 @@
930
  "clean_up_tokenization_spaces": true,
931
  "eos_token": "</s>",
932
  "extra_ids": 100,
 
933
  "model_max_length": 512,
934
  "pad_token": "<pad>",
 
935
  "tokenizer_class": "T5Tokenizer",
 
 
936
  "unk_token": "<unk>"
937
  }
 
930
  "clean_up_tokenization_spaces": true,
931
  "eos_token": "</s>",
932
  "extra_ids": 100,
933
+ "max_length": 512,
934
  "model_max_length": 512,
935
  "pad_token": "<pad>",
936
+ "stride": 0,
937
  "tokenizer_class": "T5Tokenizer",
938
+ "truncation_side": "right",
939
+ "truncation_strategy": "longest_first",
940
  "unk_token": "<unk>"
941
  }
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bb2562c62a13d7b89c2992c3a3efcdddf69de2aad771394b18d65a9f2865a0a2
3
  size 5048
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b5c04d5e14d29aacbd0099666019608d5183dc1ad99c8edf6599a5de4266db64
3
  size 5048