n3wtou committed
Commit 5374fd8
Parent: 2ea618f

Training in progress epoch 0

.gitattributes CHANGED
@@ -32,3 +32,4 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
 *.zip filter=lfs diff=lfs merge=lfs -text
 *.zst filter=lfs diff=lfs merge=lfs -text
 *tfevents* filter=lfs diff=lfs merge=lfs -text
+tokenizer.json filter=lfs diff=lfs merge=lfs -text
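This hunk routes the newly added tokenizer.json (a ~16 MB file, per the LFS pointer below) through Git LFS alongside the existing archive and TensorBoard patterns. A minimal sketch of how these patterns select files, using Python's fnmatch as a rough approximation of gitattributes' gitignore-style matching; the uses_lfs helper is hypothetical, not part of the repo:

```python
from fnmatch import fnmatch

# Patterns from this .gitattributes hunk that route files through Git LFS.
# Real gitattributes matching is gitignore-style; fnmatch is only an
# approximation used here for illustration.
LFS_PATTERNS = ["*.zip", "*.zst", "*tfevents*", "tokenizer.json"]

def uses_lfs(path: str) -> bool:  # hypothetical helper
    name = path.rsplit("/", 1)[-1]  # patterns without "/" match the basename
    return any(fnmatch(name, pat) for pat in LFS_PATTERNS)

print(uses_lfs("tokenizer.json"))         # True  -> stored as an LFS pointer
print(uses_lfs("tokenizer_config.json"))  # False -> stored as plain text
```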
README.md CHANGED
@@ -12,11 +12,11 @@ probably proofread and complete it, then remove this comment. -->
 
 # n3wtou/mt5-small-finedtuned-4-swahili
 
-This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on [csebuetnlp/xlsum](https://huggingface.co/datasets/csebuetnlp/xlsum/viewer/swahili/train?p=1) dataset.
+This model is a fine-tuned version of [google/mt5-small](https://huggingface.co/google/mt5-small) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Train Loss: 3.1045
-- Validation Loss: 2.5859
-- Epoch: 9
+- Train Loss: 9.3513
+- Validation Loss: 5.1821
+- Epoch: 0
 
 ## Model description
 
@@ -35,23 +35,14 @@
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'WarmUp', 'config': {'initial_learning_rate': 5.6e-05, 'decay_schedule_fn': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5.6e-05, 'decay_steps': 15785, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}, '__passive_serialization__': True}, 'warmup_steps': 5, 'power': 1.0, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.1}
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5.6e-05, 'decay_steps': 987, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
 - training_precision: mixed_float16
 
 ### Training results
 
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 6.8543 | 3.1033 | 0 |
-| 4.3991 | 2.9232 | 1 |
-| 3.9381 | 2.8241 | 2 |
-| 3.6672 | 2.7522 | 3 |
-| 3.4848 | 2.6935 | 4 |
-| 3.3552 | 2.6611 | 5 |
-| 3.2517 | 2.6296 | 6 |
-| 3.1818 | 2.6104 | 7 |
-| 3.1348 | 2.5930 | 8 |
-| 3.1045 | 2.5859 | 9 |
+| 9.3513 | 5.1821 | 0 |
 
 
 ### Framework versions
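The serialized optimizer in the new hunk matches what the transformers library's create_optimizer helper produces for TensorFlow: AdamWeightDecay wrapping a linear PolynomialDecay schedule, with no WarmUp wrapper since warmup is zero. A minimal sketch under that assumption, using the values from the new hunk (987 decay steps, weight decay 0.01); this is an illustration, not the author's actual training script:

```python
import tensorflow as tf
from transformers import create_optimizer

# Matches "training_precision: mixed_float16" from the README.
tf.keras.mixed_precision.set_global_policy("mixed_float16")

# Reconstructs the serialized optimizer config in the new hunk:
# AdamWeightDecay over a linear PolynomialDecay from 5.6e-5 to 0.0 across
# 987 steps, beta_1=0.9, beta_2=0.999, epsilon=1e-8, weight_decay_rate=0.01.
optimizer, lr_schedule = create_optimizer(
    init_lr=5.6e-5,
    num_train_steps=987,
    num_warmup_steps=0,  # no WarmUp wrapper appears in the new config
    weight_decay_rate=0.01,
)
```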
special_tokens_map.json ADDED
@@ -0,0 +1,5 @@
+{
+  "eos_token": "</s>",
+  "pad_token": "<pad>",
+  "unk_token": "<unk>"
+}
spiece.model ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ef78f86560d809067d12bac6c09f19a462cb3af3f54d2b8acbba26e1433125d6
+size 4309802
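The large files in this commit are stored as Git LFS pointers rather than the blobs themselves: a three-line stub recording the spec version, the SHA-256 of the real content, and its size in bytes. A minimal sketch of reading one; the parse_lfs_pointer helper is hypothetical:

```python
def parse_lfs_pointer(text: str) -> dict:  # hypothetical helper
    """Split a Git LFS pointer stub into its key/value fields."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields

pointer = """version https://git-lfs.github.com/spec/v1
oid sha256:ef78f86560d809067d12bac6c09f19a462cb3af3f54d2b8acbba26e1433125d6
size 4309802"""

info = parse_lfs_pointer(pointer)
print(info["oid"])   # sha256:ef78f8... identifies the real blob
print(info["size"])  # 4309802 bytes, the SentencePiece model's true size
```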
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:760f268b5d0ac4587ee568d2ed322832e4e6ce7216d0d1dd17caeaf0c808f902
+oid sha256:3b3a4b7f26fa08ac4cb751ddfa52d4edd0306af8fc8ace26ef9ab33b3bf2d6dd
 size 2225556280
tokenizer.json ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:da4980af4e0649bb07a8cffdad7344bba0401a39dc67fb0256b4da603aae65b9
+size 16330466
tokenizer_config.json ADDED
@@ -0,0 +1,11 @@
+{
+  "additional_special_tokens": null,
+  "clean_up_tokenization_spaces": true,
+  "eos_token": "</s>",
+  "extra_ids": 0,
+  "model_max_length": 1000000000000000019884624838656,
+  "pad_token": "<pad>",
+  "sp_model_kwargs": {},
+  "tokenizer_class": "T5Tokenizer",
+  "unk_token": "<unk>"
+}
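Together, spiece.model, tokenizer.json, tokenizer_config.json, and special_tokens_map.json are the files AutoTokenizer reads when loading this checkpoint. A minimal sketch, assuming the repo id from the README title; the printed values come from the files added in this commit:

```python
from transformers import AutoTokenizer

# Repo id taken from the README title; loading pulls the tokenizer files
# added in this commit (tokenizer.json itself arrives via Git LFS).
tok = AutoTokenizer.from_pretrained("n3wtou/mt5-small-finedtuned-4-swahili")

print(tok.eos_token, tok.pad_token, tok.unk_token)  # </s> <pad> <unk>
print(type(tok).__name__)  # T5TokenizerFast, built from tokenizer.json
```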