Model save

Files changed (3) hide show

README.md CHANGED Viewed

@@ -3,14 +3,14 @@ base_model: csebuetnlp/mT5_m2o_hindi_crossSum
 tags:
 - generated_from_trainer
 model-index:
-- name: marian-t5
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# marian-t5
 This model is a fine-tuned version of [csebuetnlp/mT5_m2o_hindi_crossSum](https://huggingface.co/csebuetnlp/mT5_m2o_hindi_crossSum) on an unknown dataset.
@@ -32,12 +32,16 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 3.0
 ### Framework versions

 tags:
 - generated_from_trainer
 model-index:
+- name: mt5-hihi
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# mt5-hihi
 This model is a fine-tuned version of [csebuetnlp/mT5_m2o_hindi_crossSum](https://huggingface.co/csebuetnlp/mT5_m2o_hindi_crossSum) on an unknown dataset.
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 2
+- eval_batch_size: 2
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 1.0
+### Training results
 ### Framework versions

generation_config.json ADDED Viewed

+{
+  "_from_model_config": true,
+  "decoder_start_token_id": 250021,
+  "eos_token_id": 1,
+  "length_penalty": 0.6,
+  "max_length": 84,
+  "num_beams": 4,
+  "pad_token_id": 0,
+  "transformers_version": "4.37.2"
+}

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:83d837297e070763a43247a2f84b23d9ee4060ee1ec2a5e8d6499dfad7ad320b
 size 2329638768

 version https://git-lfs.github.com/spec/v1
+oid sha256:2b5d830122bbbe5848ff983f99c1cbeb247dfd77c660d1fc850bacb07e42cee3
 size 2329638768