balaramas commited on
Commit
36745e7
1 Parent(s): 3ac49fa

Model save

Browse files
Files changed (3) hide show
  1. README.md +9 -5
  2. generation_config.json +10 -0
  3. model.safetensors +1 -1
README.md CHANGED
@@ -3,14 +3,14 @@ base_model: csebuetnlp/mT5_m2o_hindi_crossSum
3
  tags:
4
  - generated_from_trainer
5
  model-index:
6
- - name: marian-t5
7
  results: []
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
  should probably proofread and complete it, then remove this comment. -->
12
 
13
- # marian-t5
14
 
15
  This model is a fine-tuned version of [csebuetnlp/mT5_m2o_hindi_crossSum](https://huggingface.co/csebuetnlp/mT5_m2o_hindi_crossSum) on an unknown dataset.
16
 
@@ -32,12 +32,16 @@ More information needed
32
 
33
  The following hyperparameters were used during training:
34
  - learning_rate: 5e-05
35
- - train_batch_size: 32
36
- - eval_batch_size: 32
37
  - seed: 42
38
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
39
  - lr_scheduler_type: linear
40
- - num_epochs: 3.0
 
 
 
 
41
 
42
  ### Framework versions
43
 
 
3
  tags:
4
  - generated_from_trainer
5
  model-index:
6
+ - name: mt5-hihi
7
  results: []
8
  ---
9
 
10
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
11
  should probably proofread and complete it, then remove this comment. -->
12
 
13
+ # mt5-hihi
14
 
15
  This model is a fine-tuned version of [csebuetnlp/mT5_m2o_hindi_crossSum](https://huggingface.co/csebuetnlp/mT5_m2o_hindi_crossSum) on an unknown dataset.
16
 
 
32
 
33
  The following hyperparameters were used during training:
34
  - learning_rate: 5e-05
35
+ - train_batch_size: 2
36
+ - eval_batch_size: 2
37
  - seed: 42
38
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
39
  - lr_scheduler_type: linear
40
+ - num_epochs: 1.0
41
+
42
+ ### Training results
43
+
44
+
45
 
46
  ### Framework versions
47
 
generation_config.json ADDED
@@ -0,0 +1,10 @@
 
 
 
 
 
 
 
 
 
 
 
1
+ {
2
+ "_from_model_config": true,
3
+ "decoder_start_token_id": 250021,
4
+ "eos_token_id": 1,
5
+ "length_penalty": 0.6,
6
+ "max_length": 84,
7
+ "num_beams": 4,
8
+ "pad_token_id": 0,
9
+ "transformers_version": "4.37.2"
10
+ }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:83d837297e070763a43247a2f84b23d9ee4060ee1ec2a5e8d6499dfad7ad320b
3
  size 2329638768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:2b5d830122bbbe5848ff983f99c1cbeb247dfd77c660d1fc850bacb07e42cee3
3
  size 2329638768