minhngca commited on
Commit
bf6b001
1 Parent(s): 679a113

minhngca_lab1_finetuning_colab

Browse files
Files changed (4) hide show
  1. README.md +4 -4
  2. generation_config.json +1 -0
  3. model.safetensors +1 -1
  4. training_args.bin +1 -1
README.md CHANGED
@@ -22,7 +22,7 @@ model-index:
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
- value: 5.442046712028784
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
32
 
33
  This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-fr](https://huggingface.co/Helsinki-NLP/opus-mt-en-fr) on the kde4 dataset.
34
  It achieves the following results on the evaluation set:
35
- - Loss: 4.7774
36
- - Bleu: 5.4420
37
 
38
  ## Model description
39
 
@@ -58,7 +58,7 @@ The following hyperparameters were used during training:
58
  - seed: 42
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
- - num_epochs: 1
62
  - mixed_precision_training: Native AMP
63
 
64
  ### Training results
 
22
  metrics:
23
  - name: Bleu
24
  type: bleu
25
+ value: 52.88398487672078
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
32
 
33
  This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-fr](https://huggingface.co/Helsinki-NLP/opus-mt-en-fr) on the kde4 dataset.
34
  It achieves the following results on the evaluation set:
35
+ - Loss: 0.8556
36
+ - Bleu: 52.8840
37
 
38
  ## Model description
39
 
 
58
  - seed: 42
59
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
60
  - lr_scheduler_type: linear
61
+ - num_epochs: 3
62
  - mixed_precision_training: Native AMP
63
 
64
  ### Training results
generation_config.json CHANGED
@@ -11,5 +11,6 @@
11
  "max_length": 512,
12
  "num_beams": 4,
13
  "pad_token_id": 59513,
 
14
  "transformers_version": "4.35.2"
15
  }
 
11
  "max_length": 512,
12
  "num_beams": 4,
13
  "pad_token_id": 59513,
14
+ "renormalize_logits": true,
15
  "transformers_version": "4.35.2"
16
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:25e67788978b30c6454ead22c87ef7ba67126fce7598d870eff85acbda002f85
3
  size 298705768
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e8eb885ab5a1f76534225a57286ff338a881e95c7e16c3df8e4cb696a4803d5f
3
  size 298705768
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:5251b3f0412dc8922164c41ce26bd594988b0f8b497a3f940ca29c6583e8508f
3
  size 4792
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ab3e87951d3eb698ee2344da97986c70a10acfdfda9a32cb77bf7895d17785de
3
  size 4792