Fadli Aulawi committed on
Commit bd044ad
1 Parent(s): 351879c

Training in progress epoch 0

Files changed (4):
  1. README.md +7 -8
  2. config.json +1 -1
  3. generation_config.json +1 -1
  4. tf_model.h5 +1 -1
README.md CHANGED
@@ -1,5 +1,6 @@
  ---
  license: apache-2.0
+ base_model: Helsinki-NLP/opus-mt-en-fr
  tags:
  - generated_from_keras_callback
  model-index:
@@ -14,9 +15,9 @@ probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-fr](https://huggingface.co/Helsinki-NLP/opus-mt-en-fr) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 0.6846
- - Validation Loss: 0.8045
- - Epoch: 2
+ - Train Loss: 1.0609
+ - Validation Loss: 0.8949
+ - Epoch: 0
 
  ## Model description
 
@@ -35,21 +36,19 @@ More information needed
  ### Training hyperparameters
 
  The following hyperparameters were used during training:
- - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': 17733, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
+ - optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': 5911, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
  - training_precision: mixed_float16
 
  ### Training results
 
  | Train Loss | Validation Loss | Epoch |
  |:----------:|:---------------:|:-----:|
- | 1.0619     | 0.8784          | 0     |
- | 0.7974     | 0.8210          | 1     |
- | 0.6846     | 0.8045          | 2     |
+ | 1.0609     | 0.8949          | 0     |
 
 
  ### Framework versions
 
- - Transformers 4.30.2
+ - Transformers 4.31.0
  - TensorFlow 2.12.0
  - Datasets 2.13.1
  - Tokenizers 0.13.3
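
The hyperparameters listed in the updated README match the standard Keras fine-tuning setup in 🤗 Transformers. Below is a minimal sketch of that setup, not the author's actual script: the train/validation `tf.data.Dataset`s are assumed to be prepared elsewhere, and the output directory and Hub repo id are hypothetical placeholders.

```python
from tensorflow.keras import mixed_precision
from transformers import AutoTokenizer, TFAutoModelForSeq2SeqLM, create_optimizer
from transformers.keras_callbacks import PushToHubCallback

# training_precision: mixed_float16 (from the README)
mixed_precision.set_global_policy("mixed_float16")

tokenizer = AutoTokenizer.from_pretrained("Helsinki-NLP/opus-mt-en-fr")
model = TFAutoModelForSeq2SeqLM.from_pretrained("Helsinki-NLP/opus-mt-en-fr")

# AdamWeightDecay with a linear (power=1.0) PolynomialDecay schedule:
# 5e-05 decayed to 0.0 over 5911 steps, weight_decay_rate 0.01, as dumped in the README.
optimizer, lr_schedule = create_optimizer(
    init_lr=5e-5,
    num_train_steps=5911,
    num_warmup_steps=0,
    weight_decay_rate=0.01,
)
model.compile(optimizer=optimizer)  # seq2seq loss is computed internally by the model

# The "generated_from_keras_callback" tag and the "Training in progress epoch 0"
# commit message come from a PushToHubCallback, which pushes a checkpoint and
# regenerates the model card after each epoch.
callback = PushToHubCallback(
    output_dir="opus-mt-en-fr-finetuned",                    # hypothetical local directory
    tokenizer=tokenizer,
    hub_model_id="your-username/opus-mt-en-fr-finetuned",    # hypothetical repo id
)

# train_set and val_set are assumed to be tokenized tf.data.Datasets built with
# model.prepare_tf_dataset(...) on the (unspecified) fine-tuning corpus:
# model.fit(train_set, validation_data=val_set, epochs=3, callbacks=[callback])
```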
config.json CHANGED
@@ -55,7 +55,7 @@
  "scale_embedding": true,
  "share_encoder_decoder_embeddings": true,
  "static_position_embeddings": true,
- "transformers_version": "4.30.2",
+ "transformers_version": "4.31.0",
  "use_cache": true,
  "vocab_size": 59514
  }
generation_config.json CHANGED
@@ -12,5 +12,5 @@
  "max_length": 512,
  "num_beams": 4,
  "pad_token_id": 59513,
- "transformers_version": "4.30.2"
+ "transformers_version": "4.31.0"
  }
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:e9d2cabff4de30ffd6fcec98b7f6ea5789d87bd93a44f05bc157253b5e574b19
+ oid sha256:f94ed5911ca935087546922886222b40b8ea63703b1cc21a35624394afc02207
  size 301061272
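
Once the epoch-0 weights in `tf_model.h5` are pushed, the checkpoint loads like any TF Marian model. A small usage sketch follows, with a hypothetical repo id standing in for this repository:

```python
from transformers import AutoTokenizer, TFAutoModelForSeq2SeqLM

repo_id = "your-username/opus-mt-en-fr-finetuned"  # hypothetical; substitute this repo's id

tokenizer = AutoTokenizer.from_pretrained(repo_id)
model = TFAutoModelForSeq2SeqLM.from_pretrained(repo_id)  # reads tf_model.h5

inputs = tokenizer("Training finished without errors.", return_tensors="tf")
# generation_config.json supplies num_beams=4 and max_length=512 by default.
outputs = model.generate(**inputs)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True)[0])
```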