Shiva26 committed
Commit ae3f272
1 parent: db6b17b

Training in progress epoch 0

Files changed (3)
  1. README.md +8 -28
  2. config.json +1 -1
  3. tf_model.h5 +1 -1
README.md CHANGED
@@ -1,26 +1,10 @@
 ---
-base_model: Shiva26/marian-finetuned-iitb-en-to-hi
+base_model: Shiva26/marian-finetuned-iitb-en-to-hi-v2
 tags:
 - generated_from_keras_callback
-- english to hindi
-- english to hindi translation
-- robust
-- medium size model
-- tensorflow
-- translator
-- translation
 model-index:
 - name: Shiva26/marian-finetuned-iitb-en-to-hi-v2
   results: []
-datasets:
-- cfilt/iitb-english-hindi
-language:
-- hi
-- en
-metrics:
-- bleu
-- sacrebleu
-library_name: transformers
 ---
 
 <!-- This model card has been generated automatically according to the information Keras had access to. You should
@@ -28,13 +12,11 @@ probably proofread and complete it, then remove this comment. -->
 
 # Shiva26/marian-finetuned-iitb-en-to-hi-v2
 
-This model is a fine-tuned version of [Shiva26/marian-finetuned-iitb-en-to-hi](https://huggingface.co/Shiva26/marian-finetuned-iitb-en-to-hi) on an cfilt/iitb-english-hindi dataset.
+This model is a fine-tuned version of [Shiva26/marian-finetuned-iitb-en-to-hi-v2](https://huggingface.co/Shiva26/marian-finetuned-iitb-en-to-hi-v2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-
-- sacrebleu Score: 16.33637186072388 (previously>1.59983786493238)
-- Train Loss: 2.0584
-- Validation Loss: 3.1803
-- Epoch: 2
+- Train Loss: 2.3815
+- Validation Loss: 3.1585
+- Epoch: 0
 
 ## Model description
 
@@ -53,16 +35,14 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': 37500, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
+- optimizer: {'name': 'AdamWeightDecay', 'learning_rate': {'class_name': 'PolynomialDecay', 'config': {'initial_learning_rate': 5e-05, 'decay_steps': 4686, 'end_learning_rate': 0.0, 'power': 1.0, 'cycle': False, 'name': None}}, 'decay': 0.0, 'beta_1': 0.9, 'beta_2': 0.999, 'epsilon': 1e-08, 'amsgrad': False, 'weight_decay_rate': 0.01}
 - training_precision: float32
 
 ### Training results
 
 | Train Loss | Validation Loss | Epoch |
 |:----------:|:---------------:|:-----:|
-| 2.9850 | 3.4368 | 0 |
-| 2.3187 | 3.2323 | 1 |
-| 2.0584 | 3.1803 | 2 |
+| 2.3815 | 3.1585 | 0 |
 
 
 ### Framework versions
@@ -70,4 +50,4 @@ The following hyperparameters were used during training:
 - Transformers 4.33.0
 - TensorFlow 2.12.0
 - Datasets 2.1.0
-- Tokenizers 0.13.3
+- Tokenizers 0.13.3
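
The only hyperparameter that differs between the two README versions is `decay_steps` in the `PolynomialDecay` learning-rate schedule (37500 before, 4686 after). As a rough illustration of what that value controls, here is a plain-Python sketch of the non-cycling polynomial decay formula. It is not the Keras implementation; the function name `polynomial_decay` and the clamping of negative steps are assumptions made for this example, and the default arguments are copied from the optimizer entry above.

```python
def polynomial_decay(step, initial_lr=5e-05, decay_steps=4686, end_lr=0.0, power=1.0):
    """Learning rate at `step` for a polynomial decay schedule with cycle=False.

    Hypothetical helper for illustration; defaults mirror the README's optimizer entry.
    """
    # Without cycling, the step is clamped to decay_steps, so the rate
    # stays at end_lr once the schedule has run out.
    # Clamping at 0 is an extra guard added for this sketch.
    step = min(max(step, 0), decay_steps)
    remaining = 1.0 - step / decay_steps
    return (initial_lr - end_lr) * remaining ** power + end_lr


if __name__ == "__main__":
    # Compare the new schedule (4686 steps) with the previous one (37500 steps).
    for s in (0, 2343, 4686):
        print(s, polynomial_decay(s), polynomial_decay(s, decay_steps=37500))
```

With `power` set to 1.0 the decay is linear, so under the new setting the rate reaches zero at step 4686, whereas the earlier 37500-step schedule would still be well above zero at that point.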
config.json CHANGED
@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "Shiva26/marian-finetuned-iitb-en-to-hi",
+  "_name_or_path": "Shiva26/marian-finetuned-iitb-en-to-hi-v2",
   "activation_dropout": 0.0,
   "activation_function": "swish",
   "add_bias_logits": false,
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:37f2cbb587921c2c9a838e60d085f0fb10bb2f508a58b7255a672547ae57a8ed
+oid sha256:b973a244ad3a71241a97ff5cee21ce3586123066eee45eb62fe964cba2c1fe7d
 size 306059944
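
The `tf_model.h5` entry is a Git LFS pointer rather than the weights themselves: three lines giving the spec version, the SHA-256 of the file contents, and the size in bytes. In this commit only the `oid` changes while `size` stays at 306059944. The sketch below shows how such a pointer can be derived from raw bytes; `lfs_pointer` is a hypothetical helper written for this example and is not part of the Git LFS tooling.

```python
import hashlib


def lfs_pointer(data: bytes) -> str:
    """Build Git LFS pointer text for `data` (hypothetical helper, for illustration)."""
    # The oid is the SHA-256 digest of the file contents, in lowercase hex.
    digest = hashlib.sha256(data).hexdigest()
    return (
        "version https://git-lfs.github.com/spec/v1\n"
        f"oid sha256:{digest}\n"
        f"size {len(data)}\n"
    )


if __name__ == "__main__":
    # Two payloads of equal length produce the same size line but different oids.
    print(lfs_pointer(b"weights-a"))
    print(lfs_pointer(b"weights-b"))
```

For a real checkpoint of this size, hashing the file in chunks would avoid holding all of it in memory at once.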