shhossain committed on
Commit
801c90b
1 Parent(s): f37e9cf

checkpoint 171895

Files changed (5)
  1. README.md +42 -55
  2. config.json +2 -2
  3. generation_config.json +1 -1
  4. pytorch_model.bin +2 -2
  5. training_args.bin +3 -0
README.md CHANGED
@@ -1,74 +1,61 @@
  ---
  license: apache-2.0
- datasets:
- - opus100
- language:
- - bn
- - en
+ base_model: Helsinki-NLP/opus-mt-en-inc
+ tags:
+ - generated_from_trainer
  metrics:
- - sacrebleu
- pipeline_tag: translation
- widget:
- - text: "Will you come home tonight?"
- example_title: "Example 1"
- - text: "I am so sorry this is a day late, guys. Unfortunately, my internet was down so it was out of my control."
- example_title: "Example 2"
+ - bleu
  model-index:
- - name: shhossain/opus-mt-en-to-bn
- results:
- - task:
- type: translation
- name: Translation
- dataset:
- type: opus100
- name: opus100
- split: validation
- metrics:
- - type: Bleu
- value: 12.537400
- - type: Validation Loss
- value: 2.120669
- - type: Training Loss
- value: 1.771200
+ - name: opus-mt-en-inc-finetuned-en-to-bn-fp16
+ results: []
  ---

- # English-Bengali Translation Model
- This model is finetuned on `Helsinki-NLP/opus-mt-en-inc` for English to Bangla Translation.
+ <!-- This model card has been generated automatically according to the information the Trainer had access to. You
+ should probably proofread and complete it, then remove this comment. -->

- - **Developed by:** [shhossain](https://github.com/shhossain)
- - **Model type:** [transformer-align]
- - **Language(s) (NLP):** [English, Bengali]
- - **License:** [apache-2.0]
- - **Finetuned from model [Helsinki-NLP/opus-mt-en-inc]:** [Helsinki-NLP/opus-mt-en-inc](Helsinki-NLP/opus-mt-en-inc)
+ # opus-mt-en-inc-finetuned-en-to-bn-fp16

- ## Use with transformers
- ```python
- from transformers import pipeline
+ This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-inc](https://huggingface.co/Helsinki-NLP/opus-mt-en-inc) on the None dataset.
+ It achieves the following results on the evaluation set:
+ - Loss: 1.8782
+ - Bleu: 21.613
+ - Gen Len: 21.9459

- pipe = pipeline("translation", model="shhossain/opus-mt-en-to-bn")
- ```
+ ## Model description

- ## Use with BanglaTranslationKit
+ More information needed

- BanglaTranslationKit is an open-source translation package for offline conversion between both Bengali and English languages (English to Bangla and Bangla to English)
+ ## Intended uses & limitations

- ### Installation
+ More information needed

- ```bash
- pip install bntrans
- ```
+ ## Training and evaluation data

- ### Usage
+ More information needed

- You can use this package to translate any Bangla to English or English to Bangla.
+ ## Training procedure

- ```python
- from bntrans import Translator
+ ### Training hyperparameters

- translator = Translator(src="en", dest="bn")
- translation = translator.translate("Hello world!") # হ্যালো বিশ্ব!
- print(translation)
- ```
+ The following hyperparameters were used during training:
+ - learning_rate: 1e-05
+ - train_batch_size: 16
+ - eval_batch_size: 16
+ - seed: 42
+ - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
+ - lr_scheduler_type: linear
+ - num_epochs: 1

- More info on [BanglaTranslationKit](https://github.com/shhossain/BanglaTranslationKit)
+ ### Training results

+ | Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
+ |:-------------:|:-----:|:------:|:---------------:|:------:|:-------:|
+ | 1.9652 | 1.0 | 171895 | 1.8782 | 21.613 | 21.9459 |
+
+
+ ### Framework versions
+
+ - Transformers 4.32.1
+ - Pytorch 2.0.1+cu118
+ - Datasets 2.14.4
+ - Tokenizers 0.13.3
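
The new model card reports 171895 optimizer steps for one epoch at `train_batch_size: 16` but leaves the training data as "More information needed". As a rough sanity check, the step count bounds the number of training examples. This is only a sketch: it assumes a single device, no gradient accumulation, and that the last partial batch was kept, none of which the card states. The function name `example_count_bounds` is made up for illustration.

```python
# Values copied from the "Training results" row and hyperparameter list.
STEPS_PER_EPOCH = 171895
TRAIN_BATCH_SIZE = 16


def example_count_bounds(steps: int, batch_size: int) -> tuple[int, int]:
    """Bound the dataset size implied by `steps` batches in one epoch.

    Assumes (not confirmed by the card) one device, no gradient
    accumulation, and a final batch holding 1..batch_size examples.
    """
    smallest = (steps - 1) * batch_size + 1  # last batch holds one example
    largest = steps * batch_size             # last batch is full
    return smallest, largest


low, high = example_count_bounds(STEPS_PER_EPOCH, TRAIN_BATCH_SIZE)
print(f"between {low:,} and {high:,} training examples")
```

If gradient accumulation or several devices were used, the true count would be a multiple of this estimate.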
config.json CHANGED
@@ -1,5 +1,5 @@
  {
- "_name_or_path": "/content/opus-mt-en-inc-finetuned-en-to-bn/checkpoint-22000",
+ "_name_or_path": "Helsinki-NLP/opus-mt-en-inc",
  "activation_dropout": 0.0,
  "activation_function": "swish",
  "add_bias_logits": false,
@@ -55,7 +55,7 @@
  "share_encoder_decoder_embeddings": true,
  "static_position_embeddings": true,
  "torch_dtype": "float32",
- "transformers_version": "4.32.0",
+ "transformers_version": "4.32.1",
  "use_cache": true,
  "vocab_size": 61760
  }
generation_config.json CHANGED
@@ -12,5 +12,5 @@
  "num_beams": 4,
  "pad_token_id": 61759,
  "renormalize_logits": true,
- "transformers_version": "4.32.0"
+ "transformers_version": "4.32.1"
  }
pytorch_model.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:9285076ac677c350c66999179ede16ae2008fc73c1520aeb4d67f36b118c32a9
- size 303368453
+ oid sha256:62094b0cd201aa52496ab6db2ff2aa4eb331375c2637f4f82967388f43dde033
+ size 303372037
training_args.bin ADDED
@@ -0,0 +1,3 @@
 
 
 
 
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:0f234669df19c9814c570a9349ec1ca21462043991de4ea20e86e01b8f5fb67c
+ size 4219
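
The `pytorch_model.bin` and `training_args.bin` entries above are Git LFS pointer files, so the diff shows only a SHA-256 `oid` and a byte `size`, not the binary contents. A minimal sketch of how a downloaded file could be checked against such a pointer follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and only the Python standard library is used.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Split a Git LFS pointer file into version, hash algorithm, digest, size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path: Path, pointer: dict) -> bool:
    """Return True if the file has the size and SHA-256 digest the pointer records."""
    if path.stat().st_size != pointer["size"]:
        return False
    sha = hashlib.sha256()
    with path.open("rb") as handle:
        # Read in 1 MiB chunks so large checkpoints do not need to fit in memory.
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            sha.update(chunk)
    return sha.hexdigest() == pointer["digest"]


# New-side pointer for pytorch_model.bin, copied from the diff above.
POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:62094b0cd201aa52496ab6db2ff2aa4eb331375c2637f4f82967388f43dde033
size 303372037
"""
pointer = parse_lfs_pointer(POINTER)
```

Calling `matches_pointer(Path("pytorch_model.bin"), pointer)` on a local copy would then report whether the download corresponds to this commit; the local path is an assumption.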