Asia-N's picture
update model card README.md
00afd7d
|
raw
history blame
2.34 kB
metadata
license: apache-2.0
tags:
  - generated_from_trainer
datasets:
  - news_commentary
metrics:
  - bleu
model-index:
  - name: opus-mt-ar-en-finetuned-ar-to-en
    results:
      - task:
          name: Sequence-to-sequence Language Modeling
          type: text2text-generation
        dataset:
          name: news_commentary
          type: news_commentary
          args: ar-en
        metrics:
          - name: Bleu
            type: bleu
            value: 32.7441

opus-mt-ar-en-finetuned-ar-to-en

This model is a fine-tuned version of Helsinki-NLP/opus-mt-ar-en on the news_commentary dataset. It achieves the following results on the evaluation set:

  • Loss: 8.4400
  • Bleu: 32.7441
  • Gen Len: 57.02

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 2e-09
  • train_batch_size: 16
  • eval_batch_size: 16
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 10

Training results

Training Loss Epoch Step Validation Loss Bleu Gen Len
No log 1.0 4 8.4406 32.7441 57.02
No log 2.0 8 8.4405 32.7441 57.02
No log 3.0 12 8.4403 32.7441 57.02
No log 4.0 16 8.4402 32.7441 57.02
No log 5.0 20 8.4401 32.7441 57.02
No log 6.0 24 8.4400 32.7441 57.02
No log 7.0 28 8.4400 32.7441 57.02
No log 8.0 32 8.4400 32.7441 57.02
No log 9.0 36 8.4400 32.7441 57.02
No log 10.0 40 8.4400 32.7441 57.02

Framework versions

  • Transformers 4.20.0
  • Pytorch 1.11.0+cu113
  • Datasets 2.3.2
  • Tokenizers 0.12.1