metadata
license: apache-2.0
tags:
  - generated_from_trainer
datasets:
  - aslg_pc12
metrics:
  - bertscore
  - bleu
  - comet
  - rouge
base_model: t5-small
pipeline_tag: translation
model-index:
  - name: t5_small_aslg_pc12
    results:
      - task:
          type: translation
          name: Translation
        dataset:
          name: aslg_pc12
          type: aslg_pc12
          config: default
          split: train
        metrics:
          - type: bleu
            value: 73.8405
            name: BLEU
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiYzM4ODViYTVlYjVjZjUwNzI2YzM4YTYxMjBlZjIxNWI2YjNmM2RkOWU1NGU3NTZlYWYxNDU3YjRlNzFmNWQ4MCIsInZlcnNpb24iOjF9.KNo-oNa4YBfVvNzs7-x5b2-J1MThZX9lgztxklJVR7uwrRMvNnJb32mThwK_4Ge_WqPcy-zFHEeF6mCKZ-QWCA
          - type: loss
            value: 0.2336091846227646
            name: loss
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiNGZlMDZlMzYxNWUzNjk0ZmM4MDg0YTc1YjUyYjcyMTJmMTQxNmVlOTAxZGU3MTY1M2FjZDBhMmIwYzQwMmIwMyIsInZlcnNpb24iOjF9.PEWz-fUp1QjRztcRLHhmInmEGbTefHq-6a9M4HUh7Krdd1Ih8aoWoMdZE8-CCKy_zS6vhZFLUbWocaJw8TH0BA
          - type: gen_len
            value: 15.4908
            name: gen_len
            verified: true
            verifyToken: >-
              eyJhbGciOiJFZERTQSIsInR5cCI6IkpXVCJ9.eyJoYXNoIjoiMzk3MmNkNDUxOGRjNmQxZDhjNGM4N2Y0NjFhMWQyOTViMjU3NzRiMTJiMzAwZjFkZjkxMTg0YzY4MTZkNjBjZiIsInZlcnNpb24iOjF9.QIx8UAWOLibfiqNhWP3e4m69rMOzrGhk4iRH2rdwN8NEFUGDJnHrnruhD6qU7doc7W770GCFOo0ZxUV01V7xDQ
train-eval-index:
  - config: default
    task: translation
    task_id: translation
    splits:
      eval_split: train
    col_mapping:
      gloss: source
      text: target

t5_small_aslg_pc12

This model is a fine-tuned version of t5-small on the aslg_pc12 dataset.
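A minimal inference sketch, assuming the checkpoint is published under the repository id `HamdanXI/t5_small_aslg_pc12` and that it expects a raw gloss string with no task prefix. Neither assumption is confirmed by this card. The helper name `translate_gloss` is hypothetical.

```python
def translate_gloss(gloss, model_id="HamdanXI/t5_small_aslg_pc12", max_new_tokens=64):
    """Translate one gloss string to text with the fine-tuned checkpoint.

    model_id is an assumed repository id; replace it with the real one.
    """
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    # Assumption: the model takes the raw gloss, with no T5-style task prefix.
    inputs = tokenizer(gloss, return_tensors="pt", truncation=True)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens)
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `translate_gloss("X-I WANT COFFEE")` (a made-up gloss) downloads the checkpoint on first use and returns the decoded translation as a string.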

Model description

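According to the card metadata, this is t5-small fine-tuned for translation (pipeline_tag: translation), with the dataset's gloss column as the source and its text column as the target. More information needed.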

Intended uses & limitations

More information needed

Training and evaluation data

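The card metadata lists the train split of aslg_pc12 (default config) as the evaluation split, with gloss mapped to source and text mapped to target. More information needed on how the training data was prepared.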
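The `col_mapping` entry in the card metadata (`gloss: source`, `text: target`) can be read as a rule for turning each dataset row into a translation pair. The sketch below illustrates that reading; the helper `to_pair` and the example row are hypothetical and not taken from the dataset.

```python
# Column mapping copied from the card metadata: dataset column -> role.
COL_MAPPING = {"gloss": "source", "text": "target"}


def to_pair(row, col_mapping=COL_MAPPING):
    """Return (source, target) for one dataset row according to col_mapping."""
    by_role = {role: row[column] for column, role in col_mapping.items()}
    return by_role["source"], by_role["target"]


# Made-up row in the assumed aslg_pc12 schema, for illustration only.
example_row = {"gloss": "X-I WANT COFFEE", "text": "i want coffee"}
source, target = to_pair(example_row)
```

With the `datasets` library, the same helper could be applied to rows from `load_dataset("aslg_pc12", split="train")`, assuming the hub dataset exposes these two columns.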

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 5e-05
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • num_epochs: 3.0

Training results

Framework versions

  • Transformers 4.34.0
  • PyTorch 2.0.1+cu118
  • Datasets 2.14.5
  • Tokenizers 0.14.1