s-man2099/google-mt5-small-good

This model is a fine-tuned version of google/mt5-small on an unknown dataset. At the final logged epoch (epoch 7, counting from 0), it reports the following training and validation losses:

  • Train Loss: 16.1010
  • Validation Loss: 11.7026
  • Epoch: 7

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • optimizer: {'name': 'Adafactor', 'weight_decay': None, 'clipnorm': None, 'global_clipnorm': None, 'clipvalue': None, 'use_ema': False, 'ema_momentum': 0.99, 'ema_overwrite_frequency': None, 'jit_compile': True, 'is_legacy_optimizer': False, 'learning_rate': 5.999999e-13, 'beta_2_decay': -0.8, 'epsilon_1': 1e-30, 'epsilon_2': 0.001, 'clip_threshold': 1.0, 'relative_step': True}
  • training_precision: mixed_float16
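As a rough sketch of how the settings listed above could be reconstructed, the snippet below copies the logged optimizer values into a dictionary and passes them to `tf.keras.optimizers.Adafactor`. It assumes the TensorFlow 2.13 Keras API named under Framework versions; the helper name `configure_training` is hypothetical, and the original training script is not part of this card, so this is not necessarily how the model was actually trained.

```python
# Values copied from the optimizer entry above. 'name' and
# 'is_legacy_optimizer' are left out: they are treated here as
# serialization metadata rather than tuning settings (an assumption).
ADAFACTOR_KWARGS = {
    "learning_rate": 5.999999e-13,
    "beta_2_decay": -0.8,
    "epsilon_1": 1e-30,
    "epsilon_2": 0.001,
    "clip_threshold": 1.0,
    "relative_step": True,
    "weight_decay": None,
    "clipnorm": None,
    "global_clipnorm": None,
    "clipvalue": None,
    "use_ema": False,
    "ema_momentum": 0.99,
    "ema_overwrite_frequency": None,
    "jit_compile": True,
}


def configure_training():
    """Set the logged precision policy and return a matching optimizer.

    Hypothetical helper. TensorFlow is imported lazily so the
    dictionary above can be inspected without it installed.
    """
    import tensorflow as tf

    # training_precision: mixed_float16
    tf.keras.mixed_precision.set_global_policy("mixed_float16")
    return tf.keras.optimizers.Adafactor(**ADAFACTOR_KWARGS)
```

Note that the logged learning_rate is on the order of 1e-13. The card does not say whether this is an initial value or a value captured after a decay schedule had run.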

Training results

Train Loss   Validation Loss   Epoch
20.3587      12.0966           0
16.3140      11.7366           1
16.2230      11.7036           2
15.8933      11.7026           3
15.9325      11.7026           4
16.2546      11.7026           5
16.2823      11.7026           6
16.1010      11.7026           7
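To make the trend in these numbers easier to read, here is a small standard-library sketch that computes the epoch-to-epoch change in validation loss and finds the first epoch from which it stays flat. The function names and the tolerance `tol` are illustrative choices, not part of the original training setup.

```python
# (epoch, train_loss, validation_loss) copied from the results above.
HISTORY = [
    (0, 20.3587, 12.0966),
    (1, 16.3140, 11.7366),
    (2, 16.2230, 11.7036),
    (3, 15.8933, 11.7026),
    (4, 15.9325, 11.7026),
    (5, 16.2546, 11.7026),
    (6, 16.2823, 11.7026),
    (7, 16.1010, 11.7026),
]


def validation_deltas(history):
    """Return (epoch, change in validation loss vs. the previous epoch)."""
    return [
        (cur[0], round(cur[2] - prev[2], 4))
        for prev, cur in zip(history, history[1:])
    ]


def first_plateau_epoch(history, tol=1e-4):
    """Return the first epoch from which validation loss varies by at most tol."""
    for i in range(len(history)):
        tail = [row[2] for row in history[i:]]
        if max(tail) - min(tail) <= tol:
            return history[i][0]
    return None


if __name__ == "__main__":
    for epoch, delta in validation_deltas(HISTORY):
        print(f"epoch {epoch}: validation loss change {delta:+.4f}")
    print("plateau starts at epoch", first_plateau_epoch(HISTORY))
```

On the logged values, validation loss stops changing at epoch 3 (to four decimal places), while training loss keeps fluctuating between roughly 15.9 and 16.3.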

Framework versions

  • Transformers 4.34.1
  • TensorFlow 2.13.0
  • Datasets 2.14.5
  • Tokenizers 0.14.1