Edit model card

Hindi-English Translation Model

Based on the "google/mt5-small" pre-trained model. Fine-tuned it on Hindi to English dataset.

Parameters

  • number of epochs = 8
  • batch size = 16
  • learning rate = 5e-4
  • number of batches = int(np.ceil(len(dataset) / batch size))
  • n_warmup_steps = int(number of epochs * number of batches * 0.01)

Training Loss

loss.png

Examples

example_1.png example_2.png example_3.png

Downloads last month
63
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.