The official run_t5_mlm_flax.py is copied into the repository and is run using the hyperparameters as defined in run_t5.sh.
Training loss can be seen directly on the model card. The full training runs in finished in ca. 4 hours and 30 minutes.
Select AutoNLP in the “Train” menu to fine-tune this model automatically.
- Downloads last month