Tajik Language Models
Collection
17 items
•
Updated
This model is a fine-tuned version of gpt2 on the None dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
7.1107 | 1.0 | 2405 | 6.9547 |
6.7012 | 2.0 | 4810 | 6.6086 |
6.5467 | 3.0 | 7215 | 6.5076 |