This model is a fine-tuned version of mistralai/Mistral-7B-Instruct-v0.3 on the GaetanMichelet/chat-60_ft_task-2 dataset. It achieves the following results on the evaluation set:
The following training and validation losses were logged during training:
| Training Loss | Epoch | Step | Validation Loss |
|---|---|---|---|
| 1.1617 | 0.8696 | 5 | 1.1279 |
| 1.0666 | 1.9130 | 11 | 1.0037 |
| 0.9806 | 2.9565 | 17 | 0.9287 |
| 0.8359 | 4.0 | 23 | 0.8394 |
| 0.7617 | 4.8696 | 28 | 0.8171 |
| 0.7312 | 5.9130 | 34 | 0.8051 |
| 0.6691 | 6.9565 | 40 | 0.8020 |
| 0.64 | 8.0 | 46 | 0.8045 |
| 0.5832 | 8.8696 | 51 | 0.8196 |
| 0.5397 | 9.9130 | 57 | 0.8470 |
| 0.439 | 10.9565 | 63 | 0.8771 |
| 0.3596 | 12.0 | 69 | 0.8885 |
| 0.3268 | 12.8696 | 74 | 0.9616 |
| 0.2584 | 13.9130 | 80 | 1.0827 |
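
The table above shows training loss falling steadily while validation loss bottoms out and then rises, which is the usual sign of overfitting. As a minimal sketch (not part of the original training code), the snippet below copies the rows from the table and picks the step with the lowest validation loss. The names `ROWS`, `best_checkpoint`, and `generalization_gap` are hypothetical helpers introduced only for illustration.

```python
# Rows copied from the table above:
# (training_loss, epoch, step, validation_loss)
ROWS = [
    (1.1617, 0.8696, 5, 1.1279),
    (1.0666, 1.9130, 11, 1.0037),
    (0.9806, 2.9565, 17, 0.9287),
    (0.8359, 4.0, 23, 0.8394),
    (0.7617, 4.8696, 28, 0.8171),
    (0.7312, 5.9130, 34, 0.8051),
    (0.6691, 6.9565, 40, 0.8020),
    (0.64, 8.0, 46, 0.8045),
    (0.5832, 8.8696, 51, 0.8196),
    (0.5397, 9.9130, 57, 0.8470),
    (0.439, 10.9565, 63, 0.8771),
    (0.3596, 12.0, 69, 0.8885),
    (0.3268, 12.8696, 74, 0.9616),
    (0.2584, 13.9130, 80, 1.0827),
]


def best_checkpoint(rows):
    """Return the logged row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


def generalization_gap(rows):
    """Return (step, validation_loss - training_loss) for each logged row."""
    return [(step, round(val - train, 4)) for train, _, step, val in rows]


best = best_checkpoint(ROWS)
print(f"lowest validation loss {best[3]:.4f} at step {best[2]} (epoch {best[1]:.4f})")

for step, gap in generalization_gap(ROWS):
    print(f"step {step:>2}: gap {gap:+.4f}")
```

On these numbers the minimum falls at step 40 (validation loss 0.8020). After that point the gap between validation and training loss widens at every logged step, so a checkpoint near step 40 is likely a better choice than the final one. The card does not say which checkpoint was actually published.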
Base model: mistralai/Mistral-7B-v0.3