LegalLMs
Collection
XLM-RoBERTa models with continued pretraining on the MultiLegalPile
•
37 items
•
Updated
•
3
This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
0.7902 | 2.02 | 50000 | 0.6112 |
0.7387 | 4.03 | 100000 | 0.5347 |
0.6488 | 6.05 | 150000 | 0.5026 |
0.6702 | 8.07 | 200000 | 0.4841 |