An LLM trained from scratch on Bulgarian data.
Both the model and its tokenizer were trained from scratch on Bulgarian text from the chitanka dataset.
Metrics
Perplexity - 6.75
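
The card does not say how the perplexity figure was computed. As a rough sketch, assuming the common definition (the exponential of the mean per-token negative log-likelihood, measured in nats), the calculation could look like the following. The function name `perplexity_from_nll` and the sample losses are hypothetical and are not taken from this model's evaluation.

```python
import math


def perplexity_from_nll(token_nlls):
    """Return exp(mean negative log-likelihood), with losses given in nats."""
    if not token_nlls:
        raise ValueError("need at least one per-token loss")
    return math.exp(sum(token_nlls) / len(token_nlls))


# Hypothetical per-token losses, for illustration only.
example_nlls = [1.2, 2.5, 1.9, 2.0]
ppl = perplexity_from_nll(example_nlls)
print(f"perplexity: {ppl:.2f}")
```

Under this definition, a perplexity of 6.75 would correspond to a mean loss of about ln(6.75) ≈ 1.91 nats per token.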
Dataset used to train
mor40/BulBERT-chitanka-model