metadata
license: mit
datasets:
- mor40/chitanka_raw_document
language:
- bg
metrics:
- perplexity
library_name: transformers
pipeline_tag: fill-mask
Model Card for Model ID
A LLM trained from scratch on bulgarian data. The model and the model's tokenizer are trained from scratch on bulgarian data from the chitanka dataset.
Metrics
Perprelixty - 6.75