--- license: mit datasets: - mor40/chitanka_raw_document language: - bg metrics: - perplexity library_name: transformers pipeline_tag: fill-mask --- # Model Card for Model ID A LLM trained from scratch on bulgarian data. The model and the model's tokenizer are trained _from scratch_ on bulgarian data from the **chitanka** dataset. #### Metrics Perprelixty - 6.75