Slavic T5 Base

Aim of this model is to reach the best results for the Slavic laguages with Latin script.

It is suitable for tasks such as:

The model is trained on the selected parts of OSCAR corpus and MaCoCu corpus.

It supports this languages: Czech, Croatian, Polish , Slovak, Slovenian,

Vocabulary has 120 000 tokens, contains capital letters.

Safetensors

Model size

383M params

Tensor type

F32

Inference Providers NEW

This model is not currently available via any of the supported Inference Providers.

TUKE-KEMT
/

slavic-t5-base