Qwenstral-Small-3.1-0.5B

Qwen/Qwen2.5-0.5B with its vocabulary replaced by that of mistralai/Mistral-Small-3.1-24B-Instruct-2503 / mistralai/Mistral-Small-24B-Instruct-2501, transplanted using transplant-vocab.

It can be used directly as a draft model for Mistral-Small, but a variant finetuned on Mistral's outputs performs better.
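As a rough illustration of the draft-model use case, the sketch below passes this model to the `assistant_model` argument of `generate` in Hugging Face Transformers (assisted generation). It assumes the text-only target mistralai/Mistral-Small-24B-Instruct-2501 loads through `AutoModelForCausalLM`, that `torch`, `transformers`, and `accelerate` are installed, and that enough GPU memory is available for the 24B target. The function name `generate_with_draft` and the constants are illustrative, not part of this repository.

```python
# Sketch under stated assumptions: the repo IDs come from this model card,
# everything else (function name, defaults) is hypothetical.
TARGET_ID = "mistralai/Mistral-Small-24B-Instruct-2501"
DRAFT_ID = "alamios/Qwenstral-Small-3.1-0.5B"


def generate_with_draft(prompt: str, max_new_tokens: int = 64) -> str:
    """Greedy generation from the target model, with this model proposing draft tokens."""
    # Imported lazily so that defining the function does not require the libraries.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    # The draft shares the target's vocabulary, so one tokenizer serves both models.
    tokenizer = AutoTokenizer.from_pretrained(TARGET_ID)
    target = AutoModelForCausalLM.from_pretrained(
        TARGET_ID, torch_dtype=torch.bfloat16, device_map="auto"
    )
    draft = AutoModelForCausalLM.from_pretrained(
        DRAFT_ID, torch_dtype=torch.bfloat16
    ).to(target.device)

    inputs = tokenizer(prompt, return_tensors="pt").to(target.device)
    # assistant_model switches generate() to assisted (speculative) decoding:
    # the draft proposes tokens and the target verifies them.
    output_ids = target.generate(
        **inputs,
        assistant_model=draft,
        max_new_tokens=max_new_tokens,
        do_sample=False,
    )
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `generate_with_draft("Explain speculative decoding in one sentence.")` downloads both models on first use. Because the target verifies every drafted token, greedy output should match what the target would produce alone; only the speed changes, and the gain depends on how often the draft's proposals are accepted.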

Model size: 593M params (Safetensors)
Tensor type: BF16

Model tree for alamios/Qwenstral-Small-3.1-0.5B

Base model: Qwen/Qwen2.5-0.5B (this model is one of its 165 finetunes)
Finetunes of this model: 2 models
Quantizations of this model: 1 model
