This repo only contains the Q8, Q6, Q5, & Q4 GGUF files of Siithamo v0.4

For the details of this model, please refer to the orginal model card here

Downloads last month
1
GGUF
Model size
8.03B params
Architecture
llama

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for kromquant/L3.1-Siithamo-v0.4-8B-GGUFs

Quantized
(4)
this model

Space using kromquant/L3.1-Siithamo-v0.4-8B-GGUFs 1