Csermely (GGUF)

GGUF quantized versions of Csermely, a 138M-parameter Hungarian language model. Part of the Emese model family.

Compatible with llama.cpp, Ollama, LM Studio, and other GGUF-compatible runtimes.

For the full-precision HuggingFace version, see emese-tech/csermely.

Available Quantizations

| File | Quantization | Size | Description |
|------|--------------|------|-------------|
| csermely-q8_0.gguf | Q8_0 | 173 MB | 8-bit, near-lossless quality |
| csermely-q4_k_m.gguf | Q4_K_M | 103 MB | 4-bit, good quality/size balance |
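
As a rough sanity check on the sizes listed above, the following sketch estimates how many bits each file stores per parameter. It assumes 1 MB = 10^6 bytes and uses the 137.8M parameter count from Model Details; the helper name `bits_per_parameter` is illustrative only. The results are averages over the whole file, so they likely sit above the nominal 8-bit and 4-bit widths because a GGUF file also carries metadata, the tokenizer, and per-block scale factors.

```python
def bits_per_parameter(file_size_mb: float, n_params_millions: float) -> float:
    """Average bits stored per parameter, assuming 1 MB = 10**6 bytes."""
    total_bits = file_size_mb * 1e6 * 8
    return total_bits / (n_params_millions * 1e6)


# File sizes as listed for each quantization; 137.8 is the parameter count in millions.
SIZES_MB = {"csermely-q8_0.gguf": 173, "csermely-q4_k_m.gguf": 103}

for name, size_mb in SIZES_MB.items():
    print(f"{name}: {bits_per_parameter(size_mb, 137.8):.1f} bits/parameter")
```

With the listed sizes this works out to roughly 10 bits per parameter for Q8_0 and roughly 6 for Q4_K_M.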

Usage

llama.cpp

./llama-cli -m csermely-q8_0.gguf -p "A magyar nyelv" -n 100 --repeat-penalty 1.2 --chat-template none
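
To call the same command from a script, a minimal Python sketch could wrap it with `subprocess`. It reuses only the flags shown in the command above; the function names `build_llama_cli_command` and `run_llama_cli` are hypothetical, and the sketch assumes the `llama-cli` binary and the model file are in the current directory.

```python
import subprocess


def build_llama_cli_command(model_path, prompt, n_predict=100,
                            repeat_penalty=1.2, binary="./llama-cli"):
    """Build the argument list for the llama-cli call shown in this README."""
    return [
        binary,
        "-m", model_path,                         # GGUF file to load
        "-p", prompt,                             # text to continue
        "-n", str(n_predict),                     # number of tokens to generate
        "--repeat-penalty", str(repeat_penalty),
        "--chat-template", "none",                # flag copied from the README command
    ]


def run_llama_cli(model_path, prompt, **kwargs):
    """Run llama-cli and return its stdout; raises CalledProcessError on failure."""
    cmd = build_llama_cli_command(model_path, prompt, **kwargs)
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return result.stdout
```

For example, `run_llama_cli("csermely-q8_0.gguf", "A magyar nyelv")` would return whatever the binary writes to stdout. Depending on the llama.cpp version, that output may include the prompt itself.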

Ollama

ollama run emese-tech/csermely-gguf
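
Once the model has been pulled, it can also be queried through Ollama's local REST API. The sketch below assumes the Ollama server is running on its default port (11434) and that the model is available under the same name as in the command above. The function names are hypothetical, and `raw` is set on the assumption that prompts should bypass any chat template, mirroring the llama.cpp example.

```python
import json
import urllib.request

OLLAMA_URL = "http://localhost:11434/api/generate"  # default local Ollama endpoint


def build_payload(prompt, model="emese-tech/csermely-gguf",
                  num_predict=100, repeat_penalty=1.2):
    """Build the JSON body for a non-streaming /api/generate request."""
    return {
        "model": model,
        "prompt": prompt,
        "raw": True,       # assumption: send the prompt without a template
        "stream": False,   # return a single JSON object instead of a stream
        "options": {
            "num_predict": num_predict,
            "repeat_penalty": repeat_penalty,
        },
    }


def ollama_generate(prompt, **kwargs):
    """Send the prompt to the local Ollama server and return the generated text."""
    data = json.dumps(build_payload(prompt, **kwargs)).encode("utf-8")
    request = urllib.request.Request(
        OLLAMA_URL, data=data, headers={"Content-Type": "application/json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.loads(response.read())["response"]
```

Calling `ollama_generate("A magyar nyelv")` would then return the model's continuation as a string.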

Model Details

Parameters: 137.8M
Architecture: LLaMA-style (decoder-only transformer)
Context length: 8,192 tokens (YaRN RoPE)
Vocabulary: 32,000 (SentencePiece Unigram, Hungarian)
License: MIT