majentik/gemma-4-E2B-it-RotorQuant-MLX-4bit

Image-Text-to-Text

kv-cache-quantization

4-bit precision

gemma-4-E2B-it-RotorQuant-MLX-4bit / generation_config.json

Add MLX quantized model with KV cache compression

70f4fd0 verified about 1 month ago

208 Bytes

{
  "bos_token_id": 2,
  "do_sample": true,
  "eos_token_id": [
    1,
    106,
    50
  ],
  "pad_token_id": 0,
  "temperature": 1.0,
  "top_k": 64,
  "top_p": 0.95,
  "transformers_version": "5.5.0.dev0"
}
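
The sampling fields above (`temperature`, `top_k`, `top_p`) and the `eos_token_id` list can be illustrated with a minimal Python sketch. It uses only the standard library and a toy four-token distribution. The helper `filter_probs` and the toy logits are hypothetical names introduced for illustration. The order assumed here (temperature, then top-k, then top-p) is a common convention, not a statement of how any particular runtime applies this file.

```python
import json
import math

# Settings copied from generation_config.json, inlined so the sketch is self-contained.
CONFIG_TEXT = """
{
  "bos_token_id": 2,
  "do_sample": true,
  "eos_token_id": [1, 106, 50],
  "pad_token_id": 0,
  "temperature": 1.0,
  "top_k": 64,
  "top_p": 0.95,
  "transformers_version": "5.5.0.dev0"
}
"""


def filter_probs(logits, temperature, top_k, top_p):
    """Hypothetical helper: return {token_id: prob} after temperature,
    top-k, and top-p (nucleus) filtering, applied in that assumed order."""
    # Temperature scaling, then a numerically stable softmax numerator.
    scaled = [x / temperature for x in logits]
    peak = max(scaled)
    weights = [math.exp(x - peak) for x in scaled]

    # Top-k: keep at most the k highest-weight token ids.
    ranked = sorted(range(len(weights)), key=lambda i: weights[i], reverse=True)
    ranked = ranked[:top_k]
    total = sum(weights[i] for i in ranked)

    # Top-p: keep the smallest prefix whose cumulative probability reaches top_p.
    kept, cumulative = [], 0.0
    for i in ranked:
        kept.append(i)
        cumulative += weights[i] / total
        if cumulative >= top_p:
            break

    # Renormalize over the surviving tokens.
    kept_total = sum(weights[i] for i in kept)
    return {i: weights[i] / kept_total for i in kept}


config = json.loads(CONFIG_TEXT)

# Any id in this set is treated as end-of-sequence in this sketch.
eos_ids = set(config["eos_token_id"])

# Toy distribution over four token ids (an assumption, not model output).
toy_logits = [math.log(p) for p in (0.6, 0.3, 0.08, 0.02)]
candidates = filter_probs(
    toy_logits, config["temperature"], config["top_k"], config["top_p"]
)
```

With `do_sample` set to true, a sampler would draw the next token from `candidates` (for example with `random.choices`) and stop once the drawn id is in `eos_ids`. With a vocabulary this small, `top_k` of 64 removes nothing, so only the `top_p` cutoff trims the tail.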