Duplicated from cyankiwi/Devstral-Small-2507-AWQ-4bit

btbtyler09
/

Devstral-Small-2507-AWQ

Text Generation

4-bit precision

Model card Files Files and versions

Devstral-Small-2507-AWQ / params.json

btbtyler09's picture

Duplicate from cpatonn/Devstral-Small-2507-AWQ

7a929e9 verified 10 months ago

history blame contribute delete

188 Bytes

	{
	"dim": 5120,
	"n_layers": 40,
	"head_dim": 128,
	"hidden_dim": 32768,
	"n_heads": 32,
	"n_kv_heads": 8,
	"rope_theta": 1000000000.0,
	"norm_eps": 1e-05,
	"vocab_size": 131072
	}