Text Generation
Transformers
Safetensors
mixtral
conversational
text-generation-inference
4-bit precision
awq
File size: 144 Bytes
639931b
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
{
    "zero_point": true,
    "q_group_size": 128,
    "w_bit": 4,
    "version": "GEMM",
    "modules_to_not_convert": [
        "gate"
    ]
}