Suparious vaclavkosar commited on
Commit
7c71b15
1 Parent(s): a3eb4c1

Add missing quant_config.json for compatibility with vLLM backends out of the box. (#1)

Browse files

- Add missing quant_config.json for compatibility with vLLM backends out of the box. (aa2a3bfa23cced5784ce861bac33972e542ceed9)


Co-authored-by: Vaclav Kosar <vaclavkosar@users.noreply.huggingface.co>

Files changed (1) hide show
  1. quant_config.json +6 -0
quant_config.json ADDED
@@ -0,0 +1,6 @@
 
 
 
 
 
 
 
1
+ {
2
+ "zero_point": true,
3
+ "q_group_size": 128,
4
+ "w_bit": 4,
5
+ "version": "GEMM"
6
+ }