请问这个4bit模型支持vllm启动吗?
#11 opened 5 months ago
by
SongXiaoMao
AWQ quantised?
#8 opened 7 months ago
by
epignatelli
Possible to do inference on long contexts with limited VRAM?
1
#6 opened 8 months ago
by
danabo
Excellant model, fine tuning resources
#5 opened 8 months ago
by
ewre324
Running on 3x24 GB RAM?
3
#3 opened 8 months ago
by
Marcophono