can this model run with `ollama` with `pure cpu` model?
#7 opened 2 days ago
by
ice6
Add `quantization_config` in config.json?
4
#4 opened 6 days ago
by
WeiwenXia
运行channel INT8后sglang报错OOM
1
#3 opened 8 days ago
by
zhangneilc