Edit model card

原始模型:https://huggingface.co/SakuraLLM/Sakura-13B-Qwen2beta-v0.9

4Bit AWQ量化,未测试,不建议使用。

GroupSize=64

适用于Kaggle双卡推理。

Downloads last month
2
Safetensors
Model size
3.36B params
Tensor type
I32
·
FP16
·
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.