Edit model card
README.md exists but content is empty. Use the Edit model card button to edit it.
Downloads last month
723
Safetensors
Model size
1.98B params
Tensor type
I32
·
FP16
·
Inference API
Input a message to start chatting with Infinirc/Infinirc-Llama3-8B-4bit-AWQ-GEMM-Beta.
This model can be loaded on Inference API (serverless).