LLMNick
/

llama3-rgm-quantization

Text Generation

text-generation-inference

Inference Endpoints

4-bit precision

Model card Files Files and versions Community

No model card

New: Create and edit this model card directly on the website!

Contribute a Model Card

Downloads last month: 3

Safetensors

Model size

4.65B params

Tensor type

F32

·

FP16

·

U8

·