Text Generation
Transformers
llama
Inference Endpoints
text-generation-inference
WizardCoder-15B-V1.1-3bit / tokenizer.json
BurnThePage's picture
Add quantized model & quantize_config
6dd282b
File too large to display, you can check the raw version instead.