Text Generation
Transformers
llama
Inference Endpoints
text-generation-inference
WizardCoder-15B-V1.1-3bit / added_tokens.json
BurnThePage's picture
Add quantized model & quantize_config
6dd282b
{
"[PAD]": 32000
}