Text Generation
Transformers
English
llama
Inference Endpoints
text-generation-inference
LoneStriker's picture
ExLLaMA V2 quant of TinyLlama-1.1B-Chat-v0.3-3.0bpw-h6-exl2
acf2339
{
"<|im_end|>": 32002,
"<|im_start|>": 32001,
"[PAD]": 32000
}