Text Generation
Transformers
English
llama
Inference Endpoints
text-generation-inference
TinyLlama-1.1B-Chat-v0.3-3.0bpw-h6-exl2 / generation_config.json
LoneStriker's picture
ExLLaMA V2 quant of TinyLlama-1.1B-Chat-v0.3-3.0bpw-h6-exl2
acf2339
{
"max_new_tokens": 32,
"transformers_version": "4.34.0.dev0"
}