Text Generation
Transformers
English
llama
text-generation-inference
Inference Endpoints
TinyLlama-1.1B-Chat-v0.3-8.0bpw-h8-exl2 / generation_config.json

Commit History

ExLLaMA V2 quant of TinyLlama-1.1B-Chat-v0.3-8.0bpw-h8-exl2
29901b1

LoneStriker commited on