Text Generation
Transformers
English
llama
text-generation-inference
Inference Endpoints

Commit History

ExLLaMA V2 quant of TinyLlama-1.1B-Chat-v0.3-3.0bpw-h6-exl2
acf2339

LoneStriker commited on