Text Generation
Transformers
English
llama
Inference Endpoints
text-generation-inference
vicuna-13b-cocktail / vicuna-13b-cocktail-v1-triton-4bit-128g.safetensors

Commit History

Add triton quantized safetensors
f2afd12

reeducator committed on