Text Generation
Transformers
llama
Inference Endpoints
open_llama_3b_v2-8k-GPTQ / gptq_model-4bit--1g.safetensors

Commit History