Text Generation
Transformers
PyTorch
Chinese
llama
Inference Endpoints
text-generation-inference
4-bit precision
gptq
q-allen's picture
Upload LlamaForCausalLM
68e40c0