Text Generation
Transformers
PyTorch
Chinese
llama
text-generation-inference
Inference Endpoints
4-bit precision
gptq