Text Generation
Transformers
Safetensors
Chinese
English
llama
text-generation-inference
4-bit precision