Text Generation
Transformers
Safetensors
English
llama
text-generation-inference
4-bit precision
awq
New discussion

200k -> 4k

3
#2 opened 7 months ago by ssaroya

Will it run on a 4090?

1
#1 opened 7 months ago by brifl