Request for FP16/BF16 Weights for vLLM Deployment

#1
by ShunXee - opened

Hi, thank you for your great work on this model!

Would it be possible for you to also release a version in Hugging Face (non-GGUF) format that is compatible with vLLM? It would make deployment for high-throughput and production use much easier.

Thanks in advance!

Sign up or log in to comment