Loading gguf model for inference
#6 opened 18 days ago
by
Rasi1610
![](https://cdn-avatars.huggingface.co/v1/production/uploads/noauth/l_6hbe5-036PQt0KOT4R4.jpeg)
Llama.cpp server support
3
#5 opened 29 days ago
by
vigneshR
Latest llama.cpp (b3051) complains of missing pre-tokenizer file on these quants
#4 opened 29 days ago
by
Inego
Does not work /:
10
#3 opened about 1 month ago
by
erikpro007
Can you provide the template?
6
#2 opened about 1 month ago
by
yanghan111
can you provide F16.gguf ?
5
#1 opened about 1 month ago
by
praymich