Is the KV cache of these models unusually high?
#6 opened about 4 hours ago by Hugsanir
prompt eval too slow
2
#4 opened about 1 month ago by lfjmgs
Can you guys share the size & perplexity tables? Thanks
1
#3 opened about 1 month ago by habout632
About q4_k and q5_k
1
#2 opened about 2 months ago by stduhpf
Cannot load model due to invalid format
2
#1 opened 2 months ago by ABX-AI