GGUF and llama.cpp
#2
by
skruse
- opened
I tried to test this model with llama.cpp and the quants from "DevQuasar/VAGOsolutions.SauerkrautLM-v2-14b-DPO-GGUF" and it only output random timestamps.
I tried with and without flash_attention, both times similiar result.
skruse
changed discussion status to
closed