Text Generation
Transformers
Safetensors
llama
conversational
Inference Endpoints
text-generation-inference

Can't wait for Exl2 or GGUF

#2
by Novelo - opened

I'll love to be able to run this bad boy locally, without waiting a minute for every letter to pop :(

Love bagel, that is YOUR bagel, breaded... not as much :D

I found one here: https://huggingface.co/mradermacher/bagel-dpo-34b-v0.5-GGUF. It runs even better than 0.2, and I'm really enjoying your new model. It's very verbose when prompted to be, but sometimes it mixes context when the prompt describes more than one actor. Perhaps I still need to adjust the settings a bit; I don't think it's the model's fault.

Furthermore, I've noticed it output very intriguing narratives from shorter examples, presenting unique ways of expressing the same thing that flow much better in a given story excerpt. I'm really rocking it, I hope it only gets better.

@Undi95 , I encourage you, if I'm not overstepping, to consider this model for mixing with one of yours, such as Noromaid, not just for role-playing but also for its potential in short story telling.

I am working on AWQ now

Sign up or log in to comment