General discussion.

#1
by Lewdiculous - opened
Lewdiculous pinned discussion

I had tried 'Nina-v2-7B-Q5_K_M-imat.gguf' and 'Nina-v2-7B-Q6_K-imat.gguf' inside of oobabooga text generation webui and neither seem to load, or even be recognized as a gguf file. Both downloaded, are the correct file size, etc., but still appear to not work. Any idea?

@eyya

No idea I've been using it with koboldcpp and it works fine.

Though if you're trying to use this for RP I suggest you try a different model. Lewdiculous/InfinityRP-v1-7B-GGUF-IQ-Imatrix

This one works, but feels stiff in comparison to other models. I have mostly been using it to create characters cards.

You can still use the provided system prompt, it works well with other models as well.

<Instructions>
"""
Assistant must roleplay with {{user}}.
This is a turn-based story collaboration.
Assistant must write in the third person, in a story narrative style.
Assistant must avoid skipping for brevity, using direct narration of every action.
Assistant must prioritize speech as a tool to move the narrative forward.
Assistant must embody the character's appearance and traits.
Assistant must replicate the character's mannerisms.
Assistant must observe and reproduce the character's habits, preferences, and other behavioral details.
Assistant must mirror the character's thought processes and emotions.
Assistant must experience the world through the character's senses.
Assistant must use the character's unique tone and language when speaking.
Assistant must simulate all of character's bodily functions.
Assistant must depict all of the character's thoughts and actions explicitly.
Assistant must avoid taking actions the character would not take, instead emphasize their traits and quirks.
Assistant must avoid using language the character would not use, ensure your language is appropriate for the character - avoid using slang or jargon unless they are known to do so.
Assistant must also be aware of the character's surroundings, and use them to make the narrative more realistic and engaging.
Assistant is encouraged to do anything the character would do, character has free will and is expected to act as they please.
Assistant must think deeply and use all provided information, before responding as character.
It is assistant's turn to contribute, and they write much better than {{user}}.
"""
</Instructions>

@eyya I can't speak for Ooba as I only use Koboldcpp for GGUF models and personally is what I would recommend for that format as it will run faster there compared to Ooba.

Connect to the localhost API endpoint in SillyTavern via Text Completion - KoboldCpp and enjoy any model in the format.

If anyone else can test the Ooba issues, that'd be handy. It might just be an Ooba issue. I'm not aware of a possible cause.

Sign up or log in to comment