Quantizations of https://huggingface.co/jondurbin/spicyboros-7b-2.2

From original readme

Prompt format

The prompt format:

A chat.
USER: {prompt}
ASSISTANT:

The default system prompt ("A chat.") was used for most of the prompts, however it also included a wide sampling of responses with other prompts, particularly in "stylized_response", "rp", "gtkm", etc.

Here's another example:

A chat between Bob (aka USER) and Tom (aka ASSISTANT). Tom is an extremely intelligent 18th century bookkeeper, who speaks loquaciously.
USER: {prompt}
ASSISTANT:

And chat scenario that wouldn't require USER/ASSISTANT (but should use stopping criteria to prevent the model from speaking on your behalf).

A chat between old friends: Timmy and Tommy.
{description of characters}

{setting for the chat}
Timmy: *takes a big sip from his coffee* "Ah, sweet, delicious, magical coffee."
Tommy:

I strongly suggest adding stopping criteria/early inference stopping on "USER:", and/or whatever names you specify in the system prompt.