General discussion.

by Lewdiculous - opened Mar 5, 2024

Owner Mar 5, 2024

edited Mar 5, 2024

    quantization_options = [
        "Q4_K_M", "Q4_K_S", "IQ4_NL", "IQ4_XS", "Q5_K_M", 
        "Q5_K_S", "Q6_K", "Q8_0", "IQ3_M", "IQ3_S", "IQ3_XS", "IQ3_XXS"
    ]
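A minimal sketch of how a list like this might be consumed, assuming one output file per quantization type. The `planned_outputs` helper, the `quants` directory, and the `<model>-<quant>.gguf` naming pattern are hypothetical and only illustrate the idea; they are not taken from the actual quantization script.

```python
from pathlib import Path

# Same options as listed above.
quantization_options = [
    "Q4_K_M", "Q4_K_S", "IQ4_NL", "IQ4_XS", "Q5_K_M",
    "Q5_K_S", "Q6_K", "Q8_0", "IQ3_M", "IQ3_S", "IQ3_XS", "IQ3_XXS",
]


def planned_outputs(model_name, out_dir="quants"):
    """Map each quant type to a hypothetical output path (assumed naming scheme)."""
    return {
        quant: Path(out_dir) / f"{model_name}-{quant}.gguf"
        for quant in quantization_options
    }


if __name__ == "__main__":
    # Print the planned file for every quantization type.
    for quant, path in planned_outputs("example-model").items():
        print(f"{quant:8} -> {path}")
```

Keeping the options in a single list means adding or dropping a quantization type is a one-line change, and the rest of the pipeline picks it up.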

Lewdiculous pinned discussion Mar 5, 2024

Morktastic

Mar 8, 2024

This model is weighted MUCH too heavily towards constantly saying things like "being comfortable and ensuring an enjoyable experience for all parties involved" whenever the user mentions anything suggestive at all.

Lewdiculous

Owner Mar 8, 2024

@jeiku @Test157t - I'm assuming an attempt was made to remove this kind of "refusal" or emphasis through un-alignment?

jeiku

Mar 8, 2024

I mean, I can pass it through Toxic DPO if you think that would help, but I have not experienced this issue when using a well-made card and giving direct orders. Let me know if you'd like me to make you a custom DPO.

Nitral-AI

Mar 8, 2024

Can also attest I've only seen refusals on a completely blank card in ChatML, since it falls back to early-onset assistant-style data.

Lewdiculous

Owner Mar 8, 2024

edited Mar 8, 2024

@jeiku Yeah, I think character cards play a role here. But if pushing a new DPO version wouldn't be too much of a hassle, go ahead and we can see. For science, @Morktastic, could you share your character card details, or just general information?

jeiku

Mar 8, 2024

@Morktastic This is the Toxic DPO model: https://huggingface.co/ResplendentAI/Datura_7B

Lewdiculous

Owner Mar 8, 2024

edited Mar 8, 2024

@jeiku @Morktastic

Experimental quant made with the slightly modified data that includes the RP examples:

https://huggingface.co/Lewdiculous/Datura_7B-GGUF-Imatrix
