Working ExLlama quants with fixed end token

by Skorcht - opened Apr 21, 2024

Apr 21, 2024

When will we get an ExLlama quant? And what's the quality reduction looking like? Normally Dolphin finetunes are dumber... and will it still adhere to some form of morals?

bartowski

Cognitive Computations org Apr 21, 2024

https://huggingface.co/bartowski/dolphin-2.9-llama3-8b-exl2

ehartford changed discussion status to closed Apr 21, 2024
