alicecomfy's picture
Create README.md
f3e4a8a verified
lora merge as it was really tricky to get it to work of https://huggingface.co/152334H/miqu-1-70b-hermes2.5-qlora.
Base Model: Miqu 70B (Mistral AI Leak) Dequantized by 152234h
Finetune also by 152234h
Outputs seem good, but the prompting is still a bit buggy, not sure if that's an error on my part.
For me it wouldn't generate text until I activated flash attention 2 in Oogabooga. You need around 130 GB vram, 2 a100 80 or h100 work, as does 6 3090 or 4090.