alicecomfy
/

miqu-openhermes-full

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

miqu-openhermes-full / README.md

alicecomfy's picture

Create README.md

f3e4a8a verified 9 months ago

|

history blame contribute delete

467 Bytes

	lora merge as it was really tricky to get it to work of https://huggingface.co/152334H/miqu-1-70b-hermes2.5-qlora.

	Base Model: Miqu 70B (Mistral AI Leak) Dequantized by 152234h
	Finetune also by 152234h

	Outputs seem good, but the prompting is still a bit buggy, not sure if that's an error on my part.

	For me it wouldn't generate text until I activated flash attention 2 in Oogabooga. You need around 130 GB vram, 2 a100 80 or h100 work, as does 6 3090 or 4090.