Could you please finetune this on the base model, instead of instruct?

by Downtown-Case - opened Nov 18

Or perhaps use EVA as a base?

I ask because Qwen 32B Base is way less slopped than the instruct model, and far better past 32K context.

Owner Nov 18

Sure, I'm going to try some more things on Nemo 12B first. When I come back to Qwen, I'll take what works and train on top of EVA.

Thanks for the suggestion!
