Good!
make more happy accident in surgical finetuning - so more fewer clichés
could you just fix this when i try mlx it?
ValueError: Received 1 parameters not in model: lm_head.weight.
also same issue with 31B version....
Makes sense, considering this model has an extra tensor that others won't have! It's duplicated from the original embed_tokens, then trained in isolation.
Not an awful lot I can do about that, sorry!
I’m absolutely thrilled with this fine-tune. I tried it out for the first time today and was blown away. I knew Gemma was good at following instructions, but I’ve never seen it handle them quite this well. I asked it to incorporate natural spelling errors, repeated letters, lowercase text, and so on. At first, I was wondering what was going on—why it was writing so "weirdly"—until I realized it was simply following my system prompt :D It feels refreshing—different.