Is this model the instruct version?
Hi, I checked the tokenizer and found that both `gemma-7b-bnb-4bit` and `gemma-7b-it-bnb-4bit` share the same tokenizer. Are both models fine-tuned instruct versions?
@shi-zheng-qxhs
Oh no, the `it` is the instruct one. I manually edited the tokenizer to expose the tokens for `<start_of_turn>` and `<end_of_turn>`. Interestingly, both the instruct and base models have these tokens.
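If you want to verify this yourself, here's a minimal sketch (assuming both repos live under the `unsloth` org on the Hub) that prints the ids the tokenizer assigns to the two turn tokens:

```python
from transformers import AutoTokenizer

# Check that both the base and instruct tokenizers expose the turn tokens.
# The `unsloth/` org prefix is an assumption.
for repo in ["unsloth/gemma-7b-bnb-4bit", "unsloth/gemma-7b-it-bnb-4bit"]:
    tok = AutoTokenizer.from_pretrained(repo)
    for token in ["<start_of_turn>", "<end_of_turn>"]:
        # convert_tokens_to_ids falls back to the unk id for unknown tokens
        print(repo, token, tok.convert_tokens_to_ids(token))
```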
Thanks, just wanted to make sure. :)
Just a few follow-up questions:
- Is there any specific reason `padding_side` is set to `"right"`?
- Can I use `unsloth` to perform custom training, i.e., without using any of the `Trainer` classes, but with a native PyTorch training loop, for example?

Thanks!!!
@shi-zheng-qxhs

- `padding_side = "right"` is for training purposes only. Change it to `"left"` for inference.
- Yes, it should work! Something like the sketch below:
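This is a rough, untested sketch of a native PyTorch loop, assuming unsloth's `FastLanguageModel` loader and the standard causal-LM loss from `transformers`; the model name, LoRA settings, and hyperparameters are all illustrative:

```python
import torch
from unsloth import FastLanguageModel

# Load the 4-bit model and attach LoRA adapters (values are illustrative)
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/gemma-7b-bnb-4bit",
    max_seq_length=2048,
    load_in_4bit=True,
)
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-4)
texts = ["Example training text."]  # stand-in for a real dataset

model.train()
for text in texts:
    batch = tokenizer(text, return_tensors="pt").to(model.device)
    # Causal LM: labels are the input ids; the model shifts them internally
    loss = model(**batch, labels=batch["input_ids"]).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
```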
For fine-tuning and inference with flash attention, shouldn't it be `padding_side = "left"`?
Could you explain the reasoning behind `padding_side = "right"`?
@NickyNicky You can use left padding, however it makes things slower for training, so I don't advise it. Unsloth itself requires right padding for training.
Yes, simply after training set `tokenizer.padding_side = "left"` before calling `model.generate`.
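Concretely, something like this sketch (continuing from a model and tokenizer loaded as in the training snippet above; the prompts are illustrative):

```python
# Right padding is fine while training, but for batched generation the
# pads must sit on the left so new tokens continue directly from the prompt.
tokenizer.padding_side = "left"
FastLanguageModel.for_inference(model)  # unsloth's faster inference mode

prompts = ["The capital of France is", "Hi"]
inputs = tokenizer(prompts, return_tensors="pt", padding=True).to(model.device)
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```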
But wouldn't flash attention raise warnings about `padding_side = "right"`? Now I'm confused, haha.
@NickyNicky Oh, if you're simply using HF, just use whatever they provide. Unsloth itself uses right padding.
Oh, interesting. I see that after the merge the library deletes `padding_side` from `tokenizer_config.json`, but when the LoRA weights are saved, `padding_side` is there.

Example merge:
https://huggingface.co/NickyNicky/gemma-1.1-2b-it_text_to_sql_format_chatML_V1/blob/main/tokenizer_config.json

Fine-tuned with unsloth PEFT:
https://huggingface.co/NickyNicky/gemma-1.1-2b-it_text_to_sql_format_chatML_peft_V1/blob/main/tokenizer_config.json
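You can confirm the difference programmatically; a quick sketch using `huggingface_hub` to read both configs linked above:

```python
import json
from huggingface_hub import hf_hub_download

# Compare the two tokenizer configs from the repos linked above
repos = [
    "NickyNicky/gemma-1.1-2b-it_text_to_sql_format_chatML_V1",       # merged
    "NickyNicky/gemma-1.1-2b-it_text_to_sql_format_chatML_peft_V1",  # LoRA
]
for repo in repos:
    path = hf_hub_download(repo_id=repo, filename="tokenizer_config.json")
    with open(path) as f:
        cfg = json.load(f)
    # A missing key here means the merged repo dropped padding_side
    print(repo, "->", cfg.get("padding_side"))
```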
So after training, it's time to add `padding_side` back.