Without this change, some problems with training appear.
Note that this line is still present in gemma-2b-it and gemma-2b.

cc @ArthurZ, what do you think?

