This version of PaliGemma fails to generate the EOS token, the generation goes on to the max_length

#4
by heydariAI - opened

Hello there, im using this version of PaliGemma and for some test cases, it doesnt generate the eos token so it goes until the generation reaches the max_length. im using the provided code script in the repository itself

Google org

Hi @heydariAI ,

<eos> and it's token id is 1.

The model did generate the EOS token, but it wasn't visible in the output because skip_special_tokens=True was used in processor.decode(). If you set skip_special_tokens=False, the EOS token will be included in the output. For more details, could you please refer to this gist file

Thank you.

Your need to confirm your account before you can post a new comment.

Sign up or log in to comment