bos_token_id is equals to eos_token_id
#3
by
mnwato
- opened
After fine-tuning the mGPT-13B
model, I am facing a problem generating a sentence as long as max_length
because the model does not stop itself. I suspect that this is because the model cannot detect eos_token
during fine-tuning.
Upon checking the config.json
file, I found that "bos_token_id": 50256
is equal to "bos_token_id": 50256
.
Any help would be appreciated.
mnwato
changed discussion title from
why bos_token_id equals to eos_token_id
to bos_token_id is equals to eos_token_id