bos_token_id is equals to eos_token_id

#3
by mnwato - opened

After fine-tuning the mGPT-13B model, I am facing a problem generating a sentence as long as max_length because the model does not stop itself. I suspect that this is because the model cannot detect eos_token during fine-tuning.
Upon checking the config.json file, I found that "bos_token_id": 50256 is equal to "bos_token_id": 50256.

Any help would be appreciated.

mnwato changed discussion title from why bos_token_id equals to eos_token_id to bos_token_id is equals to eos_token_id

Sign up or log in to comment