Why is the "add_bos_token" set to True in tokenizer_config.json?

#17
by heyaa - opened

Should I keep the bos_token_id in my input_ids for downstream tasks?

Hey @heyaa ,

OPT uses a GPT2Tokenizer but prepends every prompt with a BOS TOKEN (e.g. <s> Hello there instead of Hello there)

Sign up or log in to comment