What's the decoder_start_token_id and eos_token_id used in training?
#7 opened by cqchangm
In large-v3, the start and end tokens are (50258, 50257), i.e. ("<|startoftranscript|>", "<|endoftext|>").
In this model, they are (50257, 50256), which map to ("<|endoftext|>", "") according to added_tokens.json.
Was the model fine-tuned this way, i.e. with <|endoftext|> as the decoder start token? Or was it just a typo in the config?
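For anyone who wants to reproduce the comparison, here is a minimal sketch using transformers. The large-v3 repo id is the official one; "your-org/this-model" is a placeholder for this model's repo, since I'm only quoting from its config and added_tokens.json:

```python
# Minimal sketch: compare the decoder start / eos token ids of
# Whisper large-v3 against this model. "your-org/this-model" is
# a placeholder repo id, not the actual model name.
from transformers import AutoConfig, AutoTokenizer

for repo in ("openai/whisper-large-v3", "your-org/this-model"):
    config = AutoConfig.from_pretrained(repo)
    tokenizer = AutoTokenizer.from_pretrained(repo)
    start_id = config.decoder_start_token_id
    eos_id = config.eos_token_id
    print(repo)
    print("  decoder_start_token_id:", start_id,
          tokenizer.convert_ids_to_tokens(start_id))
    print("  eos_token_id:          ", eos_id,
          tokenizer.convert_ids_to_tokens(eos_id))
```

For openai/whisper-large-v3 this prints 50258 / <|startoftranscript|> and 50257 / <|endoftext|>, matching the values above.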