Don yang Lin's picture
1

Don yang Lin

dylin7
ยท

AI & ML interests

Large language models

Recent Activity

Organizations

Linkedin's profile picture

dylin7's activity

commented on 4D masks support in Transformers about 1 month ago
view reply

@poedator About Sequences packing in SFT (supervised finetuning) training, do you have any example script? If you have it, could you provide it? Thank you very much.

view reply

data_collator = DataCollatorWithFlattening() This method cannot inherently prevent attention isolation. But it can have position isolation. Do you realize this problem? @RQlee @ArthurZ