xiaoqijian's picture
2 2

xiaoqijian

mx1024
·

AI & ML interests

None yet

Recent Activity

upvoted a collection about 13 hours ago
TinyR1
commented on an article 1 day ago
Open R1: Update #3
upvoted an article 1 day ago
Open R1: Update #3
View all activity

Organizations

OpenReasoning's profile picture

mx1024's activity

commented on Open R1: Update #3 1 day ago
view reply

How is packing implemented in your code? Have you tried using a 4D attention mask to avoid the overlap between samples that you mentioned?

upvoted an article 1 day ago