Some questions about section 3.4 of the paper

#45

by johnn95 - opened Mar 22

Discussion

johnn95

Mar 22

edited Mar 22

1. Are D′(x_{t−ns}, t − ns, c) and D the same model?
2. Is the input condition c empty?
3. Is the training goal to make the student's unconditional output close to the teacher's unconditional output?
4. How many images were used for training with the conditional objective and for fine-tuning with the unconditional objective?

Looking forward to your reply!

PeterL1n

ByteDance org Mar 22

1. No, they are not the same model.
2. c is never empty. "Unconditional" refers to the conditioning on x_t, not on c.
3. Yes.
4. We used a different number of iterations for each stage; earlier stages converge faster. It was roughly 10k iterations per stage.
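
To make the second answer above concrete, here is a toy sketch of the difference between the two models. It is not the paper's implementation: the class names `ConditionalD` and `UnconditionalD`, the linear scorers, the argument order of D, and the normalized timesteps are all assumptions made for illustration. The only points it tries to show are that D′ never receives x_t while still receiving c, and that D and D′ hold separate weights.

```python
import math
import random


def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))


def dot(w, v):
    return sum(a * b for a, b in zip(w, v))


class ConditionalD:
    """Toy stand-in for D: scores x_prev given x_t, both timesteps, and c."""

    def __init__(self, dim, rng):
        # Weights over the concatenation [x_prev, x_t, c, t_prev, t].
        self.w = [rng.uniform(-1, 1) for _ in range(3 * dim + 2)]

    def __call__(self, x_t, x_prev, t, t_prev, c):
        return sigmoid(dot(self.w, x_prev + x_t + c + [t_prev, t]))


class UnconditionalD:
    """Toy stand-in for D': x_t and t are not inputs, but c still is."""

    def __init__(self, dim, rng):
        # Weights over the concatenation [x_prev, c, t_prev].
        self.w = [rng.uniform(-1, 1) for _ in range(2 * dim + 1)]

    def __call__(self, x_prev, t_prev, c):
        return sigmoid(dot(self.w, x_prev + c + [t_prev]))


rng = random.Random(0)
dim = 4
D = ConditionalD(dim, rng)
D_prime = UnconditionalD(dim, rng)  # separate weights: not the same model as D

# Hypothetical toy inputs; real inputs would be latents and text embeddings.
x_t = [rng.gauss(0, 1) for _ in range(dim)]
x_prev = [rng.gauss(0, 1) for _ in range(dim)]  # plays the role of x_{t-ns}
c = [rng.gauss(0, 1) for _ in range(dim)]       # condition, never empty

score_cond = D(x_t, x_prev, 0.75, 0.5, c)
score_uncond = D_prime(x_prev, 0.5, c)
print(score_cond, score_uncond)
```

Changing `x_t` changes `score_cond` but cannot affect `score_uncond`, because `D_prime` has no `x_t` argument at all.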

johnn95

Mar 25

Thank you for your reply

johnn95 changed discussion status to closed Mar 25
