Questions about “Minibatch Optimal Transport”

#10

by zyx1213271098 - opened 10 days ago

I don't understand the principle of Minibatch Optimal Transport. Can you explain it in more detail? Why is a smaller distance more advantageous for model training? What impact does this have on the inference performance?

lodestones

Owner 8 days ago

edited 8 days ago

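Yeah, basically for every training batch you compute the optimal transport pairing between the noise samples and the images, i.e. the one-to-one assignment that minimizes the total distance within the batch, instead of pairing them at random.

It converges faster because the model has more certainty about its regression target. The model regresses on the noise-to-image vector and ends up learning its expectation value at each point; with OT pairings fewer paths cross, so the vectors it sees near any given point agree more and that expectation is easier to learn.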
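To make the pairing step concrete, here is a minimal sketch, not the actual training code from this repo. It assumes a squared Euclidean cost, an exact assignment solver (`scipy.optimize.linear_sum_assignment`), and the convention that noise sits at t=0 and the image at t=1; the function names `minibatch_ot_pairing` and `rf_targets` are made up for illustration.

```python
import numpy as np
from scipy.optimize import linear_sum_assignment


def minibatch_ot_pairing(noise, images):
    """Reorder `noise` so that noise[i] is the OT match of images[i].

    Assumption: squared Euclidean cost, exact one-to-one assignment.
    """
    b = noise.shape[0]
    x0 = noise.reshape(b, -1)
    x1 = images.reshape(b, -1)
    # Pairwise squared distances, shape (b, b): cost[i, j] = |x0[i] - x1[j]|^2
    cost = (x0 ** 2).sum(1)[:, None] + (x1 ** 2).sum(1)[None, :] - 2.0 * x0 @ x1.T
    row, col = linear_sum_assignment(cost)  # noise row[i] <-> image col[i]
    perm = np.empty(b, dtype=int)
    perm[col] = row  # perm[j] = index of the noise sample assigned to image j
    return noise[perm], perm


def rf_targets(noise, images, rng):
    """Rectified-flow regression inputs and targets for already-paired samples."""
    t = rng.uniform(size=(noise.shape[0],) + (1,) * (noise.ndim - 1))
    x_t = (1.0 - t) * noise + t * images  # point on the straight path
    v = images - noise                    # vector the model regresses on
    return x_t, t, v


rng = np.random.default_rng(0)
images = rng.normal(size=(64, 8)) + 3.0  # stand-in for a batch of data
noise = rng.normal(size=(64, 8))

paired_noise, perm = minibatch_ot_pairing(noise, images)
random_cost = ((noise - images) ** 2).sum()      # pairing as sampled
ot_cost = ((paired_noise - images) ** 2).sum()   # pairing after OT
x_t, t, v = rf_targets(paired_noise, images, rng)
print(random_cost, ot_cost)
```

The OT cost can never be larger than the cost of the as-sampled pairing, since the as-sampled pairing is itself one of the candidate assignments. The exact solver scales roughly cubically with batch size, so this is only meant for modest batches.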

You can see it here: MNIST has almost converged after just 1 epoch.
https://x.com/LodestoneE621/status/1893408571448049685

CIFAR-10, RF vs OT-RF:

Basically, learning the expectation value from this (plain RF, random noise-image pairings) is harder

than from this (OT-RF, minibatch OT pairings).

lodestones

Owner 8 days ago

You can see that most of the flow paths are straighter too,
so it can reduce the number of inference steps quite a bit (not as much as reflowing it again, though).
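A toy illustration of why straighter paths need fewer steps, under loud assumptions: the two velocity fields below are hand-written stand-ins, not a trained model, and `euler_sample` is a hypothetical helper. With a perfectly straight flow (constant velocity along each path) a single Euler step already lands on the exact endpoint, while a curved flow needs many steps to get close.

```python
import numpy as np


def euler_sample(velocity_fn, x0, num_steps):
    """Integrate dx/dt = velocity_fn(x, t) from t=0 to t=1 with fixed Euler steps."""
    x = x0.copy()
    dt = 1.0 / num_steps
    for i in range(num_steps):
        t = i * dt
        x = x + dt * velocity_fn(x, t)
    return x


# Toy straight flow: every point moves with the same constant velocity.
shift = np.array([2.0, -1.0])


def straight_flow(x, t):
    return np.broadcast_to(shift, x.shape)


# Toy curved flow: rotation around the origin by 90 degrees over t in [0, 1].
def curved_flow(x, t):
    return (np.pi / 2) * np.stack([-x[:, 1], x[:, 0]], axis=1)


rng = np.random.default_rng(0)
x0 = rng.normal(size=(4, 2))

straight_1 = euler_sample(straight_flow, x0, 1)
straight_100 = euler_sample(straight_flow, x0, 100)
curved_1 = euler_sample(curved_flow, x0, 1)
curved_100 = euler_sample(curved_flow, x0, 100)

# Gap between the 1-step and 100-step results for each flow.
print(np.abs(straight_1 - straight_100).max())
print(np.abs(curved_1 - curved_100).max())
```

A real model sits somewhere between these two extremes, which matches the point above: OT pairing straightens most paths, and reflowing straightens them further.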
