Training Cost

#16
by zhangyang-0123 - opened

Hi,

Would you mind sharing how many training hours it took to fine-tune the MMDit 4? I am wondering whether anyone could run this distillation on an affordable cloud budget.

We trained the final version (flux-lite-8b) on 8x H200 GPUs for about 140 hours, plus roughly 1-2 extra days to pre-compute the input and target hidden states.
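
For anyone budgeting this on cloud hardware, the figures above can be turned into GPU-hours with a rough sketch like the one below. Only the 8 GPUs, the 140 hours, and the 1-2 day range come from this thread. The hourly price is a hypothetical placeholder, not a quote from any provider, and the assumption that the pre-computation also ran on all 8 GPUs is mine; it was not stated.

```python
# Rough budget estimate from the numbers quoted in this thread.
# ASSUMPTIONS (not from the thread):
#   - PRICE_PER_GPU_HOUR is a hypothetical placeholder; substitute your provider's rate.
#   - The 1-2 days of pre-computation are assumed to use the same 8 GPUs.

NUM_GPUS = 8                # from the thread: 8x H200
TRAIN_HOURS = 140           # from the thread: about 140 hours of training
PRECOMPUTE_DAYS = (1, 2)    # from the thread: ~1-2 extra days
PRICE_PER_GPU_HOUR = 3.0    # hypothetical USD rate, replace with a real quote


def gpu_hours(num_gpus: int, wall_clock_hours: float) -> float:
    """GPU-hours = number of GPUs times wall-clock hours."""
    return num_gpus * wall_clock_hours


def estimate_cost(total_gpu_hours: float, price_per_gpu_hour: float) -> float:
    """Cost = GPU-hours times the hourly price of one GPU."""
    return total_gpu_hours * price_per_gpu_hour


train_gpu_hours = gpu_hours(NUM_GPUS, TRAIN_HOURS)
precompute_low = gpu_hours(NUM_GPUS, PRECOMPUTE_DAYS[0] * 24)
precompute_high = gpu_hours(NUM_GPUS, PRECOMPUTE_DAYS[1] * 24)

total_low = train_gpu_hours + precompute_low
total_high = train_gpu_hours + precompute_high

print(f"Training only: {train_gpu_hours} GPU-hours")
print(f"With pre-computation: {total_low} to {total_high} GPU-hours")
print(
    "Estimated cost at the placeholder rate: "
    f"${estimate_cost(total_low, PRICE_PER_GPU_HOUR):,.0f} to "
    f"${estimate_cost(total_high, PRICE_PER_GPU_HOUR):,.0f}"
)
```

Training alone comes to 8 x 140 = 1120 GPU-hours. The total depends on how the pre-computation was actually run, so treat the upper and lower bounds as a range under the stated assumptions.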
