multiple description candidates to facilitate DDR and HDR training
#1 by Arsenever - opened
Hi, thanks for your attention to our work.
- As stated in the paper, we train the HDR and DDR tasks on data samples that are randomly synthesized online, so we have not prepared a fixed dataset for release.
- These sentences are different expressions of the same semantic meaning, produced by the Desc. Rewrite step described in the paper. During training, you can randomly choose one of them each time.
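For anyone who wants to reproduce the random choice among description candidates, here is a minimal Python sketch. It is not the authors' code: the helper `pick_description`, the `descriptions` field name, and the example sentences are all hypothetical and only illustrate drawing one candidate uniformly at random each time a sample is used.

```python
import random


def pick_description(candidates, rng=random):
    """Return one description chosen uniformly at random from the candidates."""
    if not candidates:
        raise ValueError("candidates must be non-empty")
    return rng.choice(candidates)


# Hypothetical sample: the field name and sentences are illustrative only,
# standing in for rewritten descriptions that share one semantic meaning.
sample = {
    "descriptions": [
        "a red cube on the table",
        "a cube that is red sits on the table",
        "on the table there is a red cube",
    ],
}

# A seeded generator keeps the draws reproducible across runs.
rng = random.Random(0)
for step in range(3):
    text = pick_description(sample["descriptions"], rng)
    print(step, text)
```

Drawing a fresh candidate on every access, rather than fixing one per sample up front, means the model sees different phrasings of the same content over the course of training.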
Thanks for your reply! Could you tell me how long the training takes and how many A100s it requires?
We use 16 A100-80G GPUs, and pre-training takes about 10 hours.
jpWang changed discussion status to closed