Papers
arxiv:2410.07303

Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow

Published on Oct 9
Β· Submitted by Lingaaaaaaa on Oct 11

Abstract

Diffusion models have greatly improved visual generation but are hindered by slow generation speed due to the computationally intensive nature of solving generative ODEs. Rectified flow, a widely recognized solution, improves generation speed by straightening the ODE path. Its key components include: 1) using the diffusion form of flow-matching, 2) employing boldsymbol v-prediction, and 3) performing rectification (a.k.a. reflow). In this paper, we argue that the success of rectification primarily lies in using a pretrained diffusion model to obtain matched pairs of noise and samples, followed by retraining with these matched noise-sample pairs. Based on this, components 1) and 2) are unnecessary. Furthermore, we highlight that straightness is not an essential training target for rectification; rather, it is a specific case of flow-matching models. The more critical training target is to achieve a first-order approximate ODE path, which is inherently curved for models like DDPM and Sub-VP. Building on this insight, we propose Rectified Diffusion, which generalizes the design space and application scope of rectification to encompass the broader category of diffusion models, rather than being restricted to flow-matching models. We validate our method on Stable Diffusion v1-5 and Stable Diffusion XL. Our method not only greatly simplifies the training procedure of rectified flow-based previous works (e.g., InstaFlow) but also achieves superior performance with even lower training cost. Our code is available at https://github.com/G-U-N/Rectified-Diffusion.

Community

Paper submitter
Paper author
β€’
edited Oct 11

In this paper, we propose Rectified Diffusion and show

  • 🌟 The nature of rectification proposed in rectified flow is retraining with paired noise-sample.
  • 🌟 The rectification is not limited to flow-matching models. We extend the application scope of rectification to general diffusion models. We achieve better performance with smaller training cost.
  • 🌟 The essential training target of rectification is to obtain the first-order ODE instead of straight ODE path. For diffusion models like DDPM, the first-order ODE path is inherently curved.
  • 🌟 We extend rectified diffusion to multiphase setting and surpass the performance of multiphase-based rectified flow method.

[paper]: https://arxiv.org/pdf/2410.07303

[GitHub]: https://github.com/G-U-N/Rectified-Diffusion/tree/master

[HuggingFace]: https://huggingface.co/wangfuyun/Rectified-Diffusion

[δΈ­ζ–‡blog]: https://zhuanlan.zhihu.com/p/884824542

We have a summative figure in the following
teaser.png

This is an automated message from the Librarian Bot. I found the following papers similar to this paper.

The following papers were recommended by the Semantic Scholar API

Please give a thumbs up to this comment if you found it helpful!

If you want recommendations for any Paper on Hugging Face checkout this Space

You can directly ask Librarian Bot for paper recommendations by tagging it in a comment: @librarian-bot recommend

Sign up or log in to comment

Models citing this paper 1

Datasets citing this paper 0

No dataset linking this paper

Cite arxiv.org/abs/2410.07303 in a dataset README.md to link it from this page.

Spaces citing this paper 0

No Space linking this paper

Cite arxiv.org/abs/2410.07303 in a Space README.md to link it from this page.

Collections including this paper 3