arxiv:2410.07303

Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow

Published on Oct 9

Submitted by Lingaaaaaaa on Oct 11

Authors: Fu-Yun Wang, Zhaoyang Huang, Mengdi Wang, Hongsheng Li

Abstract

Diffusion models have greatly improved visual generation but are hindered by slow generation speed due to the computationally intensive nature of solving generative ODEs. Rectified flow, a widely recognized solution, improves generation speed by straightening the ODE path. Its key components include: 1) using the diffusion form of flow-matching, 2) employing $\boldsymbol{v}$-prediction, and 3) performing rectification (a.k.a. reflow). In this paper, we argue that the success of rectification primarily lies in using a pretrained diffusion model to obtain matched pairs of noise and samples, followed by retraining with these matched noise-sample pairs. Based on this, components 1) and 2) are unnecessary. Furthermore, we highlight that straightness is not an essential training target for rectification; rather, it is a specific case of flow-matching models. The more critical training target is to achieve a first-order approximate ODE path, which is inherently curved for models like DDPM and Sub-VP. Building on this insight, we propose Rectified Diffusion, which generalizes the design space and application scope of rectification to encompass the broader category of diffusion models, rather than being restricted to flow-matching models. We validate our method on Stable Diffusion v1-5 and Stable Diffusion XL. Our method not only greatly simplifies the training procedure of previous rectified flow-based works (e.g., InstaFlow) but also achieves superior performance at an even lower training cost. Our code is available at https://github.com/G-U-N/Rectified-Diffusion.
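The pairing-then-retraining idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: `toy_pretrained_sampler` is a hypothetical stand-in for running a pretrained model's deterministic ODE solver, the cosine/sine schedule is an assumed variance-preserving example, and the time convention (t = 0 is data, t = 1 is noise) is also an assumption. The point it tries to show is only that each noise vector stays matched to the sample it produced, and that the retraining target can remain plain epsilon-prediction.

```python
import math
import random

def alpha_sigma_vp(t):
    """Assumed variance-preserving schedule with alpha^2 + sigma^2 = 1."""
    return math.cos(0.5 * math.pi * t), math.sin(0.5 * math.pi * t)

def toy_pretrained_sampler(eps):
    """Hypothetical stand-in for the pretrained model's deterministic sampler."""
    return [math.tanh(2.0 * e) for e in eps]

def build_pairs(num_pairs, dim, seed=0):
    """Draw noise, run the (toy) sampler, and keep each noise with its own sample."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(num_pairs):
        eps = [rng.gauss(0.0, 1.0) for _ in range(dim)]
        x0 = toy_pretrained_sampler(eps)
        pairs.append((eps, x0))
    return pairs

def training_example(eps, x0, t):
    """Noisy input built from a matched pair; the regression target is the paired eps."""
    a, s = alpha_sigma_vp(t)
    x_t = [a * x + s * e for x, e in zip(x0, eps)]
    return x_t, eps

pairs = build_pairs(num_pairs=4, dim=3)
eps, x0 = pairs[0]
x_t, target = training_example(eps, x0, t=0.5)
```

In standard diffusion training the noise would be drawn independently of the sample; here the only change is that `eps` and `x0` come from the same pair.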

Community

Lingaaaaaaa

Paper submitter 6 days ago

https://github.com/G-U-N/Rectified-Diffusion

wangfuyun

Paper author 6 days ago (edited 6 days ago)

In this paper, we propose Rectified Diffusion and show that:

🌟 The essence of the rectification proposed in rectified flow is retraining with paired noise-sample data.
🌟 Rectification is not limited to flow-matching models. We extend its application scope to general diffusion models and achieve better performance at a lower training cost.
🌟 The essential training target of rectification is a first-order ODE path rather than a straight ODE path. For diffusion models like DDPM, the first-order ODE path is inherently curved.
🌟 We extend Rectified Diffusion to the multiphase setting and surpass the performance of the multiphase rectified flow method.
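
The difference between a first-order path and a straight path can be checked numerically. The sketch below assumes that "first-order" means a single pair (x_0, ε) explains every point on the trajectory via x_t = α_t x_0 + σ_t ε. The two schedules are illustrative choices (a linear flow-matching schedule and a cosine/sine variance-preserving schedule), not the exact schedules used in the paper, and all function names are hypothetical.

```python
import math

def flow_matching_schedule(t):
    """Linear interpolation: alpha_t = 1 - t, sigma_t = t."""
    return 1.0 - t, t

def vp_schedule(t):
    """Assumed variance-preserving schedule: alpha_t^2 + sigma_t^2 = 1."""
    return math.cos(0.5 * math.pi * t), math.sin(0.5 * math.pi * t)

def point_on_path(schedule, x0, eps, t):
    """x_t = alpha_t * x0 + sigma_t * eps with the same (x0, eps) at every t."""
    a, s = schedule(t)
    return tuple(a * x + s * e for x, e in zip(x0, eps))

def bend(schedule, x0, eps):
    """2D cross product of two chords of the path; zero means collinear points."""
    p0, p1, p2 = (point_on_path(schedule, x0, eps, t) for t in (0.0, 0.5, 1.0))
    u = (p1[0] - p0[0], p1[1] - p0[1])
    v = (p2[0] - p0[0], p2[1] - p0[1])
    return u[0] * v[1] - u[1] * v[0]

def recover_eps(schedule, x0, x_t, t):
    """Invert x_t = alpha_t * x0 + sigma_t * eps for eps (requires sigma_t > 0)."""
    a, s = schedule(t)
    return tuple((xt - a * x) / s for xt, x in zip(x_t, x0))

x0, eps = (1.0, 0.0), (0.0, 1.0)
bend_fm = bend(flow_matching_schedule, x0, eps)   # straight path
bend_vp = bend(vp_schedule, x0, eps)              # curved path
x_mid = point_on_path(vp_schedule, x0, eps, 0.5)
eps_vp = recover_eps(vp_schedule, x0, x_mid, 0.5)
```

Under these assumptions the flow-matching path is a straight segment while the variance-preserving path bends, yet in both cases the same ε is recovered from any point on the path, which is the property the third bullet treats as essential.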

[paper]: https://arxiv.org/pdf/2410.07303

[GitHub]: https://github.com/G-U-N/Rectified-Diffusion/tree/master

[HuggingFace]: https://huggingface.co/wangfuyun/Rectified-Diffusion

[Chinese blog]: https://zhuanlan.zhihu.com/p/884824542

A summary figure is shown below.