UC Berkeley And MIT Researchers Propose A Policy Gradient Algorithm Called Denoising …
... treats denoising diffusion as a multi-step decision-making problem, enables fine-tuning Stable Diffusion on challenging downstream objectives.