Personalized text-to-image generation with Large Language and Vision Assistant enhanced training
Preliminaries. Stable Diffusion Our method is built on top of Stable Diffusion (Rombach et al., 2022), which uses an auto-encoder to perform the ...