Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor
Towards Multimodal Understanding via Stable Diffusion as a Task-Aware Feature Extractor. Start Chat. July 9, 2025. Authors: Vatsal Agarwal, Matthew ...