Announcement_1

We present VISTA and Maestro, two self-improving multimodal generation agents for text-to-video and text-to-image generation, respectively.