news

Oct 17, 2025 We present VISTA and Maestro, two self-improving multimodal generation agents for text-to-video and text-to-image generation, respectively.
May 19, 2025 We present Visual Planning, where we apply reinforcement learning post-training on pure-vision models to achieve state-of-the-art performance in visual reasoning tasks.