A curated list of recent diffusion models for video generation, editing, and other applications.
[CVPR 2025] Open-source, end-to-end Vision-Language-Action model for GUI agents and computer use.
Video generation via code
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
(NeurIPS 2024) Official PyTorch implementation of LOVA3
ICML 2025 - Impossible Videos