Video World Models

Back to the Features: DINO as a Foundation for Video World Models
ICML Workshop 2025 - Learning physical world models in the latent space of DINOv2 from uncurated web videos.