You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
arxiv 2024, 11, Understanding World or Predicting Future? A Comprehensive Survey of World Models Paper.
benchmark
arXiv 2024, 03, HumanoidBench: Simulated Humanoid Benchmark for Whole-Body Locomotion and Manipulation PaperWebsite. $15$ whole-body manipulation and $12$ locomotion tasks. This repo contains the code for environments and training.
Cosmos Website. Paper. Autoregressive Video2World/Text2World foundation models.
toolbox
Menagerie Website MuJoCo physics engines. System identification toolbox has not been released.(up to 2025.1)
MuJoCo Playground WebsitePaper Training environments in mjx. Humanoid Locomotion, Quadruped Locomotion and Manipulation (most robot arms and hand) tasks are included.
papers
arxiv 2026, 02, World Action Models are Zero-shot Policies. Webiste. Paper.
arxiv 2026, 01, PointWorld: Scaling 3D World Models for In-The-Wild Robotic Manipulation. Website. Paper.
arxiv 2026, 01, Cosmos Policy: Fine-Tuning Video Models for Visuomotor Control and Planning. Paper. Website.
arxiv 2025, 12, What Drives Success in Physical Planning with Joint-Embedding Predictive World Models? Paper. Code.
arxiv 2025, 12, World Models Can Leverage Human Videos for Dexterous Manipulation. Paper. Website.
arxiv 2025, 12, Closing the Train-Test Gap in World Models for Gradient-Based Planning. Paper. Code.
arxiv 2025, 10, Ego-Vision World Model for Humanoid Contact Planning. Website.
unitree world model, UnifoLM-WMA-0: A World-Model-Action (WMA) Framework under UnifoLM Family. Website.
arxiv 2025, 08, Genie Envisioner: A Unified World Foundation Model for Robotic Manipulation. Website. Paper.
RSS 2025 Best Systems Paper finalist, Learned Perceptive Forward Dynamics Model for Safe and Platform-aware Robotic Navigation. Website. Paper.
RSS 2025, Particle-Grid Neural Dynamics for Learning Deformable Object Models from RGB-D Videos. Website. Paper.
ICRA 2025, World Model-based Perception for Visual Legged Locomotion. Website. Code.
arxiv 2025, 03, Multi-Stage Manipulation with Demonstration-Augmented Reward, Policy, and World Model Learning. Paper. Website.
arxiv 2025, 01, RoboHorizon: An LLM-Assisted Multi-View World Model for Long-Horizon Robotic Manipulation Papaer.
arxiv 2025, 01, Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics Paper. MBPO sim2real using world models. Quadruped locomotion tasks.
ICML 2025, Trajectory World Models for Heterogeneous Environments. Paper.
ICLR 2024, Unleashing Large-Scale Video Generative Pre-training for Visual Robot Manipulation Website.
arxiv 2024, 11, DINO-WM: World Models on Pre-trained Visual Features enable Zero-shot Planning. Website. World model for MPC. DINOv2 for representation.
CVPR 2023, Affordances from Human Videos as a Versatile Representation for Robotics. Paper. Prediction contact points and trajectory waypoints, then use it for downstream tasks (suitable for different learning paradigms).
RSS 2023, Structured World Models from Human Videos. Paper. Robot arm manipulation tasks. World Models with structured action space design.
CoRL 2023 (Oral), Finetuning Offline World Models in the Real World WebsitePaper Offline pretraining and online finetuning of world models. Robot arm manipulation tasks.
CoRL 2022, Daydreamer: World models for physical robot learning. Paper.
workshop
Neurlps 2025, Embodied World Models for Decision Making Website.