Bridging human motion modeling, visual world models, and embodied AI for socially intelligent perception and action.
Human motion and activity provide a critical signal for understanding, predicting, and interacting with dynamic environments. In recent years, computer vision has made significant progress in human motion perception, activity understanding, and motion generation, laying a strong foundation for modeling human behavior and dynamics. Integrating these advances into visual world models that support prediction, planning, and decision-making is a key next step toward enabling embodied intelligent systems to reason and act effectively in human-populated scenes.
This workshop focuses on the challenge of integrating rich models of human motion and behavior into world models, an increasingly important yet still under-explored direction in visual world modeling. Topics include:
1. Modeling human motion and activity in complex, interactive scenes;
2. Learning dynamic world models that incorporate human behavior to represent scenes, objects, and affordances;
3. Enabling efficient and robust real-world deployment, including integrated perception-planning, safe navigation and autonomous driving, and improved generalization under noise, occlusions, and distribution shifts.
This topic is closely aligned with recent progress in visual world modeling and generative simulation, which aims to capture vision-based representations of scene structure, object relations, and human dynamics. The workshop will bring together research on human motion and activity modeling, dynamic scene understanding, and world models that explicitly account for human behavior as a central component of the environment. By uniting perspectives from computer vision, embodied AI, robotics, and graphics, the workshop provides a forum to explore human-centered world models that enable socially intelligent perception and action in applications such as dynamic scene understanding, predictive navigation, autonomous driving, and human-robot interaction.
Half-day program
Selected paper oral presentations
Open Q&A with speakers
EPFL, Switzerland
3D localization & trajectory forecasting
EPFL, Switzerland
World models for autonomous driving
EPFL, Switzerland
Motion prediction & 3D perception
KTH / Scania, Sweden
Prediction & planning in traffic
Örebro University, Sweden
Probabilistic human motion patterns
TU Munich, Germany
Motion prediction & HRI
The Chinese University of Hong Kong, China
HRI & robotic manipulation
TU Munich, Germany
Multi-modal perception & navigation
Örebro University, Sweden
3D mapping & autonomous systems
Meta Reality Labs, USA
Generative AI for digital humans
Bosch Center for Artificial Intelligence, Germany
Dynamic perception & 3D scene graphs
Bosch Center for Artificial Intelligence, Germany
Predictive navigation & RL