Date

a month ago

Organization

Paper URL

Tags

The World Action Model (WAM) is a novel AI foundational model architecture for the fields of embodied intelligence and robotics. It was first proposed by NVIDIA in February 2026, with related research published in a paper titled "...".World Action Models are Zero-shot PoliciesThe paper proposes DreamZero (a 14-parameter robot foundation model) and, for the first time, explicitly uses the term World Action Model (WAM) to define this novel architecture. The paper points out that, unlike traditional VLA (which only maps single-step actions), WAM is a foundation model that directly inherits prior knowledge of the physical world by jointly predicting the "future world state (video stream)" and the "robot's actions," thus achieving extremely strong zero-shot generalization capability (Zero-shot Policy). In addition, NVIDIA officially released an entry titled "..."What Is a World Action Model?Further explanation is needed.

In May 2026, Fudan University, Shanghai Innovation Academy, and the National University of Singapore published a paper titled "World Action Models: The Next Frontier in Embodied AIThe paper provides a systematic review, explicitly defining WAM as: "An embodied foundational model that unifies predictive state modeling with action generation, with the goal of training a joint distribution of future states and actions, not just the actions themselves."

With NVIDIA DreamZero For example, WAM's underlying architecture is actually a massive video generation model (based on a video diffusion backbone network, such as Wan2.1 or NVIDIA Cosmos). The core workflow can be divided into three steps:

Input: Current screen + voice command + robot's current status
⬇️
[WAM core model (such as the 14B parameter DiT architecture)]
⬇️
One Forward Pass:

Predicted future video frames (what the world will look like next)

The precise movements of the robot in each frame (joint trajectories of degrees of freedom)

Through this joint prediction, actions and the evolution of the physical world are inextricably linked. For a robot to generate actions correctly, it must correctly generate future videos in its mind that conform to the laws of physics (gravity, friction, occlusion relationships).

Related Wiki

Learning While Deploying

LWD is a fleet-level offline-to-online reinforcement learning framework that enables general-purpose robots to continuously collect experience and achieve self-evolution of policies.

2 months ago

Theory of Space

Spatial theory refers to the framework of an intelligent agent’s ability to construct, update and utilize spatial beliefs in an environment of incomplete information through active exploration.

3 months ago

Mean Speed Strategy (MVP)

MVP achieves single-step action generation with both high expressive power and extremely fast computation by modeling the average velocity field.

3 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

HyperAI

Date

a month ago

Organization

Paper URL

2602.15922

Related Wiki

Learning While Deploying

LWD is a fleet-level offline-to-online reinforcement learning framework that enables general-purpose robots to continuously collect experience and achieve self-evolution of policies.

2 months ago

Theory of Space

Spatial theory refers to the framework of an intelligent agent’s ability to construct, update and utilize spatial beliefs in an environment of incomplete information through active exploration.

3 months ago

Mean Speed Strategy (MVP)

MVP achieves single-step action generation with both high expressive power and extremely fast computation by modeling the average velocity field.

3 months ago

Build AI with AI

From idea to launch — accelerate your AI development with free AI co-coding, out-of-the-box environment and best price of GPUs.

AI Co-coding

Ready-to-use GPUs

Best Pricing

Get Started View Pricing

HyperAI Newsletters

Subscribe to our latest updates

We will deliver the latest updates of the week to your inbox at nine o'clock every Monday morning

Command Palette

World Action Model WAM

Build AI with AI

HyperAI Newsletters

Command Palette

World Action Model WAM

Related Wiki

Learning While Deploying

Theory of Space

Mean Speed Strategy (MVP)

Build AI with AI

HyperAI Newsletters

Command Palette

World Action Model WAM

Related Wiki

Learning While Deploying

Theory of Space

Mean Speed Strategy (MVP)

Build AI with AI

HyperAI Newsletters

Related Wiki

Learning While Deploying

Theory of Space

Mean Speed Strategy (MVP)

Related Wiki

Learning While Deploying

Theory of Space

Mean Speed Strategy (MVP)