maple.backend.policy.openvla.OpenVLAPolicy.act

OpenVLAPolicy.act(handle: PolicyHandle, payload: Any, instruction: str, model_kwargs: Dict[str, Any] | None = {}) → List[float]

Get action prediction for a single observation.

Sends a visual observation and language instruction to the OpenVLA model and receives a predicted action. The action is unnormalized using dataset statistics specified by unnorm_key.

IMPORTANT: OpenVLA requires unnorm_key to be specified. The model outputs normalized actions that must be converted to the target action space using dataset-specific statistics. Without unnormalization, the actions cannot be executed on real robots or simulators.

Parameters:

handle – Policy handle for the running container.
payload – Observation payload containing ‘image’ key with image data.
instruction – Natural language instruction for the task.
model_kwargs – Model-specific parameters. Must contain ‘unnorm_key’. (REQUIRED). Examples: ‘libero_spatial’, ‘bridge’, ‘fractal’.

Returns:

Predicted action as list of floats, unnormalized to target space.