rad1d1m123 · cc-by-nc-4.0

OmniAction

A large-scale multimodal dataset for proactive robot manipulation comprising 141,162 episodes with cross-modal contextual instructions derived from spoken dialogue, environmental sounds, and visual cues rather than explicit commands.

Downloads: 71K

Technical Profile

Modalities: rgb, audio, language
Environment: simulation, lab
Task Types: manipulation, instruction_following
Data Format: RLDS
License: cc-by-nc-4.0
Part of the OmniAction family
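
The profile above lists RLDS as the data format. RLDS organizes data as episodes, each holding a sequence of steps with fields such as observation, action, reward, is_first, is_last, and is_terminal. The sketch below builds a toy episode in that layout using plain Python so the nesting is visible. The observation keys (rgb, audio, language), the 7-dimensional action, and the helper names are assumptions made for illustration, not the dataset's actual schema, which should be checked against the published dataset before use.

```python
from typing import Any, Dict, Iterator, List

Step = Dict[str, Any]
Episode = Dict[str, Any]


def make_toy_episode(num_steps: int = 3) -> Episode:
    """Build a placeholder episode in the RLDS episode -> steps layout.

    The observation keys and the action size are hypothetical; they only
    mirror the modalities named in the profile (rgb, audio, language).
    """
    steps: List[Step] = []
    for t in range(num_steps):
        steps.append(
            {
                "observation": {
                    "rgb": [[0, 0, 0]],   # placeholder for an image frame
                    "audio": [0.0],       # placeholder for an audio chunk
                    "language": "",       # placeholder for dialogue text
                },
                "action": [0.0] * 7,      # hypothetical 7-D action vector
                "reward": 0.0,
                "is_first": t == 0,
                "is_last": t == num_steps - 1,
                "is_terminal": False,
            }
        )
    return {"steps": steps}


def iter_steps(episodes: List[Episode]) -> Iterator[Step]:
    """Yield every step from every episode, in order."""
    for episode in episodes:
        for step in episode["steps"]:
            yield step


if __name__ == "__main__":
    episodes = [make_toy_episode(3), make_toy_episode(2)]
    total = sum(1 for _ in iter_steps(episodes))
    print(f"{len(episodes)} episodes, {total} steps")
```

With the real data, the same two-level loop (over episodes, then over each episode's steps) would apply, with the toy builder replaced by whatever loader the dataset's distribution provides.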

Community Signals

Top 1% by downloads

Related Datasets