rad1d1m123cc-by-nc-4.0
OmniAction
A large-scale multimodal dataset for proactive robot manipulation comprising 141,162 episodes with cross-modal contextual instructions derived from spoken dialogue, environmental sounds, and visual cues rather than explicit commands.
Downloads: 71K
Technical Profile
- Modalities: rgb, audio, language
- Environment: simulation, lab
- Task Types: manipulation, instruction_following
- Data Format: RLDS
- License: cc-by-nc-4.0
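Because the data format is listed as RLDS, each episode can be read as a sequence of steps carrying fields such as observation, action, is_first, is_last, and is_terminal. The following is a minimal sketch in plain Python of that layout, built from a toy in-memory episode. The observation keys (rgb, audio, language) are hypothetical placeholders that mirror the modalities listed above, and the 7-element action vector is an assumed size; the actual feature names and shapes in OmniAction are not stated here and may differ.

```python
from typing import Any, Dict, List

Step = Dict[str, Any]
Episode = Dict[str, List[Step]]


def make_toy_episode(num_steps: int = 3) -> Episode:
    """Build a toy episode that follows the RLDS step layout.

    Observation keys and the action size are hypothetical placeholders,
    not the dataset's real schema.
    """
    steps: List[Step] = []
    for t in range(num_steps):
        steps.append({
            "observation": {
                "rgb": [[0, 0, 0]],         # placeholder for an image frame
                "audio": [0.0],             # placeholder for an audio chunk
                "language": "toy context",  # placeholder for dialogue text
            },
            "action": [0.0] * 7,            # assumed action vector size
            "is_first": t == 0,
            "is_last": t == num_steps - 1,
            "is_terminal": t == num_steps - 1,
        })
    return {"steps": steps}


def summarize(episode: Episode) -> Dict[str, Any]:
    """Report step count, observation keys, and basic boundary flags."""
    steps = episode["steps"]
    return {
        "num_steps": len(steps),
        "modalities": sorted(steps[0]["observation"].keys()),
        "well_formed": steps[0]["is_first"] and steps[-1]["is_last"],
    }


summary = summarize(make_toy_episode())
print(summary)
```

The sketch avoids any loader dependency on purpose; with the real data, the same per-step fields would be read from the dataset's own records rather than constructed by hand.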
Community Signals
Top 1% by downloads