riverfog7 · 2025 · MIT
FuSe
A multimodal robotics dataset containing 26,866 trajectories collected on a WidowX robot with visual, tactile, sound, and action data annotated with natural language instructions.
Downloads: 421
Episodes: 26,866
Why This Matters for Physical AI
This dataset supports training generalist robot policies that combine heterogeneous sensor modalities (vision, touch, sound, and proprioception) grounded in natural language, advancing multimodal perception and language-conditioned manipulation in robotics.
Technical Profile
- Modalities: rgb, tactile, audio, proprioception
- Robot Embodiments: WidowX
- Environment: lab
- Episodes: 26,866
- Annotation Types: language_instructions
- License: MIT
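The profile above lists modalities and annotation types but not how they are stored. As a rough illustration only, the sketch below shows one way an episode record covering those fields might be modeled in Python. Every name here (`Step`, `Episode`, `missing_modalities`, the field names, and the payload types) is a hypothetical choice made for this example, not the dataset's actual schema or loader API.

```python
from dataclasses import dataclass, field
from typing import List

# Modalities named in the Technical Profile.
MODALITIES = ("rgb", "tactile", "audio", "proprioception")


@dataclass
class Step:
    """One timestep. Field names and payload types are assumptions, not the real schema."""
    rgb: bytes                   # encoded camera frame (placeholder type)
    tactile: bytes               # encoded tactile reading (placeholder type)
    audio: List[float]           # audio samples for this step (placeholder type)
    proprioception: List[float]  # robot state vector (dimension not given by the card)
    action: List[float]          # commanded action (dimension not given by the card)


@dataclass
class Episode:
    """One trajectory paired with its natural language instruction."""
    language_instruction: str
    steps: List[Step] = field(default_factory=list)


def missing_modalities(step: Step) -> List[str]:
    """Return the names of modalities whose payload is empty in this step."""
    return [name for name in MODALITIES if not getattr(step, name)]
```

A check like `missing_modalities` can be useful when filtering episodes for a policy that expects every sensor stream to be present; whether any streams are actually absent in this dataset is not stated on the card.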
Community Signals
Top 50% by downloads