riverfog72025MIT

FuSe

A multimodal robotics dataset containing 26,866 trajectories collected on a WidowX robot with visual, tactile, sound, and action data annotated with natural language instructions.

Downloads421
Episodes26866

Why This Matters for Physical AI

This dataset enables training of generalist robot policies that leverage heterogeneous sensor modalities grounded in natural language, advancing multi-modal perception and language-conditioned manipulation in robotics.

Technical Profile

Modalities
rgbtactileaudioproprioception
Robot Embodiments
WidowX
Environment
lab
Episodes
26866
Annotation Types
language_instructions
License
MIT
Part of the FuSe family

Community Signals

Top 50% by downloads

Access

Need custom rgb data?

Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets