ropedia-aiother

Xperience-10M

A large-scale egocentric multimodal dataset of human experience containing 10 million interactions and 10,000 hours of synchronized first-person recordings with six video streams, audio, stereo depth, camera pose, hand mocap, full-body mocap, IMU, and hierarchical language annotations for embodied AI, robotics, and world modeling research.

Downloads127K
Likes194

Technical Profile

Modalities
rgbaudiodepthproprioceptionlanguageimupoint_cloud
Environment
lab
Task Types
egocentric action recognitiontask predictionaction captioninghuman-object interactiondepth estimationhand pose estimationbody motion estimationimitation learning
Data Format
HDF5
License
other
Part of the Xperience-10M family

Community Signals

Access

Need custom rgb data?

Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets