shreetharother
FYP Stage 2 — VLA Pre-training Subset
A compact ~90 GB multi-source dataset for Vision-Language-Action (VLA) pre-training, aggregating 8 upstream robotics sources with pre-materialized 448×448 images and inline storage for single-call dataset loading.
Downloads230
Technical Profile
- Modalities
- rgblanguage
- Environment
- simulation
- Task Types
- trajectory_predictionaffordance_detectiontask_planningvisual_question_answeringimage_captioningfailure_analysis
- Data Format
- HuggingFace Datasets
- License
- other
Access
Need custom rgb data?
Claru builds purpose-built datasets for simulation applications with dense human annotations and quality assurance.
Request a Sample Pack