guanweifan
efficient-vla-extracts
Parser-derived artifacts from robotics and vision-language-action papers collected from arXiv, including PDF text, bounding boxes, markdown, and figure extraction for the efficient-vla-wiki project.
Downloads1
Why This Matters for Physical AI
This dataset provides structured extracted artifacts from robotics and vision-language-action research papers, enabling efficient paper lookup, evidence backtracking, and knowledge base maintenance for physical AI research infrastructure.
Technical Profile
- Modalities
- textimages
- Task Types
- text-retrievalquestion-answering
- Data Format
- jsonl
- Annotation Types
- textbounding_boxessection_localizationfigure_localization
Access
Need custom text data?
Claru builds purpose-built datasets for any environment applications with dense human annotations and quality assurance.
Request a Sample Pack