guanweifan

efficient-vla-extracts

Parser-derived artifacts from robotics and vision-language-action papers collected from arXiv, including PDF text, bounding boxes, markdown, and figure extraction for the efficient-vla-wiki project.

Downloads1

Why This Matters for Physical AI

This dataset provides structured extracted artifacts from robotics and vision-language-action research papers, enabling efficient paper lookup, evidence backtracking, and knowledge base maintenance for physical AI research infrastructure.

Technical Profile

Modalities
textimages
Task Types
text-retrievalquestion-answering
Data Format
jsonl
Annotation Types
textbounding_boxessection_localizationfigure_localization
Part of the efficient-vla-wiki family

Access

Need custom text data?

Claru builds purpose-built datasets for any environment applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets