IffYuanother

Embodied-R1-3B-v1

A 3B vision-language model for general robotic manipulation that introduces a Pointing mechanism and uses Reinforced Fine-tuning to bridge perception and action with strong zero-shot generalization in embodied tasks.

Downloads1K

Technical Profile

Modalities
rgblanguage
Environment
lab
Task Types
manipulationvisual_target_groundingreferring_region_groundingopen_form_grounding
License
other
Part of the Embodied-R1-3B-v1 family

Community Signals

Top 25% by downloads

Access

Need custom rgb data?

Claru builds purpose-built datasets for lab applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets