IffYuan · other
Embodied-R1-3B-v1
A 3B-parameter vision-language model for general robotic manipulation. It introduces a pointing mechanism and uses reinforced fine-tuning to bridge perception and action, with reported strong zero-shot generalization on embodied tasks.
Downloads: 1K
Technical Profile
- Modalities: rgb, language
- Environment: lab
- Task Types: manipulation, visual_target_grounding, referring_region_grounding, open_form_grounding
- License: other
Community Signals
Top 25% by downloads