FactoryBench
A benchmark for evaluating machine-behavior reasoning in time-series models and LLMs over industrial robotic telemetry, organized along Pearl's causal hierarchy with 70,918 question-answer pairs across four levels of reasoning.
Downloads: 34
Technical Profile
- Modalities: time-series, proprioception
- Robot Embodiments: UR3, KUKA KR10
- Environment: lab, industrial
- Task Types: pick_and_place, screwing, peg_in_hole
- Data Format: parquet, JSONL
- License: MIT
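Since part of the data ships as JSONL, a first look at the question-answer pairs could go roughly as sketched below. This is a minimal illustration under stated assumptions, not the dataset's documented loader: the field names `level`, `question`, and `answer`, the integer level encoding, and the sample records themselves are all hypothetical, so check the actual files for the real schema before relying on any of them.

```python
import json
from collections import Counter

# Hypothetical records: the field names "level", "question", and "answer"
# are assumptions for illustration, not the published FactoryBench schema.
SAMPLE_JSONL = """\
{"level": 1, "question": "Is joint 3 torque above its usual range?", "answer": "yes"}
{"level": 1, "question": "Did the gripper close?", "answer": "no"}

{"level": 3, "question": "What if the payload were lighter?", "answer": "lower torque"}
"""


def parse_jsonl(text):
    """Parse JSON Lines text into a list of dicts, skipping blank lines."""
    return [json.loads(line) for line in text.splitlines() if line.strip()]


def count_by_level(records):
    """Count question-answer pairs per (assumed) reasoning-level field."""
    return dict(Counter(record["level"] for record in records))


records = parse_jsonl(SAMPLE_JSONL)
print(count_by_level(records))
```

For a real file, the same `parse_jsonl` helper can be fed the result of reading the file as text; the Parquet portion would need a separate reader such as pandas or pyarrow, which this sketch deliberately leaves out.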