guohaoli2000apache-2.0

HarnessBench-CN-v0.1

Name: HarnessBench-CN-v0.1
Creator: guohaoli2000
License: apache-2.0
Keywords: language, decision-making, tool-use, permission_gating, memory_operations, context_handling, failure_recovery, refusal_abort

A Chinese-first benchmark for evaluating robot harness decision behavior in scenarios involving permissions, memory operations, context handling, failure recovery, and refusal/abort decisions.

Downloads27
Likes1

Technical Profile

Modalities: language
Task Types: decision-makingtool-usepermission_gatingmemory_operationscontext_handlingfailure_recoveryrefusal_abort
Data Format: JSONL
License: apache-2.0

Part of the HarnessBench-CN-v0.1 family

Access

View on HuggingFace

Need custom language data?

Claru builds purpose-built datasets for any environment applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets

Xperience-10M

A large-scale egocentric multimodal dataset of human experience containing 10 million interactions and 10,000 hours of synchronized first-person recordings with six video streams, audio, stereo depth, camera pose, hand mocap, full-body mocap, IMU, and hierarchical language annotations for embodied AI, robotics, and world modeling research.

rgbaudiodepthproprioception+3

2.3M downloadsMar 2026other

OmniAction

A large-scale multimodal dataset for proactive robot manipulation comprising 141,162 episodes with cross-modal contextual instructions derived from spoken dialogue, environmental sounds, and visual cues rather than explicit commands.

rgbaudiolanguage

63K downloadsMar 2026cc-by-nc-4.0

ManiTwin-100K: Manipulation-Ready Digital Object Twins

A large-scale dataset of 100K manipulation-ready digital object twins with simulation-ready 3D meshes, physical properties, functional point annotations, grasp configurations, and language descriptions validated through physics-based simulation.

3d_meshlanguage

51K downloadsApr 2026cc-by-nc-4.0

OmniAction

A large-scale multimodal dataset for proactive robot manipulation comprising 141,162 episodes across 112 skills and 748 objects, enriched with audio, visual, and contextual instruction data for cross-modal intention recognition.

rgbaudiolanguage

46K downloadsMar 2026cc-by-nc-4.0

OmniAction

A large-scale multimodal dataset for proactive robot manipulation with 141,162 episodes covering contextual instruction following through spoken dialogue, environmental sounds, and visual cues.

rgbaudiolanguage

34K downloadsApr 2026cc-by-nc-4.0

ManiTwin-100K: Manipulation-Ready Digital Object Twins

point_cloudlanguage

18K downloadsMar 2026apache-2.0