Cable Routing Training Data

Deformable linear object manipulation datasets — cable routing, wire harness assembly, hose connection, and rope manipulation with cable state estimation and deformation-aware path planning annotations.

Data Requirements

Modality

RGB-D multi-view + cable state tracking + force/torque + tactile

Volume Range

1K-10K routing demonstrations per cable type

Temporal Resolution

30 Hz video, 15 Hz cable state estimation, 100 Hz F/T

Key Annotations

Cable centerline trajectory (ordered 3D point sequence)Routing waypoint completion stateClip engagement labels (per clip per frame)Cable tension estimationFailure mode classificationCable material properties and type identifier

Compatible Models

Diffusion PolicyACTTransporter NetworksDLO-specific RLOpenVLA (fine-tuned)Model Predictive Control (learned dynamics)

Environment Types

Automotive harness benchServer rackAerospace avionics panelIndustrial control cabinetSurgical catheter stationData center cabling bay

How Claru Supports This Task

Claru operates purpose-built cable routing collection stations equipped with calibrated multi-camera rigs (3+ RGB-D cameras), modular routing fixtures, and a library of 20+ cable types spanning rubber, PVC, braided, fiber optic, and multi-conductor sheathed variants. Our collectors are trained on teleoperation interfaces for both single-arm and bimanual routing tasks, and our annotation pipeline produces cable centerline state vectors at 15 Hz using learned multi-view tracking with human verification at 1-second intervals. Each dataset delivery includes per-frame cable state, routing waypoint annotations, clip engagement labels, tension estimates, failure mode classifications, and cable material property metadata — all verified through automated consistency checks and 20% human spot-verification. We support automotive harness assembly, data center cable management, aerospace avionics routing, and custom industrial scenarios, delivering in RLDS, HDF5, or Zarr format with full sensor calibration files and fixture geometry specifications. A typical 5,000-demonstration dataset covering 5 cable types and 3 fixtures ships in 4-6 weeks.

What Is Cable Routing and Why Does It Need Specialized Data?

Cable routing is the task of manipulating deformable linear objects (DLOs) — cables, wires, hoses, and ropes — through predetermined paths, clips, and connectors. Unlike rigid object manipulation, DLOs have infinite-dimensional configuration spaces: a 1-meter cable segment can assume an effectively unlimited set of shapes. This makes state estimation, planning, and control fundamentally harder than for rigid bodies. A robot routing a wire harness through an automotive chassis must track the cable's full 3D shape, predict how it deforms under contact, and plan manipulation sequences that avoid tangling or over-bending.

The data challenge is acute because DLO physics are notoriously difficult to simulate accurately. Material properties like stiffness, friction, and torsional resistance vary across cable types, and contact interactions between a cable and routing clips involve complex multi-point constraints. Seita et al. (2021) demonstrated that policies trained on real cable manipulation data outperform simulation-only approaches by 25-40% on routing success rate, primarily because real-world cable dynamics include effects like internal friction, memory curvature, and non-uniform stiffness that simulators approximate poorly.

Industrial demand is concentrated in automotive wire harness assembly (a $70B global market), server rack cable management, aerospace avionics routing, and surgical catheter insertion. BMW and Toyota have both published on automating wire harness installation, identifying data collection as the primary bottleneck — a single harness variant requires 200-500 demonstrations to cover the space of initial cable configurations and routing path variations. The automotive industry alone produces over 80 million vehicles per year, each containing 2-5 km of wiring harness routed through hundreds of clips and connectors.

The research community has identified three core capabilities required for learned cable routing: robust DLO state estimation (tracking the full 3D shape in real time), deformation prediction (forecasting how the cable will move under manipulation), and contact-aware planning (routing through clips without tangling or snagging). Each capability requires specialized training data. State estimation needs multi-view video with ground-truth cable centerline annotations. Deformation prediction needs paired before-after observations of cable manipulation actions. Contact-aware planning needs demonstrations through real fixtures with annotated clip engagement states and routing waypoints. No single existing dataset covers all three requirements, making custom data collection essential for production cable routing systems.

Cable Routing Data at a Glance

1K-10K

Demonstrations per cable type

30 Hz

Multi-view video capture

15 Hz

Cable state tracking rate

25-40%

Real > sim success gap (Seita 2021)

$70B

Global wire harness market

2-5 km

Wiring per vehicle

Data Requirements by Approach

Cable routing methods differ significantly in their data needs depending on the learning paradigm.

Approach	Data Volume	Key Modalities	Annotation Depth	Sim-to-Real Gap
Behavioral Cloning	500-2K demos	RGB + cable state	Waypoints + actions	High
Diffusion Policy	200-1K demos	RGB-D + proprioception	Action chunks	Moderate
RL from Demonstrations	1K-5K demos + 100K sim	Point cloud + F/T	Reward labels	Low (hybrid)
Foundation Model Fine-tuning	5K-10K demos	RGB multi-view + language	Full episode annotations	Moderate
Model Predictive Control (learned dynamics)	2K-5K interaction sequences	RGB-D + cable state + F/T	State transitions + actions	Low (learns real dynamics)

State of the Art in Cable Manipulation

The leading methods for learned cable routing build on two core capabilities: robust DLO state estimation and deformation-aware action prediction. For state estimation, Yan et al. (2020) introduced a learned cable tracking system using multi-view RGB that reconstructs the cable centerline as an ordered sequence of 3D waypoints at 15 Hz, achieving sub-centimeter accuracy on cables up to 2 meters. This representation feeds directly into policy architectures that predict manipulation actions conditioned on the current cable configuration. More recent work from Chi et al. (2024) uses diffusion-based tracking that handles occlusions and self-crossings — the hardest failure modes for earlier tracking methods.

On the policy side, Chi et al. (2023) showed that Diffusion Policy achieves 85% success rate on cable routing tasks with just 200 demonstrations, compared to 62% for ACT and 45% for standard behavioral cloning on the same dataset. The key advantage is Diffusion Policy's ability to represent the multimodal action distributions inherent in cable manipulation — there are often multiple valid ways to route a cable segment, and unimodal policies collapse to an average that fails. However, scaling beyond single cable types to production-grade multi-variant routing still requires 5-10x more demonstrations than the academic benchmarks suggest.

Recent work from Luo et al. (2024) on DLO manipulation with tactile feedback demonstrates that adding GelSight-based contact sensing reduces routing failure rates by 30% compared to vision-only policies, particularly for tasks requiring precise insertion of cables into clips or connectors. The tactile signal provides direct feedback on whether the cable is properly seated in a clip — a binary determination that vision-only approaches often misjudge due to occlusion. This suggests that next-generation cable routing datasets should include synchronized tactile data alongside visual observations.

For long-cable routing — harnesses exceeding 1 meter with 10+ routing waypoints — the challenge shifts from single-step manipulation to sequential planning. The robot must decide which end of the cable to manipulate first, plan a routing order that avoids creating loops or tangles, and maintain previously routed segments while working on new ones. This is fundamentally a combinatorial planning problem layered on top of continuous manipulation control. Demonstrations for long-cable routing must capture not just individual routing actions but the full strategy: initial cable layout assessment, routing order selection, intermediate verification steps, and recovery from routing errors.

Collection Methodology for Cable Routing Data

Effective cable routing data collection requires a purpose-built setup. The collection station needs a fixed multi-camera rig (minimum 3 calibrated RGB-D cameras) providing overlapping views of the workspace, a standardized set of cable types varying in stiffness, diameter, and surface friction, and a routing fixture with clips, channels, and connectors matching the target deployment environment. Operators collect demonstrations using teleoperation interfaces — bilateral leader-follower arms for bimanual routing tasks, or 3D SpaceMouse for single-arm scenarios.

The critical annotation layer is cable state representation. Each frame requires the cable centerline as an ordered point sequence (typically 50-200 points depending on cable length), plus metadata including the cable type identifier, which clips the cable is currently engaged with, and the estimated tension along the cable. Routing waypoints — the key intermediate states the cable should pass through — are annotated post-hoc and serve as checkpoints for evaluating policy progress. For production datasets, automated cable tracking provides the frame-by-frame centerline while human annotators verify and correct at 1-second intervals.

Diversity axes that matter most for cable routing are: cable material properties (rubber, PVC, braided, fiber optic, sheathed multi-conductor), routing fixture geometry (clip spacing, channel depth, connector type, entry angle), initial cable configuration (coiled, tangled, pre-routed partial, straight), and lighting conditions (shadows in channels, reflective connectors, dark cables on dark backgrounds). Claru's collection protocol requires a minimum of 5 cable types and 3 fixture variations per collection campaign, with operator rotation every 100 demonstrations to prevent style bias in the teleoperation trajectories.

Failure demonstration collection is deliberately structured in cable routing data. Common failure modes — cable snagging on a clip edge, creating an unintended loop, over-tensioning causing the cable to pop out of a clip, and routing in the wrong order creating an inaccessible segment — are all annotated with failure mode labels and the point in the trajectory where the error became unrecoverable. These negative examples are essential for training policies that detect and recover from routing errors rather than simply executing a nominal trajectory. Claru's protocol targets 20-30% deliberate failure demonstrations per collection campaign.

Key Datasets for Cable Routing Research

Dataset	Year	Scale	Modalities	Cable Types	Limitation
MIT Cable Dataset	2021	500 demos	RGB-D + state	3 rubber	Single fixture, limited diversity
DLO-Manipulation	2022	1.2K demos	RGB + point cloud	5 types	2D routing only, no 3D paths
RoboWire (Toyota)	2023	3K demos	RGB-D + F/T + tactile	8 harness variants	Proprietary, not publicly available
DeformableRavens (Seita et al.)	2021	1K demos	RGB-D + actions	Rope only	Transporter network format, planar only
Claru Custom	2026	1K-10K+ demos	RGB-D + state + F/T + tactile	Configurable (20+ in library)	Built to specification

How Claru Supports Cable Routing Data Needs

Claru operates purpose-built cable routing collection stations equipped with calibrated multi-camera rigs (3+ RGB-D cameras per station), standardized routing fixtures, and a library of 20+ cable types spanning rubber, PVC, braided, fiber optic, and multi-conductor sheathed variants. Our fixtures are modular — clip rails, channel boards, and connector panels can be reconfigured to match client-specific routing layouts within hours, not days.

Our collectors are trained on teleoperation interfaces for both single-arm and bimanual routing tasks, and our annotation pipeline produces cable centerline state vectors at 15 Hz using learned multi-view tracking with human verification at 1-second intervals. Each dataset delivery includes per-frame cable state, routing waypoint annotations, clip engagement labels, tension estimates, and failure mode classifications — all verified through automated consistency checks and 20% human spot-verification.

We support automotive harness assembly, data center cable management, aerospace avionics routing, and custom industrial routing scenarios. Datasets are delivered in RLDS, HDF5, or Zarr format with full sensor calibration files, cable material property sheets, fixture geometry specifications, and train/val/test splits stratified by cable type and fixture geometry. A typical 5,000-demonstration cable routing dataset covering 5 cable types and 3 fixture configurations ships in 4-6 weeks.

References

[1]Seita et al.. “Learning to Rearrange Deformable Cables, Fabrics, and Bags with Goal-Conditioned Transporter Networks.” ICRA 2021, 2021. Link
[2]Chi et al.. “Diffusion Policy: Visuomotor Policy Learning via Action Diffusion.” RSS 2023, 2023. Link
[3]Yan et al.. “Self-Supervised Learning of State Estimation for Manipulating Deformable Linear Objects.” IEEE RA-L 2020, 2020. Link
[4]Luo et al.. “Tactile-Driven Manipulation of Deformable Linear Objects.” CoRL 2024, 2024. Link
[5]Zhao et al.. “Learning Fine-Grained Bimanual Manipulation with Low-Cost Hardware.” RSS 2023, 2023. Link

Frequently Asked Questions

It depends on the policy architecture and cable variety. Diffusion Policy can achieve 85% success with 200 demonstrations on a single cable type and fixture. For production deployment across multiple cable types, expect 1,000-2,000 demonstrations per cable variant. Foundation model fine-tuning (e.g., OpenVLA for cable tasks) typically requires 5,000-10,000 demonstrations spanning the full cable and fixture diversity you plan to deploy against. Start with your most common cable-fixture combination and scale based on held-out evaluation.

DLO simulation suffers from three fundamental gaps: contact dynamics (cable-clip friction is hard to model accurately, especially for multi-point contacts), material properties (real cables have memory curvature, non-uniform stiffness, and internal friction that vary by manufacturer and even by batch), and visual realism (translucent sheaths, braided textures, and specular reflections on connectors). Seita et al. showed a 25-40% success rate gap between sim-only and real-data policies. The recommended approach is 70-80% simulation for pretraining combined with 20-30% real demonstrations for fine-tuning.

The standard representation is an ordered sequence of 3D waypoints along the cable centerline, typically 50-200 points depending on cable length. This can be extracted from multi-view RGB-D using learned trackers at 15 Hz. For policies that need deformation awareness, augment with per-segment curvature and estimated tension. Point cloud representations work for grasp-and-place approaches but lose the ordered structure critical for routing sequence planning. Newer methods also include cable cross-section orientation (twist) at each waypoint, which matters for connectors that require specific cable rotation.

Yes. Failed routing attempts — tangling, missed clips, over-tension breaks, wrong routing order — are valuable training signal. A healthy dataset contains 20-30% failure cases annotated with failure mode labels and the point in the trajectory where the error became unrecoverable. This enables learning both what to do and what to avoid, and supports reward learning for RL-based approaches. Recovery demonstrations — where the operator detects a routing error and corrects it — are especially valuable for training policies that can self-correct during deployment.

RLDS format provides the best compatibility with foundation models (OpenVLA, Octo, RT-X). Each episode should include synchronized multi-view RGB-D, cable state vectors (ordered 3D waypoint sequences), end-effector poses, and gripper actions. HDF5 works well for single-lab use with Diffusion Policy and ACT. For the cable state annotation layer specifically, a separate time-indexed array of 3D point sequences is standard, with metadata linking each waypoint to the nearest routing fixture element. Claru delivers in all three formats with full sensor calibration metadata and cable material property specifications.

Many cable routing tasks are inherently bimanual: one hand holds the cable taut or guides it while the other pushes it into a clip. Single-arm routing often results in the cable springing back out of clips because there is no stabilizing hand. Bimanual teleoperation using ALOHA-style leader-follower setups captures the natural two-handed coordination humans use for cable work — one hand anchoring, the other routing. ALOHA-collected bimanual routing demonstrations achieve 20-30% higher downstream policy success rates compared to single-arm collection on the same tasks.

Related Resources

Glossary

Manipulation Trajectory→

How To Build A Manipulation Dataset→

Get a Custom Quote for Cable Routing Data

Share your cable type specifications, routing fixture details, and target deployment environment. We will design a collection plan covering the exact diversity your policy needs.

Get in Touch Browse the Data Catalog