Inverse Kinematics: Computing Joint Angles for Target End-Effector Poses

Q: Should learned policies output joint-space or end-effector-space actions?

End-effector space is preferred for cross-embodiment transfer because the same action (move 5cm forward) has consistent meaning across robots. Joint space is preferred for single-embodiment policies where maximum performance matters, since it avoids IK-related issues at deployment. Modern systems like ACT support both and the choice depends on deployment requirements.

Q: What happens when the IK solver has no solution?

No IK solution means the target end-effector pose is outside the robot's reachable workspace or violates joint limits. When a learned policy commands an unreachable pose, the IK solver either fails (causing the robot to stop) or returns the nearest reachable pose (causing tracking error). Policies should be trained with data that stays within the workspace to avoid this.

Q: Why do 7-DoF arms have redundant IK solutions?

A 6-DoF task (position + orientation) can theoretically be solved by a 6-DoF arm with a unique solution. A 7-DoF arm has one extra degree of freedom, meaning infinitely many joint configurations achieve the same end-effector pose. The redundancy is typically resolved by optimizing a secondary objective like avoiding joint limits, singularities, or obstacles.

Inverse kinematics (IK) is the mathematical problem of computing the joint angles or configurations that position a robot's end-effector at a desired pose in Cartesian space. IK is foundational to all robot manipulation — every motion that places a gripper at a target location requires solving an IK problem, whether analytically, numerically, or implicitly through a learned policy.

What Is Inverse Kinematics?

Inverse kinematics (IK) is the mathematical problem of computing the joint configuration of a robotic manipulator that places its end-effector at a specified position and orientation in Cartesian space. For a serial kinematic chain with n joints, IK maps from the 6-dimensional task space (3 translation + 3 rotation) to the n-dimensional joint space. When n > 6, the system is redundant and has infinitely many solutions; when n = 6, solutions are typically discrete; when n < 6, the system is under-actuated and cannot reach arbitrary poses.

The mathematical formulation involves inverting the forward kinematics function. Forward kinematics computes the end-effector pose T as a function of joint angles q: T = FK(q). IK seeks q such that FK(q) = T_target. For serial manipulators, FK is a product of homogeneous transformation matrices, one per joint. The inverse is non-trivial because FK is non-linear and may be non-injective (multiple q map to the same T).

Three solution strategies dominate: analytical methods that exploit specific kinematic structures to derive closed-form solutions (fast but only available for certain robot geometries), numerical methods that iteratively minimize the pose error using Jacobian-based optimization (general but slower), and sampling-based methods that search the joint space for valid configurations (most general but slowest). In practice, most modern robot arms have analytical IK solutions available through tools like IKFast, with numerical methods as fallback for singularity regions.

Historical Context

Inverse kinematics has been a central problem in robotics since the field's inception. Pieper (1968) provided one of the first systematic treatments of IK for serial manipulators, deriving closed-form solutions for specific kinematic configurations. The Denavit-Hartenberg convention (1955) standardized the representation of kinematic chains, enabling systematic IK derivation.

The development of numerical IK solvers progressed through the Jacobian transpose method, the pseudoinverse method, and damped least squares (Wampler, 1986; Nakamura & Hanafusa, 1986). These methods handle arbitrary kinematic structures but can be slow and may converge to local minima. TRAC-IK (Beeson & Ames, 2015) combined sequential quadratic programming with KDL's Newton-Raphson solver, running both in parallel and returning whichever finds a solution first.

The advent of learned manipulation policies has partially sidestepped the IK problem. Policies that output joint-space actions learn the inverse mapping implicitly. However, the shift toward end-effector-space actions for cross-embodiment transfer has renewed the importance of robust IK solvers in deployment pipelines.

Practical Implications

For teams building manipulation training pipelines, IK configuration directly affects data quality. During teleoperation, the IK solver determines how the operator's hand motions translate to robot joint motions. Solver inconsistencies between sessions — using different solution branches, different redundancy resolution strategies, or different joint limit handling — introduce artificial variation that the learning algorithm may interpret as meaningful.

Standard practice is to fix the IK solver and all its parameters (seed configuration, redundancy resolution method, joint limit margins) across all data collection sessions for a given dataset. This configuration should be documented and shipped with the dataset so downstream users can replicate the exact IK behavior if needed.

Claru documents the IK configuration used during data collection for every dataset, records both joint-space and end-effector-space action labels, and validates consistency between the two representations. This enables teams to train in either action space without concern about IK-related artifacts.

Common Misconceptions

MYTH

Learned policies eliminate the need to understand IK.

FACT

Policies that output end-effector actions still require IK at deployment time to convert those actions to joint commands. Understanding IK failure modes (singularities, workspace limits, discontinuous solutions) is essential for debugging deployment issues even with learned policies.

MYTH

IK always has a unique solution.

FACT

For redundant manipulators (7+ DoF), IK has infinitely many solutions. Even for 6-DoF arms, most configurations have multiple discrete solutions (elbow-up vs. elbow-down, wrist-flip configurations). The choice of solution affects motion smoothness, joint limit proximity, and collision avoidance.

MYTH

Joint-space action policies are always better than end-effector-space policies.

FACT

Joint-space policies avoid IK issues but are inherently embodiment-specific — they cannot transfer between robots with different kinematic structures. End-effector-space policies with IK enable cross-embodiment transfer and are the standard for generalist robot policies.

Key Papers

[1]Beeson & Ames. “TRAC-IK: An Open-Source Library for Improved Solving of Generic Inverse Kinematics.” IEEE-RAS Humanoids 2015, 2015. Link
[2]Diankov. “Automated Construction of Robotic Manipulation Programs.” PhD Thesis, CMU 2010, 2010. Link
[3]Buss. “Introduction to Inverse Kinematics with Jacobian Transpose, Pseudoinverse and Damped Least Squares Methods.” Technical Report, UCSD 2004, 2004. Link

How Claru Supports This

Claru records both joint-space and end-effector-space action labels in all demonstration datasets, with documented IK configurations. This dual representation supports flexible training paradigms while ensuring IK consistency across data collection sessions.

What Is Inverse Kinematics?

Inverse kinematics (IK) is the computational problem of determining the joint configuration of a robot that places its end-effector at a specified position and orientation in 3D space. Given a target pose — a 3D position (x, y, z) and a 3D orientation (roll, pitch, yaw or quaternion) — the IK solver computes the angles of each joint in the robot's kinematic chain that achieve that pose. For a typical 7-DoF manipulator like the Franka Panda, IK maps from 6D task space (3 translation + 3 rotation) to 7D joint space.

The forward kinematics problem — computing the end-effector pose given joint angles — is straightforward: multiply the transformation matrices for each link in the chain. Inverse kinematics is harder because the mapping from task space to joint space is generally non-linear, may have multiple solutions (redundancy), and may have no solution (the target is outside the reachable workspace). A 7-DoF arm has infinite IK solutions for most reachable poses because it has one degree of redundancy.

In the context of learned manipulation policies, IK plays two distinct roles. For policies that output end-effector space actions (position and orientation deltas), an IK solver converts these commands to joint-level commands that the robot controller executes. For policies that output joint-space actions directly, IK is implicit — the policy has learned the inverse mapping through training data. Understanding IK is essential for designing action spaces, interpreting policy behavior, and debugging deployment failures.

Inverse Kinematics at a Glance

6-DoF

Minimum joints for full IK

7-DoF

Standard redundant arm

<1ms

Analytical IK solve time

10-50ms

Numerical IK solve time

IKFast

Popular analytical solver

TRAC-IK

Popular numerical solver

IK Solvers in Robot Learning Pipelines

When a learned policy outputs end-effector actions (the dominant paradigm for cross-embodiment transfer), those actions must be converted to joint commands through IK. The choice of IK solver affects policy behavior: analytical solvers (IKFast) are fast but may produce discontinuous joint motions; numerical solvers (TRAC-IK, Levenberg-Marquardt) are smoother but slower; and optimization-based solvers can incorporate additional objectives like minimizing joint velocity or avoiding joint limits.

For teleoperation data collection — the primary source of manipulation training data — IK determines how the operator's hand motions are mapped to robot joint motions. The IK configuration (which solution branch to use, how to handle redundancy) affects the resulting demonstrations. Inconsistent IK settings between demonstrations can introduce spurious variation that confuses the learning algorithm. Standardizing the IK solver and its parameters across all data collection sessions is a critical data quality measure.

Some modern approaches bypass explicit IK entirely. Policies that output joint-space actions learn the inverse mapping implicitly from demonstrations. Diffusion Policy and ACT, when configured with joint-space action outputs, directly predict joint angle trajectories without any IK solver in the loop. This simplifies the deployment pipeline but couples the policy to a specific robot morphology, making cross-embodiment transfer harder.

IK Approaches Compared

Different inverse kinematics strategies used in robot manipulation systems.

Method	Speed	Smoothness	Constraints	Use Case
Analytical (IKFast)	<1ms	May have discontinuities	Joint limits only	Real-time control
Numerical (TRAC-IK)	10-50ms	Good	Joint limits, preferences	Motion planning
Optimization-based	50-200ms	Best	Full (collision, limits, velocity)	Complex scenes
Learned (implicit in policy)	Policy inference time	Depends on training	Learned from data	End-to-end learning

IK Considerations for Training Data

The choice of action space in training data — joint-space vs. end-effector space — has significant implications for IK. Joint-space action data records the actual joint angles commanded at each timestep, capturing the specific IK solution used during data collection. End-effector space action data records the target pose, abstracting away the IK solution. Joint-space data is embodiment-specific but avoids IK-related artifacts at deployment; end-effector data enables cross-embodiment transfer but requires solving IK at deployment time.

For teams collecting demonstration data, recording both representations simultaneously is recommended: joint positions from the robot's encoders and end-effector pose from forward kinematics. This dual recording enables training policies in either action space without re-collecting data. It also provides a validation check — the forward kinematics of the recorded joint positions should match the recorded end-effector poses.

Claru's demonstration datasets include both joint-space and end-effector-space action labels, synchronized at the robot's native control frequency. This dual representation supports both embodiment-specific and cross-embodiment training paradigms, with consistent IK configurations documented for each dataset.

Key References

[1]Beeson & Ames. “TRAC-IK: An Open-Source Library for Improved Solving of Generic Inverse Kinematics.” IEEE-RAS Humanoids 2015, 2015. Link
[2]Diankov. “Automated Construction of Robotic Manipulation Programs.” PhD Thesis, CMU 2010, 2010. Link
[3]Buss. “Introduction to Inverse Kinematics with Jacobian Transpose, Pseudoinverse and Damped Least Squares Methods.” Technical Report, UCSD 2004, 2004. Link

Frequently Asked Questions

End-effector space is preferred for cross-embodiment transfer because the same action (move 5cm forward) has consistent meaning across robots. Joint space is preferred for single-embodiment policies where maximum performance matters, since it avoids IK-related issues at deployment. Modern systems like ACT support both and the choice depends on deployment requirements.

No IK solution means the target end-effector pose is outside the robot's reachable workspace or violates joint limits. When a learned policy commands an unreachable pose, the IK solver either fails (causing the robot to stop) or returns the nearest reachable pose (causing tracking error). Policies should be trained with data that stays within the workspace to avoid this.

A 6-DoF task (position + orientation) can theoretically be solved by a 6-DoF arm with a unique solution. A 7-DoF arm has one extra degree of freedom, meaning infinitely many joint configurations achieve the same end-effector pose. The redundancy is typically resolved by optimizing a secondary objective like avoiding joint limits, singularities, or obstacles.

Related Resources

Cross Embodiment Transfer→

Glossary

Proprioceptive Data→

Guide

How To Collect Teleoperation Data→

Solution

Manipulation Trajectory Data→

Need Robot Manipulation Training Data?

Claru provides demonstration datasets with both joint-space and end-effector-space action labels, supporting flexible IK configurations for your training pipeline.

Get in Touch Browse the Data Catalog