Ximeng0831apache-2.0

Image-Text-Point Cloud Triplets Dataset (CTP)

Name: Image-Text-Point Cloud Triplets Dataset (CTP)
Creator: Ximeng0831
License: apache-2.0
Keywords: rgb, point_cloud, language, zero-shot-image-classification, autonomous-driving

A multimodal dataset of aligned image, natural language, and 3D LiDAR point cloud triplets curated from nuScenes and KITTI for contrastive tensor pre-training and unified multimodal representation learning.

Downloads150

Technical Profile

Modalities: rgbpoint_cloudlanguage
Environment: autonomous-driving
Task Types: zero-shot-image-classification
Data Format: jsonl
License: apache-2.0

Part of the Image-Text-Point Cloud Triplets Dataset (CTP) family

Access

View on HuggingFace

Need custom rgb data?

Claru builds purpose-built datasets for autonomous-driving applications with dense human annotations and quality assurance.

Request a Sample Pack

Related Datasets

LIBERO 90 LeRobot v3

A manipulation dataset with 4,500 episodes containing dual RGB observations (head and wrist cameras), proprioceptive state, and end-effector delta actions for robotic manipulation tasks.

rgbproprioception

null downloadsJul 2026apache-2.0

Pickup Single Cotton V2 TsFile

A TsFile conversion of a LeRobot v2.1 Panda follower robotics dataset for pick and place sorting tasks of medical equipment. The dataset contains time-series action and observation state data from 50 episodes with visual data available in the original source dataset.

proprioceptionrgb

null downloadsJul 2026apache-2.0

pick_place5_20260707_132345

A robotics dataset created using LeRobot containing pick and place manipulation tasks performed by a robot arm with gripper, including RGB video observations from multiple camera viewpoints and proprioceptive state information.

rgbproprioception

null downloadsJul 2026apache-2.0

PhaForce Dataset

A real-robot contact-rich manipulation dataset containing 260 trajectories across four tasks (open drawer, plug in charger, plug in USB, wiping board) with visual and force/tactile feedback for training phase-scheduled visual-force policies.

rgbforce_torqueproprioceptiontactile

null downloadsJul 2026cc-by-nc-4.0

ABC-130k

The largest open-source robot teleoperation dataset containing bimanual manipulation trajectories collected on two-arm YAM stations with 130,822 episodes across 3,555 hours of data.

rgbproprioception

653K downloadsJul 2026apache-2.0

Hy-Embodied-0.5-VLA-Data

A large-scale bimanual manipulation dataset with 2,163 hours of high-fidelity demonstrations collected via custom fingertip UMI device with optical motion-capture, spanning 70+ manipulation tasks for training Vision-Language-Action foundation models.

rgbproprioceptionlanguage

222K downloadsJul 2026cc-by-4.0