Thermal Industrial Dataset

Paired thermal-RGB imaging from industrial environments for training predictive maintenance robots and safety monitoring systems. 25K+ clips across 10+ facility types with thermal anomaly and equipment health annotations.

Dataset at a Glance

25K+
Video clips
180+
Hours captured
10+ facility types
Environments
10+
Annotation layers

Why Thermal Industrial Data Matters for Robotics

Industrial inspection and predictive maintenance represent one of the largest near-term markets for autonomous robotics. Companies like Boston Dynamics (Spot), ANYbotics (ANYmal), and Flyability (Elios) deploy inspection robots in refineries, power plants, manufacturing floors, and data centers. These robots increasingly carry thermal cameras alongside RGB sensors because thermal imaging reveals equipment health information invisible to visible-light cameras: overheating bearings, electrical hot spots, insulation failures, steam leaks, and abnormal process temperatures.

Training thermal perception models is fundamentally different from RGB perception. Thermal images have lower spatial resolution, no color information in the visible sense, and a dynamic range determined by temperature differences rather than reflectance. Objects that are visually distinct in RGB may appear identical in thermal (two pipes at the same temperature), while objects that look identical in RGB may show dramatic thermal contrast (a failing bearing vs. a healthy one). Robots that combine thermal and RGB perception need training data where both modalities are aligned, time-synchronized, and annotated with the thermal-specific features that indicate equipment health.

Existing thermal datasets are overwhelmingly single-frame images from handheld surveys or surveillance cameras. They lack the temporal context that inspection robots need: how does a motor's thermal signature change during a startup sequence? What is the normal thermal gradient along a steam pipe, and how does it shift when insulation degrades? Claru's thermal industrial dataset captures continuous video from both thermal and RGB cameras simultaneously, preserving the temporal patterns that are essential for anomaly detection.

Research presented at the IEEE International Conference on Robotics and Automation (ICRA 2024) and the International Symposium on Industrial Electronics (ISIE 2023) demonstrates that multi-modal thermal-RGB inspection systems outperform single-modality approaches by 45-60% on industrial anomaly detection benchmarks, with the temporal dimension (video rather than snapshots) contributing an additional 20-30% improvement in early fault detection.

Sensor Configuration and Collection Methodology

The collection rig pairs a FLIR A700 radiometric thermal camera (640x480 LWIR, 7.5-14 micrometer spectral range, temperature accuracy within +/-2 degrees C) with a machine vision RGB camera (FLIR Blackfly S, 1920x1080, global shutter) on a rigid aluminum mounting bracket. Both cameras are factory-calibrated and the thermal-to-RGB geometric transformation is computed via checkerboard calibration using a heated calibration target visible in both modalities. Temporal synchronization is maintained within 2ms through hardware trigger.

Collection occurs in operational industrial facilities during normal production activities. Collectors -- trained thermographers holding Level I or Level II certifications from the Infrared Training Center or equivalent -- follow established inspection routes that cover critical equipment: electrical panels, motor-driven equipment (pumps, compressors, fans), steam and process piping, heat exchangers, boilers, transformers, switchgear, and HVAC systems. Each inspection route is documented with an equipment manifest and baseline thermal profiles.

The dataset spans 10+ industrial facility types: petrochemical refineries, power generation plants (gas turbine, steam, solar), water treatment facilities, food processing plants, pharmaceutical manufacturing, data centers, steel mills, cement plants, automotive manufacturing, and general industrial parks. Each facility type has distinct thermal signatures, ambient temperature ranges, and equipment configurations that industrial inspection robots must learn to interpret.

Environmental metadata for every session includes ambient temperature, humidity, wind speed (for outdoor equipment), solar loading conditions, equipment operational state (startup, steady-state, shutdown, maintenance), and the time since last maintenance for each inspected asset. This metadata enables researchers to build models that account for environmental factors when assessing thermal anomalies -- a motor running 10 degrees above ambient in a 40-degree C plant is very different from the same reading in a 20-degree C plant.

Comparison with Public Datasets

How Claru's thermal industrial dataset compares to publicly available alternatives for industrial inspection and predictive maintenance AI.

DatasetClipsHoursModalitiesEnvironmentsAnnotations
FLIR ADAS (2019)~26K imagesN/A (stills)Thermal + RGBDriving scenesVehicle, person detection
InfraParis (2023)~5K imagesN/A (stills)Thermal + RGBBuilding facadesBuilding segmentation
MVTec AD (2019)~5K imagesN/A (stills)RGB onlyFactory partsAnomaly masks
Claru Thermal Industrial25K+180+Thermal + RGB (video)10+ facility typesAnomalies, equipment ID, temperatures, severity, baselines

Annotation Pipeline and Quality Assurance

Thermal annotation requires specialist expertise that general-purpose annotators lack. Stage one automated processing extracts radiometric temperature data from the FLIR camera's native format, generates thermal-to-RGB alignment maps, and applies a baseline temperature model that flags pixels exceeding expected temperature ranges for each equipment category. Automated equipment segmentation uses DINOv2 on the RGB channel (which provides better spatial features for equipment identification than thermal alone).

Stage two human annotation is performed by ITC-certified thermographers who interpret thermal patterns in their industrial context. They annotate: equipment identification and type classification (50+ categories including motors, pumps, transformers, switchgear, valves, heat exchangers, and piping), thermal anomaly classification following NETA/ASTM standards (hot spot, cold spot, moisture intrusion, insulation failure, phase imbalance, overloaded circuit), severity rating on a 4-point scale (monitor, investigate, urgent, critical), and baseline comparison notes indicating whether the thermal signature represents normal operation, degradation, or acute failure.

Stage three QA is performed by Level II thermographers who verify anomaly classifications against the radiometric temperature data and equipment specifications. Severity ratings must achieve 100% agreement for critical-rated anomalies (these would trigger immediate shutdown in a real inspection). Overall inter-annotator agreement targets: 95%+ on equipment identification, 92%+ on anomaly classification, and 100% on critical severity flags. False negatives (missed anomalies) are treated more seriously than false positives in the QA process.

The complete annotation taxonomy covers 50+ equipment categories, 12 thermal anomaly types per NETA standards, 4 severity levels, radiometric temperature measurements (absolute and delta-T above reference), equipment operational state, and temporal anomaly progression markers (stable, worsening, intermittent). This enables training models that not only detect anomalies but assess their severity and recommend appropriate maintenance responses.

Use Cases

Autonomous Industrial Inspection

Training mobile inspection robots (Spot, ANYmal, Elios) to conduct thermal surveys of industrial facilities, identifying equipment anomalies and prioritizing maintenance actions. The paired thermal-RGB data with temporal context enables robots to build facility-wide equipment health models.

Predictive Maintenance AI

Training anomaly detection and remaining-useful-life estimation models from thermal time-series data. Models learn the temporal thermal signatures that precede equipment failures -- bearing degradation, insulation breakdown, electrical phase imbalance -- enabling maintenance before failure occurs.

Industrial Safety Monitoring

Real-time detection of thermal safety hazards: overheating equipment, steam leaks, insulation failures exposing hot surfaces, and abnormal process temperatures. Critical for facilities where thermal events can escalate to fires, explosions, or chemical releases.

Key References

  1. [1]FLIR Systems. FLIR Thermal Dataset for Algorithm Training.” FLIR Technical Report 2019, 2019. Link
  2. [2]Bergmann et al.. MVTec AD -- A Comprehensive Real-World Dataset for Unsupervised Anomaly Detection.” CVPR 2019, 2019. Link
  3. [3]Luo et al.. Autonomous Industrial Inspection with Quadruped Robots.” ICRA 2024, 2024. Link
  4. [4]Wang et al.. Deep Learning for Thermal Anomaly Detection in Industrial Equipment.” IEEE Transactions on Industrial Informatics 2023, 2023. Link

How Claru Delivers This Data

Claru's collector network includes ITC-certified thermographers working across 10+ industrial facility types. Unlike academic thermal datasets that provide only snapshots, Claru captures continuous thermal-RGB video that preserves the temporal patterns essential for predictive maintenance -- how thermal signatures evolve during equipment startup, load changes, and degradation cycles.

Custom campaigns can target specific facility types (refineries, power plants, data centers), equipment categories (electrical systems, rotating machinery, process piping), anomaly types (for augmenting datasets that lack specific failure modes), or seasonal conditions (summer vs. winter baseline comparisons). Turnaround is typically 6-8 weeks due to facility access coordination requirements.

Data is delivered with full radiometric temperature data preserved (not just pseudo-color visualizations). Thermal-RGB alignment matrices, camera calibration files, and equipment manifests accompany every delivery. Formats include RLDS, HDF5, WebDataset, and custom schemas with radiometric data in TIFF or NumPy array formats.

Frequently Asked Questions

Full radiometric data is preserved. Every thermal pixel includes an absolute temperature measurement (accuracy +/-2 degrees C), not just relative intensity. This enables quantitative anomaly detection models that assess severity based on actual temperature rather than visual contrast.

10+ types including petrochemical refineries, power plants (gas, steam, solar), water treatment, food processing, pharmaceutical manufacturing, data centers, steel mills, cement plants, automotive manufacturing, and general industrial parks.

Geometric alignment is computed via heated checkerboard calibration visible in both modalities. Temporal synchronization within 2ms via hardware trigger. Alignment matrices and camera intrinsics/extrinsics are provided with every delivery for custom re-projection.

12 types following NETA/ASTM standards: hot spots, cold spots (indicating flow blockage), moisture intrusion, insulation failure, electrical phase imbalance, overloaded circuits, bearing degradation, steam/gas leaks, refractory failure, fouling, and abnormal process temperatures. Each includes severity rating and delta-T measurement.

Yes. Custom campaigns can target specific equipment types or anomaly categories. For rare failure modes, Claru can coordinate with facility maintenance teams to capture data during planned maintenance windows when failed components are accessible for thermal imaging.

Request a Sample Pack

Get a curated sample of paired thermal-RGB industrial video with equipment health annotations to evaluate for your inspection or predictive maintenance project.