A multimodal dataset from HuggingFace provides synchronized vision and tactile glove sensor data across distinct tasks. The dataset includes RGB video at 30 Hz and 720p resolution, lossless 16-bit depth streams, monochrome camera views, and per-frame aligned tactile data in Parquet format. It was created by touchtronix and last updated on March 16, 2026.
Use Cases
- Training multimodal AI models based on synchronized RGB video and tactile sensor streams.
- Developing sensor fusion algorithms based on aligned depth data and tactile glove readings.
- Benchmarking robotic manipulation tasks based on synchronized vision and tactile data.
- Calibrating sensor systems based on intrinsic and extrinsic calibration data mentioned in the description.
Strengths
- Data streams are synchronized across modalities.
- Tactile data is stored in high-performance Parquet format.
- Depth data is provided as a lossless 16-bit stream.
- Includes intrinsic and extrinsic calibration data.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- touchtronix
- Freshness
- Last updated 2026-03-16 16:29:51; freshness should be verified.