3DSRBench: A 3D Spatial Reasoning Benchmark with 2,772 VQAs | DataSalon

Category: Multimodal & LLM

Name: 3DSRBench: A 3D Spatial Reasoning Benchmark with 2,772 VQAs
Creator: ccvl
Published: 2025-01-02T00:00:39
Keywords: 3d Spatial Reasoning, Vision Language, Benchmark, Computer Vision, Vqa, Synthetic Data, Synthetic, Multimodal

Available on 1 platform

Description

3DSRBench is a manually annotated benchmark for evaluating 3D spatial reasoning in large multimodal models. It contains 2,100 visual question-answering (VQA) pairs on MS-COCO images and 672 on multi-view synthetic images rendered from HSSD, for 2,772 in total. The dataset was published by 'ccvl' on Hugging Face and was last updated there in February 2025.

Use Cases

Benchmarking 3D spatial reasoning in multimodal models based on annotated VQAs.
Training models to understand spatial relationships from 2D images.
Evaluating model performance on synthetic multi-view imagery for spatial tasks.
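
A minimal sketch of how the benchmarking use case might be scored, assuming exact-match answers and a per-subset breakdown (for example MS-COCO versus synthetic). The field names `id`, `answer`, and `source` are hypothetical, as is the helper `accuracy_by_group`; the listing does not document the actual schema or the benchmark's official metric.

```python
from collections import defaultdict


def accuracy_by_group(records, predictions, group_key="source"):
    """Return {group: accuracy} using case-insensitive exact match.

    records: list of dicts with hypothetical keys "id", "answer", and group_key.
    predictions: dict mapping a record id to the predicted answer string.
    """
    correct = defaultdict(int)
    total = defaultdict(int)
    for rec in records:
        group = rec[group_key]
        total[group] += 1
        pred = predictions.get(rec["id"])
        # A missing prediction counts as wrong; case and surrounding
        # whitespace are ignored when comparing answers.
        if pred is not None and pred.strip().lower() == rec["answer"].strip().lower():
            correct[group] += 1
    return {group: correct[group] / total[group] for group in total}


# Toy records standing in for benchmark rows (not real data).
toy_records = [
    {"id": "q1", "answer": "A", "source": "coco"},
    {"id": "q2", "answer": "B", "source": "coco"},
    {"id": "q3", "answer": "A", "source": "synthetic"},
]
toy_predictions = {"q1": "a", "q2": "A", "q3": " A "}
scores = accuracy_by_group(toy_records, toy_predictions)
```

Reporting the two subsets separately keeps the smaller synthetic set from being averaged away by the larger MS-COCO set.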

Strengths

All 2,772 visual question-answering pairs are manually annotated.
Includes 672 VQAs on multi-view synthetic images, providing a controlled test environment.

Limitations

Column-level documentation is absent; field semantics must be inferred after download.
Row count is not reported in the platform metadata, although the description states 2,772 VQA pairs; confirm the actual number of rows after download.
Description metadata is limited; actual data quality requires manual inspection after download.
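
Because the columns are undocumented, a first inspection pass after download might look like the sketch below. It assumes the rows are available as a list of dictionaries (for instance, a handful of rows read with the Hugging Face `datasets` library); the helper `summarize_columns` and the toy column names are hypothetical and do not reflect the real schema.

```python
from collections import Counter


def summarize_columns(rows, max_examples=2):
    """Collect value types, missing counts, and sample values per column."""
    summary = {}
    for row in rows:
        for name, value in row.items():
            info = summary.setdefault(
                name, {"types": Counter(), "missing": 0, "examples": []}
            )
            if value is None:
                info["missing"] += 1
                continue
            info["types"][type(value).__name__] += 1
            # Keep only the first few non-missing values as examples.
            if len(info["examples"]) < max_examples:
                info["examples"].append(value)
    return summary


# Toy rows with made-up columns, used only to show the output shape.
toy_rows = [
    {"question": "Which object is closer?", "answer": "A", "image_url": None},
    {"question": "Is the chair facing the camera?", "answer": "B",
     "image_url": "http://example.com/1.jpg"},
]
report = summarize_columns(toy_rows)
for column, info in report.items():
    print(column, dict(info["types"]), "missing:", info["missing"])
```

The per-column type counts and sample values give a starting point for guessing field semantics, which still need to be checked against the data by hand.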

Provenance

Source: ccvl on Hugging Face
Collection Method: Manually annotated visual question-answering pairs on MS-COCO and synthetic images rendered from HSSD.
Freshness: Last updated 2025-02-03 06:16:52; freshness should be verified.

Quality Score

D36

Community

566 downloads

8 likes

0 views

Dataset Info

Author: ccvl
Created: Jan 2, 2025
Updated: Feb 3, 2025
Last synced: Jul 21, 2026
