26,260 science questions paired with 6,206 images sourced from CK-12 Foundation's open educational resources. The dataset includes both text-only and diagram-based visual reasoning questions for middle school science. It was uploaded by 'notefill' to HuggingFace and last updated on 2025-11-21.
Use Cases
- Train multimodal question-answering models based on the described science questions and textbook images.
- Benchmark visual reasoning capabilities of AI systems based on the diagram-based questions mentioned.
- Develop educational AI assistants based on the dataset's content from middle school science textbooks.
- Study the integration of text and images in educational assessment based on the multimodal structure.
Strengths
- 26,260 questions provide a substantial corpus for training or evaluation.
- 6,206 images offer a significant visual component for multimodal tasks.
- Includes both text-only and diagram-based visual reasoning questions, covering multiple question types.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Freshness should be verified; last updated 2025-11-21.
Provenance
- Source
- CK-12 Foundation's open educational resources.
- Freshness
- Last updated 2025-11-21 15:51:53.