AgroCoT is a Chain-of-Thought benchmark for evaluating reasoning abilities in Vision-Language Models (VLMs) for agriculture. It contains 4,759 curated samples designed to test logical reasoning and problem-solving, particularly in zero-shot scenarios. The dataset was created by author wenyb and is hosted on HuggingFace.
Use Cases
- Benchmarking zero-shot reasoning performance of VLMs based on the described logical reasoning tasks.
- Evaluating problem-solving abilities of multimodal AI models in agricultural contexts.
- Training or fine-tuning VLMs to improve their chain-of-thought reasoning on agricultural problems.
Strengths
- Contains 4,759 carefully curated samples for evaluation.
- Specifically designed to test logical reasoning and problem-solving in zero-shot scenarios.
Limitations
- Column-level documentation is absent; field semantics must be inferred after download.
- Row count is unknown, which may limit suitability assessment.
- Description metadata is limited; actual data quality requires manual inspection after download.
Provenance
- Source
- HuggingFace, author wenyb.
- Freshness
- Last updated 2026-03-20 08:12:41; freshness should be verified.