LLaVA Annotations for PASCAL VOC: Visual Question Answering Labels
Available on 1 platform
Sign in to view source links and access this dataset
Description
llava_annotations_pascal_voc is a dataset hosted on Kaggle. The title suggests it contains annotations, likely for images from the PASCAL VOC dataset, generated or used by the LLaVA (Large Language-and-Vision Assistant) model. The dataset's specific content, size, and creation details are not provided in the available metadata.
Use Cases
Fine-tuning vision-language models for image understanding (inferred from domain, verify after download)
Benchmarking visual question answering systems on a standard object detection dataset (inferred from domain, verify after download)
Training or evaluating models that generate textual descriptions from visual inputs (inferred from domain, verify after download)
Strengths
Published on Kaggle, a major platform for data science resources.
Limitations
Metadata is minimal; actual content requires verification after download.
Row count, file formats, and column definitions are unknown, limiting suitability assessment.
License and authorship details are absent, which may affect usage rights.