A dataset from Kaggle, likely containing precomputed image embeddings or model cache files. The title suggests it is derived from the COCO dataset, potentially using YOLO and CLIP-ViT-16 models, and may be associated with the 'Karpathy' split. The specific content, scale, and authorship require verification after download.
Use Cases
- Benchmark image retrieval systems using precomputed embeddings (inferred from domain, verify after download)
- Fine-tune or analyze vision-language models on COCO-derived features (inferred from domain, verify after download)
- Compare performance of different embedding architectures (YOLO vs. CLIP) (inferred from domain, verify after download)
Limitations
- Metadata is minimal; actual content requires verification after download.
- Row count, file formats, and license are unknown.
- Column-level documentation is absent; field semantics must be inferred after download.
Provenance
- Source
- Kaggle
- Collection Method
- Likely derived from the COCO dataset using YOLO and CLIP-ViT-16 models.
- Time Range
- null
- Freshness
- Last updated date is unknown; freshness unverified.
- Geography
- null