Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
16,012 datasets
Kaggle hosts a dataset titled 'model_yolo'. The dataset's content is inferred to relate to the YOLO (You Only Look Once) object detection framework. Its specific contents, size, and creation details are not provided in the available metadata.
Chandra OCR 2 Cache is a dataset hosted on Kaggle. Its title suggests it contains data related to optical character recognition, likely derived from or associated with the Chandra X-ray Observatory. The dataset's specific content, size, and structure are not detailed in the available metadata.
Cauca YOLO Pose is a dataset published on Kaggle. Its title suggests a focus on object detection and pose estimation, likely using the YOLO framework. The dataset's specific content, size, and origin require verification after download.
A dataset for training object detection models, specifically for identifying tires. The data is hosted on Kaggle and is likely formatted for use with the YOLOv8 framework. The dataset's author, organization, and specific scale are unknown.
Experimental results likely from a computer vision project comparing or evaluating the YoloSwin model. The dataset is hosted on Kaggle, but its specific origin, size, and creation date are unknown. It appears to contain performance metrics or logs from object detection experiments.
YOLOv8-TrOCR combines object detection and optical character recognition models. The dataset likely contains images with bounding box annotations and corresponding text transcriptions. It is hosted on Kaggle, but its specific content and scale require verification.
YOLO Trained Weights MTMCT is a dataset of pre-trained model weights for the YOLO object detection architecture. The weights are likely intended for tasks involving multi-target, multi-camera tracking scenarios. Published on Kaggle, the dataset's specific source, creation date, and detailed contents require verification after download.
TrOCR GT Manual appears to be a dataset for optical character recognition tasks, likely focusing on handwritten text. It is hosted on Kaggle, a platform for data science competitions and projects. The dataset's specific content, size, and creation details are not provided in the available metadata.
Sugarcane Leaf Image Dataset is a collection of images of sugarcane leaves, likely intended for computer vision tasks. It was published on Kaggle, but details about its size, collection method, and authorship are unknown. The dataset's last update date and specific contents require verification after download.
A ResNet model achieving 92.2% performance on the VinBigData chest X-ray dataset. The dataset likely contains medical images for training and evaluating computer vision models. It is hosted on Kaggle, but specific details about the data's size and structure are not provided in the metadata.
A cache of precomputed features for a computer vision model pipeline. The dataset likely contains intermediate representations from models like YOLO and CLIP-ViT-B/16, generated on the COCO Karpathy split. It is hosted on Kaggle, but the exact size, format, and creation details are unspecified.
KhocRoi is a dataset published on the Kaggle platform. The dataset's content is inferred to be related to computer vision from its title. No further metadata regarding size, source, or creation date is available.
HorizonMath is a benchmark for measuring AI progress in mathematical discovery through automated verification, as described in a 2026 arXiv paper by Erik Y. Wang and colleagues. The dataset was created by 'squashenthus' and last updated on Hugging Face in March 2026. It focuses on evaluating AI systems' ability to generate and verify mathematical statements.
AmeriFlux FLUXNET-1F US-MMS provides carbon flux data for the Morgan Monroe State Forest in Indiana. The data was processed using the standard ONEFlux (1F) software by the AmeriFlux Management Project, with site management by Indiana University and the Indiana Department of Natural Resources. The forest is a secondary successional broadleaf forest within the eastern deciduous forest transition zone.
China and Vietnam data on candidates for promotion during the 18th and 11th Party Congresses. The dataset contains original data on Internet search queries and media coverage for contenders, created by Dimitar D. Gueorguiev of Syracuse University. It was used to analyze the role of public profiles and elite-mass linkages in authoritarian promotion contests.
414 individuals from a northeastern city were interviewed for this mixed-methods study of ethnic family organization. The data includes married Italian-Americans, their spouses, a Protestant control group, and older Italian immigrants, all collected by Colleen L. Johnson. Interviews covered filial relationships, kinship solidarity, marital relations, and socialization practices.
AmeriFlux carbon flux data for the US-MMS site at Morgan Monroe State Forest in Indiana. The data is provided by Indiana University under a long-term agreement with the Indiana Department of Natural Resources. The forest is a secondary successional broadleaf forest with trees 60-80 years old, located in the maple-beech to oak hickory transition zone.
Replication materials support a forthcoming article in the Journal of European Public Policy. The dataset and STATA do file were authored by researcher Carl Henrik Knutsen. The data was last updated in April 2026.
2,800 young Chinese firms were randomized into small groups for a one-year study on business network effects. The dataset, organized by Jing Cai of the National Bureau of Economic Research, captures outcomes including an 8.1 percent revenue increase from monthly manager meetings. Effects persisted one year post-intervention and were shaped by peer quality and conversation content.
An original dataset of 197 nonconcessional International Monetary Fund loans to 47 countries between 1984 and 2003, compiled by Mark S. Copelovitch of the University of WisconsinβMadison. It was used to analyze variation in IMF lending policies, including loan size and conditionality, based on a common agency theory involving G5 countries.