Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,949 datasets
35,000 square miles of the Cairns-Townsville hinterland were mapped from 1956 to 1959, revealing a complex geological history. The data documents a Precambrian shield and a flanking Palaeozoic geosynclinal zone containing up to 40,000 feet of sediments. The work was conducted by Geoscience Australia.
3349 public comments submitted to the OECD on tax evasion regulations form the core of this dataset. It was created by Margaret Kenney for a study published in the British Journal of Political Science. The data supports analysis of firm influence in international regulatory processes.
Wood Image Dataset for Wood Type is a dataset hosted on Kaggle. The title suggests it contains images of wood, likely for the purpose of classifying different wood types. The dataset's specific scale, origin, and creation date are not provided in the available metadata.
Model outputs from an EfficientNet-B4 architecture used as a teacher in a knowledge distillation experiment. The dataset is hosted on Kaggle, but its specific content, size, and creation details are not described. The title suggests it contains results from a training run, likely including predictions or intermediate representations.
A dataset of real receipt images intended for optical character recognition tasks, published on Kaggle. The dataset likely contains images of receipts and corresponding text annotations. Specific details on the number of samples, collection method, and time period are not provided in the available metadata.
Dung-VinDr-CXR-YOLO is a dataset of chest X-ray images, likely intended for object detection tasks. The dataset is hosted on Kaggle, but specific details about its size, annotation format, and origin are not provided in the available metadata. Users must download the dataset to verify its exact content, scale, and licensing terms.
A dataset of traffic sign images formatted for training YOLO object detection models. The dataset is hosted on Kaggle, but its specific size, annotation details, and creation date are not provided in the available metadata. The title suggests it is likely derived from or related to the Tsinghua-Tencent 100K traffic sign benchmark.
plant_resnet34_teacher_40epoch is a dataset published on Kaggle. The title suggests it contains model weights for a ResNet34 neural network trained for 40 epochs, likely for a plant-related image classification task. The dataset's specific contents, such as the number of classes or the source of the training images, are not detailed in the available metadata.
A dataset of neural network weights, likely for a convolutional neural network (CNN) model. It was published on Kaggle, but the specific author, organization, and creation date are unknown. The dataset's exact size, structure, and intended application are not detailed in the available metadata.
A dataset likely related to speech enhancement using a MetricGAN-BSRBF model. It is published on Kaggle, but detailed metadata such as author, size, and specific contents are not provided. The dataset's exact scope and creation date are unknown.
A dataset titled 'zelda-yolo-main' is hosted on Kaggle. The dataset's title suggests it is likely related to object detection using the YOLO framework, potentially for imagery from the 'Zelda' video game series. No further metadata, such as author, size, or sample data, is provided.
Conser-vision Practice Area is an image classification dataset hosted on Kaggle. The dataset likely contains images intended for practicing computer vision model development. Its specific content, size, and creation details are not provided in the available metadata.
The dataset's temporal coverage is unknown. It appears to contain data on poverty levels, likely for the region referred to as Negri Karangan HytamPutyh. The dataset is hosted on Kaggle, but the author, organization, and specific data collection details are not provided.
A dataset for pedestrian detection in urban environments, likely formatted for use with the YOLO object detection model. It is hosted on the Kaggle platform. The specific size, collection method, and creation date are unknown from the provided metadata.
Replication data and code for the study 'Love Blinds? Winners, In-party Favoritism, and Support for Violations of Democratic Norms' published in the British Journal of Political Science. The dataset was contributed by author Yu-Shiuan Huang and last updated on May 5, 2026. It likely contains survey or experimental data related to political behavior and attitudes.
100 annotated scanned page images from the Advocates Library manuscript index card collection at the National Library of Scotland. The dataset provides bounding box annotations for training object detection models to locate index cards on pages.
Elemental concentrations of carbon, nitrogen, phosphorus, and potassium are measured in the coarse roots, fine roots, leaves, stems, and branches of the shrub Rhododendron thymifolium across the QinghaiβTibet Plateau, with corresponding soil physicochemical properties. The dataset supports research into plant ecological stoichiometry and nutrient cycling in high-altitude environments. It is structured as a tabular dataset with a file size of 48,540 bytes.
Yolotest is a dataset published on Kaggle. The title suggests it is likely used for testing object detection models, particularly those based on the YOLO (You Only Look Once) architecture. No further details on size, source, or creation date are available from the provided metadata.
YoloTrain is a dataset published on Kaggle. Its title suggests it contains images intended for training YOLO (You Only Look Once) object detection models. The dataset's specific contents, size, and origin are not detailed in the provided metadata.
SMUGGLEBENCH is a multimodal safety benchmark accompanying the paper 'Making MLLMs Blind: Adversarial Smuggling Attacks in MLLM Content Moderation'. It is designed to study whether Multimodal Large Language Models can identify harmful text hidden, obfuscated, or disguised within images. The dataset was created by author zhihengli-casia and was last updated on Hugging Face in April 2026.