Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,963 datasets
A video classification dataset for American Sign Language (ASL) recognition, published on Kaggle. The dataset includes pre-defined splits for training and evaluation. Specific details on the number of videos, collection timeframe, and original authors are not provided in the available metadata.
Tatar-language documents form a page-level benchmark for optical character recognition. Each item includes the original page image, structured OCR and layout annotations in JSONL, and a visual control image for verification. The dataset was created by yasalma and last updated on March 31, 2026.
A 1977 report details chemical analyses of manganese nodules from the Cape Leeuwin field off Western Australia. Data includes metal values recalculated to account for a measured average moisture content of 16 percent. The findings originate from Geoscience Australia.
A record of annual data security and permissible use audits conducted by the Washington State Department of Licensing on recipients of its shared data. The dataset includes audit types, dates, and recipient details. It is hosted on the data.wa.gov platform and was last updated on February 23, 2026.
Updated on 2026-03-07, these grids integrate land, marine, and satellite data for the southwest quadrant of Australia (24-46S, 106-140E). The Australian Geological Survey Organisation produced these bathymetry, gravity, and magnetic datasets in cooperation with Desmond Fitzgerald & Associates and the Australian Hydrographic Office.
YOLO-Data is a dataset hosted on Kaggle, likely containing images for training and evaluating object detection models based on the YOLO (You Only Look Once) architecture. The dataset's author, organization, and specific contents are not detailed in the provided metadata. Its size, format, and last update date are also unknown.
Metrcigan is a dataset hosted on Kaggle. Its specific content and scale are unknown from the provided metadata. The title suggests a focus on computer vision tasks, but the data's origin, size, and features require verification after download.
MetricGAN-Kanen-Enhanced is a dataset hosted on Kaggle. The title suggests it relates to a Generative Adversarial Network (GAN) model, likely named 'MetricGAN' or 'Kanen', designed for audio signal enhancement tasks. The dataset's specific content, size, and origin are not detailed in the provided metadata.
Aiocr Assistant is a dataset published on HuggingFace by author Asem75. The dataset was last updated on 2026-05-08 12:15:02. Its specific content and structure require verification after download.
Radiant MLHub is an open library for geospatial training data hosted by the Radiant Earth Foundation. It aggregates datasets contributed by the foundation's team and its partners, stored using the SpatioTemporal Asset Catalog (STAC) standard and exposed via a common API. The datasets include pairs of imagery and labels for machine learning tasks like image classification, object detection, and semantic segmentation.
RGB road images paired with binary masks for identifying potholes. The dataset is hosted on Kaggle and is intended for semantic segmentation tasks in computer vision. Its author, organization, and specific scale are not detailed in the provided metadata.
Win-RVQ-GAN-code is a dataset hosted on Kaggle. Its title suggests it contains code related to a Vector Quantized Generative Adversarial Network (VQ-GAN). The specific contents, scale, and authorship are not detailed in the provided metadata.
CNN_ASD_CEKPO is a computer vision dataset published on Kaggle. Its title suggests a potential focus on convolutional neural networks (CNNs) and possibly Autism Spectrum Disorder (ASD) analysis. The specific content, scale, and origin require verification after download.
NWPU-VHR-10-YOLO is a computer vision dataset hosted on Kaggle. The title suggests it contains satellite or aerial imagery annotated for object detection tasks, likely using the YOLO model format. Metadata is minimal; actual content requires verification after download.
An organized and verified version of the BraTS 2024 challenge datasets, including three tumor types. The collection contains 2,728 total cases of glioma, meningioma, and pediatric brain tumors. It was prepared by the author Spirit-26 and last updated on the platform in March 2026.
A 5.5 KB Excel file containing comparative performance metrics for classification models. The dataset includes accuracies, Cohen's Kappa scores, and AUCs for models trained on the PANDA medical imaging dataset versus those trained on ImageNet. It was authored by Michail Georgios Papachristos and last updated in March 2026.
Kaggle hosts a dataset titled 'scoliosis2yolov6'. The title suggests it contains medical images related to scoliosis, a spinal condition, formatted for use with the YOLOv6 object detection model. The dataset's author, size, and specific collection details are unknown.
A bird count database providing monthly observations organized by geographic zones. The dataset, authored by Carlos Lazo and last updated in March 2026, is stored in a 28.6 KB XLSX file.
Survey data from Lira District, Northern Uganda, on factors associated with emergency contraceptive pill use among women aged 15-49. The dataset, created by Josephine Vanessa Nakalema, is a 5.5 KB Excel file last updated in March 2026.
Josephine Vanessa Nakalema published this dataset in March 2026, capturing the attitudes of women aged 15-49 in Lira District, Northern Uganda. The 9.5 KB Excel file contains tabular data on public opinion and women's health topics.