Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,963 datasets
Multiclass land cover semantic segmentation data is provided as PNG images sized 512x512 pixels. The dataset's author, organization, and specific geographic or temporal coverage are unknown. Its size, row count, and file formats are also unspecified.
A collection of images likely intended for training object detection models, specifically for identifying snakes. The dataset is hosted on Kaggle, but its exact size, creation date, and authorship are unknown. Its title suggests the images may be pre-cropped for use with the YOLO (You Only Look Once) object detection framework.
Socraic Evaluator is a dataset hosted on Kaggle. Its title suggests it may contain data for evaluating AI or large language model responses, potentially using a Socratic method of questioning. The dataset's specific content, size, and origin are not detailed in the available metadata.
A dataset for object detection tasks, published on Kaggle. The specific contents, scale, and creation details are not provided in the available metadata. Users must download the dataset to verify its exact composition and suitability for their projects.
Settlement data from the Seattle Office for Civil Rights tracks discrimination cases resolved through pre-determination settlements, conciliations, and private withdrawals with benefits. The dataset provides monthly counts of settled cases from 2017 onward, sourced from the City of Seattle. It was last updated on March 15, 2026.
From 2017 to the present, this dataset records the monthly count of technical assistance requests provided by the Seattle Office for Civil Rights (SOCR) Enforcement Division. The data is published by the City of Seattle and was last updated in March 2026. It likely contains monthly timestamps and counts of assistance events.
Chemical, physical, and profile data from the Atlantic Long Lines (AJAX) Expeditions in the North Atlantic, South Atlantic, and Southern Oceans from October 1983 to February 1984. The dataset includes measurements of dissolved inorganic carbon, total alkalinity, chlorofluorocarbons, nutrients, and other variables. It was collected by Taro Takahashi of Lamont-Doherty Earth Observatory using CTD, Coulometer, and bottle instruments.
Nocross is a computer vision dataset published on Kaggle. Its specific content and scale are not detailed in the available metadata. Users must download the dataset to verify its exact composition and potential applications.
A dataset titled 'Seeker-keypoint' published on Kaggle. The title suggests it contains images with annotated keypoints, likely for tasks like human pose estimation or object part localization. No further metadata, such as size, source, or creation date, is available.
Brats2024-multichannel contains over 100,000 axial slices of multi-spectral brain MRI scans. The data is provided in YOLO format, which is commonly used for object detection tasks. The scans include FLAIR, T1c, and T2w sequences.
Over 100,000 axial slices of grayscale FLAIR MRI scans formatted for YOLO object detection. The dataset is designed for training models to identify brain tumors. Its author, organization, and last update date are unknown.
A map documents the distribution and locations of pataali kuans (stepwells) within the 19th-century princely state of Jaisalmer. The dataset includes pargana (administrative division) boundaries for historical context. Palak Babel created this resource, which was last updated in April 2026.
Car Dent and Parts Detection Using YOLOv8 is a dataset for instance segmentation tasks. It contains images annotated for 29 classes related to vehicle parts and damage. The dataset is hosted on Kaggle and is formatted for use with the YOLOv8 object detection model.
Metaregistar is a catalog of electronic registries and information systems managed by state bodies, state administration organs, state agencies, state funds, and other public authorities. The dataset is provided by the Ministry of Public Administration and was last updated on March 31, 2026. It is available for download in Excel XLSX format.
A synthetic dataset of 4,000 Thai document images for OCR model training. It includes 1,000 general text samples, 1,000 table samples (invoices, budgets), and 2,000 official document samples (contracts, legal, police reports). The dataset was created by mekpro and last updated on March 15, 2026.
Global Islamic Pilgrimage Dataset (2000–2026) contains worldwide statistics for Hajj, Umrah, and Ziyarat pilgrimages. The dataset was sourced from Kaggle, but the author, organization, and specific collection method are unknown. The temporal coverage spans from the year 2000 to 2026.
ENA24 is a dataset of 8,789 camera trap images with 9,772 bounding box annotations for 23 wildlife species from Eastern North America. The images are 1920x1080 resolution and annotations are in COCO format. It was originally collected by the University of Missouri and distributed via LILA BC, and is hosted on Hugging Face by davanstrien.
tf_efficientnetv2_l_in21ft1k is a set of pre-trained neural network weights for image classification tasks. The dataset is hosted on Kaggle, a platform for data science and machine learning. Its specific source, creation date, and the size of the training data are not detailed in the provided metadata.
Steganogan Dataset V3 is a dataset related to steganography and generative adversarial networks, published on Kaggle. The dataset likely contains image data for training or evaluating models that hide information within other data. Specific details regarding its size, creation date, and authorship are not provided in the available metadata.
Sarah Day's dataset on figshare contains qualitative research data on organizational and structural factors affecting personalization in cervical cancer care. The data, last updated March 25, 2026, is a 9.5 KB XLS file. It is based on patient journey mapping with 31 participants, including voices of individuals 69 years old and older.