Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
16,012 datasets
Authorization and Termination Statements for campaign consultants in San Francisco, filed electronically starting December 9, 2024. Data is updated via the Socrata Publisher API as form filings are submitted and signed. The unique row identifier is 'Envelope Id'; which columns contain data depends on the 'TypeOfStatement'.
Turkish fan-translated light novels from the Baka-Tsuki project and other recoverable sources. The dataset contains 4 series, 58 chapters, and 31,924 line-level records, totaling 1,370,532 characters and 187,360 words. It was created by soundstarrain and last updated on Hugging Face in March 2026.
Meganet199 is a dataset published on Kaggle. The title suggests a focus on computer vision tasks. The dataset's content, scale, and specific origin require verification after download.
A dataset hosted on Kaggle, likely containing dynamic texture videos for training 3D convolutional neural networks. The dataset appears to be based on the DynTex++ benchmark for dynamic texture analysis. Specific details on size, creator, and update date are not provided in the available metadata.
A dataset related to Generative Adversarial Networks (GANs) hosted on Kaggle. The specific content, size, and creator are unknown from the provided metadata. The dataset's last update date and detailed structure are not specified.
YOLO Checkpoint is a dataset published on Kaggle, likely containing model weights for a YOLO (You Only Look Once) object detection model. The dataset's specific contents, such as the model version, training data, or performance metrics, are not detailed in the provided metadata. Its author, organization, and last update date are unknown.
Replication data supports findings on public organization contracting during extreme weather events. The dataset is aggregated and de-identified for reproducing tables and figures from the associated publication. It was provided by author Tipeng Chen via Harvard Dataverse and last updated in April 2026.
Twenty aragonite samples were precipitated in vitro from seawater between September 2021 and December 2022. The data contains amino acid compositions for samples precipitated with and without specific biomolecules such as aspartic acid and glycine. Researchers from the British Geological Survey collected this data to study how calcification fluids affect aragonite precipitation.
A Front End Engineering and Design report chapter provides cost estimates for a carbon capture and storage demonstration project at the Longannet Power Station in Scotland. The study refined capital cost accuracy from -30%/+50% to approximately -12%/+15% during its development phase. It was produced by the Scottish CCS Consortium in 2010 and is hosted by the British Geological Survey.
The Department of Housing Preservation and Development (HPD) reports on projects, buildings, and units counted towards the Housing New York (2014-2021) and Housing Our Neighbors (2022-present) plans. The dataset includes 19 columns detailing project timelines, unit counts by income category, and program details. Data is provided by data.cityofnewyork.us and was last updated on February 13, 2026.
TusGAN is a dataset published on Kaggle. The title suggests it is related to generative adversarial networks (GANs), likely containing generated or training images. The dataset's specific content, size, and origin are not detailed in the provided metadata.
Synthetic CCTV Retail Shoplifting Dataset with YOLO & VLM annotations. The dataset is designed for computer vision tasks in retail security. Its origin, size, and creation details are unspecified.
Object Detection Public Dataset is a collection of images for computer vision tasks, published on Kaggle. The dataset likely contains images annotated for object detection. Metadata is minimal; specifics on size, source, and annotations require verification after download.
yolo-lab5 is a dataset hosted on Kaggle. Its title suggests it contains data for training or evaluating YOLO (You Only Look Once) object detection models. The dataset's specific content, size, and origin are not detailed in the available metadata.
Sam's object detection using tensor flow is a dataset hosted on Kaggle. The dataset likely contains images and annotations for training object detection models using the TensorFlow framework. Specific details regarding the number of images, annotation format, and source are not provided in the available metadata.
A catalog of 12,455 solar flares observed between 1 May 2010 and 9 October 2017. It was compiled by Dr. Ryan Milligan through retrospective analysis of the known pointing of seven different space-based solar observatories. The dataset records the start time, end time, and which instrument(s) observed each flare.
A recaptioned image dataset featuring the character Lisa from the video game Genshin Impact. The dataset is intended for training Stable Diffusion and FLUX LoRA models. Its specific size, creation date, and authorship details are not provided.
A collection of images featuring Mai Shiranui, a character from The King of Fighters video game series, with new captions. The dataset is hosted on Kaggle and is intended for training Low-Rank Adaptation (LoRA) models. The author, organization, and specific data volume are unknown.
A recaptioned image dataset of the character Clorinde from the video game Genshin Impact. The dataset is intended for training Stable Diffusion or FLUX Low-Rank Adaptation (LoRA) models. Its author, size, and last update date are unknown.
A real-world mosquito image dataset intended for object detection and fine-grained species classification tasks. The dataset originates from Kaggle, but its author, organization, and creation date are unknown. The total number of images, file formats, and specific license details are not provided.