Loading...
Loading...
Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
16,023 datasets
SinGAN is a dataset published on Kaggle. The dataset likely contains images for training a single-image generative adversarial network model. Metadata is minimal; specifics about size, origin, and creation date are unknown.
A dataset named 'mbart-vi-ocr-adaptation-254000' hosted on Kaggle. The title suggests it contains text data, likely for adapting the mBART multilingual model to Optical Character Recognition tasks, potentially involving Vietnamese language. The dataset's specific content, size, and origin are not detailed in the provided metadata.
Images of corn seeds, likely for assessing quality and identifying defects. The dataset is hosted on Kaggle. Its specific size, collection date, and creator are unknown.
Tree ring width data from Saganaga Lake in Minnesota, USA, provides a paleoclimate record. The chronology covers 345 years, from 307 to -38 calendar years before present. The dataset was archived by NOAA NCEI's World Data Service for Paleoclimatology, with a last recorded update in 1988.
Tree ring core samples from Saganaga Lake, Minnesota, provide a 272-year chronology from 234 to -38 calendar years before present. The dataset is part of the NOAA/WDS Paleoclimatology archive, contributed by researcher Graumlich and resampled in 1988. It documents parameters related to tree growth for reconstructing past environmental conditions.
ARC-AGI-3-V34-Warm-CNN is a dataset published on Kaggle. Its title suggests a focus on computer vision, potentially related to the Abstraction and Reasoning Corpus (ARC) challenge for AGI. The dataset's specific content, size, and creation details are not provided in the available metadata.
A dataset of images generated by a Projected Generative Adversarial Network (GAN), likely for the VASC domain. It is hosted on Kaggle, but the author, organization, and specific creation date are unknown. The dataset's size, row count, and exact content require verification after download.
Projected-GAN AK Generated likely contains synthetic images produced by a Generative Adversarial Network (GAN) model. The dataset is hosted on Kaggle, but its specific size, creator, and update date are unknown. Columns suggest it may include generated image files and associated metadata.
Projected-GAN SCC Extended Generated is a dataset hosted on Kaggle. The title suggests it contains images generated by a Projected-GAN model, likely for computer vision tasks. Metadata is minimal; actual content requires verification after download.
Kaggle hosts a dataset titled Yolov12, which likely contains images for object detection tasks. The dataset's specific size, source, and update history are not provided in the available metadata. Its content and structure must be verified after download.
Kaggle hosts a dataset titled 'datasetyolo'. The dataset's content likely relates to object detection, inferred from the title's reference to the YOLO (You Only Look Once) model architecture. The author, organization, and specific collection details are not provided in the available metadata.
A dataset hosted on Kaggle, likely containing images for object detection tasks related to football. The dataset's author, organization, and specific temporal coverage are unknown. Metadata is minimal; actual content requires verification after download.
Protected designation of origin wine samples were tested by the official certification entity CVRVV between May 2004 and February 2007. The data were recorded by a computerized system (iLab) managing the process from producer requests to laboratory and sensory analysis. Each entry denotes a test, with the goal of using chemical analysis to determine wine quality.
A balanced collection of CT-scan images for lung cancer classification. The dataset includes images labeled as Benign, Malignant, and Normal. It was sourced from Kaggle, but the author, organization, and specific collection details are unknown.
21 August 2021 report details a main-track train derailment involving Canadian National Railway Company Train B73041-15 at Mile 18.9 of the Napadogan Subdivision near Pangburn Station, New Brunswick. The Transportation Safety Board of Canada authored this official safety investigation report, which is available in HTML format.
Rail transportation safety investigation report R21M0027 details a main-track train derailment involving Canadian National Railway Company Train B73041-15. The incident occurred on 21 August 2021 at Mile 18.9 of the Napadogan Subdivision near Pangburn Station, New Brunswick. The report is authored by the Transportation Safety Board of Canada.
One detailed report documents a collision with a cable involving a privately registered Bellanca 7GCBC aircraft near Shawinigan, Quebec on July 17, 2022. The Transportation Safety Board of Canada authored this official investigation report. It was last updated in March 2026.
A single official report details a main-track freight train derailment involving Canadian Pacific Railway Company. The Transportation Safety Board of Canada produced this investigation report for an incident that occurred on January 3, 2019, in Partridge, British Columbia. The report is published as an HTML document.
AmeriFlux carbon flux data for the University of Michigan Biological Station site. The site is a protected forest of mid-aged northern hardwoods, conifer understory, aspen, and old-growth hemlock, with a history of logging and wildfires. The dataset was contributed by author Peter S. Curtis from Virginia Commonwealth University.
A global monthly climatology of total inorganic carbon and total alkalinity on a 1Β° x 1Β° x 32-layer grid. The dataset was created by Catherine Goyet of UniversitΓ© de Perpignan using interpolation methods applied to high-quality data from programs like WOCE, JGOFS, and OACES. It is designed to initialize three-dimensional ocean-atmosphere carbon dioxide models.