Image classification, object detection, segmentation, face recognition, OCR, image generation, video understanding
15,988 datasets
FF++_C23_Organised_Train_Val_Test is a preprocessed version of the FaceForensics++ dataset. It provides face crops from the C23 (high quality) compression level, organized into training, validation, and test splits. The dataset is hosted on Kaggle, but specific details on the creator, license, and exact size are not provided in the input metadata.
San Francisco Ethics Commission data on employees of registered campaign consultants, starting from December 9, 2024. Information is created when consultants electronically file Form 1 Registration or Form 2 Re-registration reports. The dataset is updated via the Socrata Publisher API as new filings are submitted and signed.
Seatbelt-detection4yolov11 is a dataset published on Kaggle. Its title suggests it contains images for training object detection models, specifically for seatbelt detection tasks. The dataset's author, organization, size, and specific contents are unknown from the provided metadata.
Environmental Information Data Centre dataset of leaf litter decomposition rates from an experiment in eight Welsh upland rivers across moorland and conifer forest sites. The study, conducted from 2012 to 2013, examined responses to deciduous leaf additions versus control reaches, with sampling before and after additions.
Kaggle hosts this computer vision dataset titled 'yoloswinv2tshopee14'. The title suggests it is likely an object detection dataset, potentially related to e-commerce imagery from Shopee. Its specific content, size, and creation details are not provided in the available metadata.
A computer vision dataset hosted on Kaggle, likely containing images for object detection tasks. The dataset's title suggests it may be associated with the YOLO and Swin Transformer model architectures. Its specific content, scale, and origin require verification after download.
A dataset published on Kaggle, likely for training and evaluating object detection models. The title suggests it may be related to the YOLO (You Only Look Once) model family, possibly version 2 or an efficient variant. Specifics regarding the number of images, annotation types, and source are unknown from the provided metadata.
YoloMobileNetShopee14 is a dataset hosted on Kaggle. Its title suggests it may contain images for object detection tasks, likely related to e-commerce products. The dataset's specific content, size, and origin are not detailed in the provided metadata.
Kaggle hosts this dataset titled 'yolomobilenetnaver14'. The dataset's platform tags suggest it is related to object detection using the YOLO and MobileNet architectures. Its author, organization, and specific content details are not provided in the metadata.
Inorganic element data from surface seabed sediments (0-2 cm depth) in the Timor Sea's Petrel Sub-basin. The survey was conducted in May 2012 by the RV Solander as part of the National Low Emission Coal Initiative to investigate CO2 storage potential in shallow (<100m) marine environments.
Two major seabed swath-mapping and geophysical surveys, AUSTREA-1 and AUSTREA-2, were completed in early 2000 by the Australian Geological Survey Organisation. The data was commissioned to support Australia's Ocean Policy, specifically for developing the South-east Regional Marine Plan and establishing marine protected areas. The coverage includes Lord Howe Island, the South-east Australian Margin, Tasmania, the South Tasman Rise, and the Central Great Australian Bight.
Kaggle hosts this dataset of drone-captured images of solar panels. The dataset is intended for training the YOLOv8 object detection model. The specific content, size, and collection details require verification after download.
AGSO marine surveys cover a 1,000 km section of the Cape York Peninsula inner shelf between Weipa and Cape Flattery. The data, collected in 1992/1993, supports the Cape York Land Use Strategy Project to assess the region's natural resources.
MWS Vision Bench is the first Russian-language business OCR benchmark designed for multimodal large language models. The validation split is publicly available for open evaluation and comparison, with a paper expected soon. The dataset was uploaded by MTSAIR and last updated on March 11, 2026.
Roadway blocks represent street segments for a city, a common organizational unit for work like fixing potholes. The dataset is maintained by the District of Columbia Department of Transportation (DDOT) and was last updated on March 25, 2026. It includes road types such as streets, with segments remaining unbroken despite intersections.
Satellite products from the Geostationary Operational Environmental Satellite 13 (GOES-13) covering May 31 to July 25, 2017, collected during the NASA Convective Processes Experiment field campaign. The campaign involved sixteen DC-8 aircraft missions from May 27 to June 24, 2017, in the North Atlantic-Gulf of Mexico-Caribbean Sea region. Data are available in netCDF-3 format.
Corporate registration records from the District of Columbia's Department of Licensing and Consumer Protection. The dataset includes business entity details such as file number, entity status, business name, address, and report filing dates. It is maintained by the DC Corporations Division as the official Office of Corporate Registrar.
An Individual Participant Data Meta-Analysis synthesizing evidence from 67 datasets across 27 armed conflicts. The research, authored by Joan Barceló and hosted on Harvard Dataverse, investigates the association between exposure to wartime violence and religiosity. It was last updated in March 2026.
Mingda Wang's 2026 meta-analysis dataset, 28.6 MB in size, compiles evidence on how ants influence soil carbon cycling and organic matter stability. The dataset, released under a CC-BY-4.0 license, likely contains tabular data from aggregated studies for statistical synthesis. It supports research into the role of soil fauna in biogeochemical processes.
Kaggle hosts a dataset titled 'yoloswinv2tnaver12'. The title suggests it is likely related to computer vision and object detection, potentially using a YOLO and Swin Transformer V2 architecture. The dataset's author, organization, and specific contents are unknown.