News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,350 datasets
All data from a systematic review on digital parenting and children's digital wellbeing outcomes. The dataset, authored by Xinyu Dou, is available under a CC-BY-4.0 license and was last updated on April 26, 2026. It is a small collection of 69.4 KB, stored in DOCX and XLSX file formats.
Six testbed results for a cartographic line-simplification algorithm designed for nautical charting. The algorithm constrains line displacement, segment length, and bend geometry using scale-driven parameters expressed in millimeters. The work was authored by Christos Kastrisios and uploaded in April 2026.
Average distances in kilometers to nine types of sports facilities, including gyms, football fields, and swimming pools. The dataset is provided by the Dutch Ministry of the Interior and Kingdom Relations via the EU open data platform. The data is available as web feature and map services (WFS, WMS).
SPSS and STATA files containing data for replicating a manuscript on educational divides among U.S. House of Representatives staff. The data was authored by Phillip Ardoin and is hosted on Harvard Dataverse. The record was last updated on June 12, 2026.
Expression data for peb3::astA transcriptional fusions in wild-type Campylobacter jejuni and mutant strains. The dataset was contributed by David Hendrixson and is hosted by the Texas Data Repository via the Dataverse platform. It was last updated on June 8, 2026.
Comparia Conversations is one of the largest prompt and text completion datasets in French. It originates from Compar:IA, a conversational AI comparison tool developed by the French Ministry of Culture. The dataset was last updated on April 29, 2026.
63.2 MB of experimental data supporting the development of an endogenous promoter-driven expression system for functional gene study in the silkworm pathogen Nosema bombycis. The dataset, authored by Mengxian Long and last updated in May 2026, is shared under a CC-BY-4.0 license and includes files in ZIP and XLSX formats.
A dataset listing sports pitches and playing fields in Belfast. It includes information such as name, address, longitude, and latitude. The data is provided by the Government Digital Service under an open government license.
A cultural-historical valuation map for the municipality of Assen, serving as Annex 3 to the RAP Report 2876. The dataset is provided by the Dutch Ministry of the Interior and Kingdom Relations and is available under a CC-BY-4.0 license. The last update date is unknown.
A geospatial dataset identifying culturally and historically valuable settlements in the Dutch province of Drenthe. The data originates from Map A.7.2 of the Drenthe Streekplan, a regional plan adopted by provincial states on 27 June 1990. The dataset is provided by the Dutch Ministry of the Interior and Kingdom Relations under a public domain license.
A geospatial dataset from the Netherlands delineating sub-areas of the Cultural History Compass. The data is provided by the Dutch Ministry of the Interior and Kingdom Relations (Ministerie van Binnenlandse Zaken en Koninkrijksrelaties) and is available under a Creative Commons Public Domain Mark 1.0 license. The sub-areas are bordered based on common landscape characteristics.
The Synthetic Latvia Passports Dataset contains over 1,000 AI-generated passport images. All records are fully synthetic and do not correspond to real individuals. The dataset was created by ud-synthetic and was last updated on Hugging Face in May 2026.
The Synthetic Qatar Passports Dataset assembles more than 1,000 AI-generated passport images crafted for training OCR and computer vision models on identity documents. All data is synthetically generated and does not correspond to real individuals. The dataset was created by ud-synthetic and was last updated on 2026-05-13.
More than 1,000 AI-generated Irish passport images created by ud-synthetic, last updated on 2026-05-13. The dataset contains fully synthetic records with fictional personal details, intended for research and development in document analysis.
More than 1,000 AI-generated Taiwanese passport images created for training OCR and computer vision models. All records are fully synthetic and do not correspond to real individuals, as stated by the author ud-synthetic. The dataset was last updated on 2026-05-13.
More than 1,000 AI-generated passport images are compiled for training OCR and computer vision systems. All records are fully synthetic, containing no real personal data. The dataset was created by ud-synthetic and last updated on 2026-05-13.
More than 1,000 AI-generated passport images comprise this fully synthetic dataset. Created by ud-synthetic, it is designed for training OCR and computer vision systems on identity documents. The dataset was last updated on May 13, 2026.
Four oceanographic surveys conducted in the Eastern North Atlantic Ocean during Spring 1991, Winter 1992, Winter 1993, and Spring 1994. This dataset contains temperature, salinity, pressure, and Acoustic Doppler Current Profiler (ADCP) data from CTD and current meter casts, mapped onto specific potential density surfaces. It was produced by the NOAA National Centers for Environmental Information as part of the Subduction Accelerated Research Initiative.
Conductivity-temperature-depth (CTD) casts collected during a joint U.S./China oceanographic program provide high-resolution vertical profiles of temperature, salinity, pressure, and dissolved oxygen in the East China Sea. Data were gathered from the vessel SCIENCE-1 over four days in July 1984 by the Woods Hole Oceanographic Institution. The profiles are processed to the NODC F022 standard format, reporting measurements at depth intervals as fine as one meter.
78 studies encompassing approximately 12,400 patients across 12 cancer types were synthesized in this systematic review and meta-analysis. The work by Chunhui Liu, last updated in March 2026, analyzes the role of the SETD2 gene in metabolic reprogramming and response to immunotherapy. It reports pooled odds and hazard ratios linking SETD2 loss to altered tumor metabolism and reduced clinical benefit from immune checkpoint inhibitors.