Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,424 datasets
33,679 naturally occurring Reddit-based human experiences paired with self-disclosed emotion labels form the EXPRESS benchmark. Created by author bangzhao and detailed in a 2025 arXiv paper, this dataset uses emotions explicitly disclosed by the original authors as ground-truth labels. The dataset was last updated on Hugging Face in May 2026.
A multilingual news retrieval benchmark containing synthetic multihop queries across 10 languages. The dataset is sourced from recent news articles and is designed for evaluating dense retrieval and text embedding models on content post-dating typical model training cutoffs. It was created by jinaai and last updated on April 30, 2026.
Québec government press releases published on the open_canada platform. The dataset is licensed under CC-BY-4.0 and was last updated on 2026-04-17. The specific volume, time range, and content details require verification after download.
City of Gold Coast provides a geospatial layer depicting the approximate location of non-pressurised sewer pipes as of October 2015. The data is supplied under a CC BY license and includes a disclaimer regarding its accuracy and suitability for decision-making. The dataset was last updated on the platform in March 2026.
A 2026 literature review and case report analysis covering PubMed articles from 2016 to 2024 on Monomorphic Epitheliotropic Intestinal T-cell Lymphoma (MEITL). The document, authored by Xinlong Xu, presents three case studies and discusses the potential of the JAK-1 inhibitor Golidocitinib in combination with the GDP regimen to improve patient prognosis.
Ana Flavia F. Ferreira's systematic review and meta-analysis evaluates the effects of microglial depletion using CSF1R inhibitors like PLX3397 and PLX5622 in preclinical models of Alzheimer's and Parkinson's disease. The work synthesizes results from 26 Alzheimer's disease and 17 Parkinson's disease studies. It was last updated on March 17, 2026, and is shared under a CC-BY-4.0 license.
A 2026 systematic review and meta-analysis by Ana Flavia F. Ferreira, evaluating the effects of microglial depletion via CSF1R inhibitors in preclinical models of Alzheimer's and Parkinson's disease. The analysis synthesizes results from 26 Alzheimer's disease and 17 Parkinson's disease studies. The document is a 49.4 KB text file summarizing neuroprotective outcomes, behavioral results, and pathological changes.
Baseline participant characteristics from a study, with results expressed as frequencies and percentages for categorical variables and mean ± standard deviation for continuous variables. The dataset was authored by Tobia Zanotto and is available under a CC-BY-4.0 license. It was last updated on April 15, 2026.
A 5.5 KB Excel file listing research sub-questions to be answered based on a scoping review. The dataset was authored by Weiqi Wang and last updated on April 22, 2026. It is shared under a CC-BY 4.0 license on the figshare platform.
A 5.5 KB Excel file summarizing the characteristics of sources included in a scoping review. The dataset was authored by Abd Arrahman Alomar and last updated on April 22, 2026. It is licensed under CC-BY-4.0 and available for download from figshare.
Raw culture data from environmental samples analyzed using the IDEXX method. The dataset is a 24.1 KB XLSX file authored by Olivia A. Harmon and last updated on April 22, -2026. It is licensed under CC-BY-4.0 and hosted on figshare.
Vendor of Record arrangements for advertising and communications services across the Ontario government. The dataset lists contract names, qualified vendors, start dates, and end dates established through competitive open bidding. It is provided by the Advertising Review Board and updated by the Government of Ontario.
A dataset quantifying the spread of Fos protein expression from a fiber optic tip and measuring Fos protein levels in insula and orbitofrontal cortex sites. The data includes measurements from rats with ChR2, eYFP, and naive control conditions. It is a 751.3 KB XLSX file authored by Ileana Morales and last updated in April 2026.
A study dataset investigates digital 3D reconstruction of built heritage through media-driven cultural perception. It includes 2,942 public TikTok comments and comparisons of five image-to-3D algorithms using a multi-metric framework. The dataset was authored by Weihui Zhang and last updated in April 2026.
136.3 KB of supplementary material for a study on cesium vanadate (CsVO3). The data includes calculated lattice parameters and atomic coordinates for Pbcm and Pc phases, along with Raman spectra, frequencies, and assignments at ambient pressure. It was authored by Zhenfang Xing and last updated on April 22, 2026.
A dataset linking residential green space, air pollution, and related metabolites to depression outcomes among cancer survivors. The data is provided in an XLSX file sized 135.9 KB and was published under a CC-BY-4.0 license by Xue Li. It was last updated on April 22, 2026.
25,309,483 anonymized user records assembled from randomized advertising incrementality tests by Criteo AI Lab. Each row includes 11 anonymized features, a treatment indicator, and binary labels for visits and conversions. The dataset has been preprocessed and balanced for classification and clustering tasks.
Over 1,000 AI-generated Brazilian passport images created by ud-synthetic, with a dataset page last updated on 2026-05-06. All images and associated data are fully synthetic and do not correspond to real individuals, intended for research and development purposes.
More than 1,000 AI-generated passport images compose this dataset, created by ud-synthetic. All images and associated personal details are fully synthetic and do not correspond to real individuals, making them suitable for research and development. The dataset was last updated on May 6, 2026.
Vicmap Reference - Infrastructure Sports Type Table is part of the VMREFTAB set of reference tables for the VICMAP suite of products. It is published by the Department of Transport and Planning under a CC-BY-4.0 license. The dataset was last updated on 2026-04-09.