Loading...
Loading...
News corpora, social media analysis, movie/music metadata, sports data, cultural datasets, misinformation
10,305 datasets
One of five released datasets, this surface geology compilation maps alkaline and related igneous rocks of Proterozoic age across Australia. Geological units are represented as polygon and point geometries, attributed with stratigraphic, age, lithology, and compositional data. The dataset is produced by Geoscience Australia Data and was last updated on 2026-04-12.
48 randomized controlled trials involving 3,699 participants were analyzed to determine optimal Baduanjin exercise parameters for glycemic and lipid control. The meta-analysis, conducted by Shiying Zhang and registered in PROSPERO, suggests a regimen of 40โ45 minutes per session, three times weekly, for 24โ48 weeks. Results show significant reductions in fasting blood glucose, glycated hemoglobin, and triglycerides.
17.9 KB Excel file serves as an additional file for a systematic scoping review protocol on lifestyle diseases. Published on figshare by Marcin Golec under a CC-BY-4.0 license, it was last updated on 2026-05-15. The specific content and structure of the data within the file are unknown.
A review of published works on AI implementation and ethics in libraries and archives. Sara Mannheimer searched library and information science databases in fall 2022 and summer 2023, focusing on case studies from the United States and Canada. The dataset includes an inventory of analyzed literature sources and R code for analysis.
Three benchmark text-classification datasets support a study analyzing active learning algorithms. The data includes cleaned datasets, collected annotator files, and experimental splits from a recently accepted paper. Author varuntotakura published this repository on Hugging Face in May 2026.
PerCoR is a large-scale Persian benchmark for commonsense reasoning in a 4-choice sentence-completion format. It contains approximately 106,000 examples sourced from over 40 Persian websites across domains like news, culture, lifestyle, tech, religion, and travel. The dataset was created by author mina8113 and last updated on Hugging Face in May 2026.
C. J. Hunt's 2026 meta-analysis synthesizes evidence from 7 independent studies with a total sample size of 751,156 participants. It reports pooled odds and hazard ratios for the association between HSV-2 infection and Alzheimer's disease or all-cause dementia. The findings indicate no clear association with Alzheimer's and mixed evidence for all-cause dementia.
Monthly HDF data files provide a digitized record of global lightning signatures identified as streaks on filmstrip imagery from the DMSP Operational Linescan System. The dataset covers specific months across a 19-year period from 1973 to 1991, compiled by the GHRC DAAC. This archive represents an early satellite-based effort to catalog lightning activity.
Mars Express Radio Science data collected by NASA from 2010-01-01 to 2012-12-31. This dataset provides global gravity measurements for Mars, with a specific time slice from 2012 09 18T04:35:43.000 to 2012 09 18T06:12:20.500. The data is stored in HDF5 format and was last updated on the platform in 2026.
Replication data for a provisional academic paper on fiscal rules. The data includes Cofog-structured government expenditures for Brazil, other Latin American countries, and a sample of OECD nations, as well as tax expenditure data from the GTED. The dataset was authored by Fernando Moutinho Ramalho Bittencourt and last updated on May 24, 2026.
A 2014 version of the main cultural-historical structure map for the Dutch province of Drenthe, adopted by the Provincial Staten van Drenthe on 2 July 2014. The dataset appears as map 2F: Core Quality Cultural History and is provided by the Dutch Ministry of the Interior and Kingdom Relations. It is intended to guide planning and ensure cultural-historical cohesion for the future.
Physical Glass greenhouse horticulture 2019 is a geospatial dataset derived from a GIS analysis combining BAG buildings and BRT TOP10NL data. The dataset is provided by the Ministerie van Binnenlandse Zaken en Koninkrijksrelaties and is licensed under CC-PDM-1.0. The specific last update date and data volume are unknown.
A 2014 version of the main cultural-historical structure lines adopted by the Provincial Staten van Drenthe on 2 July 2014. The dataset, from the Dutch Ministry of the Interior and Kingdom Relations, focuses on ensuring cultural historical cohesion for the future, using structures like Frederiksoord and Veenhuizen as inspiration. It appears as map 2F: Core quality cultural history.
Adopted on 16 November 2020, this dataset defines a horticultural concentration area under the Dutch Environmental Regulation NH2020. The plan is published by the Ministry of the Interior and Kingdom Relations and can be consulted via the spatialplans.nl platform. It is part of a formal decision (No: 1477251/1478383) and references specific legal articles.
Decision No: 1477251/1478383 adopted on 16-11 2020 defines this spatial scope for greenhouse horticulture. The dataset is provided by the Dutch Ministry of the Interior and Kingdom Relations and is available in WFS, PNG, and WMS formats under a CC-PDM-1.0 license. Further details about the scope can be found in article 6.37 of the regulation.
A spatial plan adopted on 16 November 2020 (Decision No: 1477251/1478383) as part of the Dutch Environmental Regulation NH2020. The dataset, published by the Ministry of the Interior and Kingdom Relations, defines the scope for provincial monuments and can be consulted via spatialplans.nl, with legal details in articles 4.57 to 4.59.
Vectorized greenhouse horticulture data derived from 1997 aerial photography and 1:10,000 scale topographic maps. The dataset is provided by the Dutch Ministry of the Interior and Kingdom Relations under a public domain license. It is available in multiple geospatial formats including WMS and WFS.
Supplementary tables from a study comparing gene expression in human podocytes and brain/kidney organoids. The data includes lists of 600 and 344 genes, along with results from Gene Ontology, transcription factor, and Metascape enrichment analyses. The dataset was authored by Wasco Wruck and last updated on April 11, 2026.
A summary of empirical data sources used for model calibration and validation in a set of reviewed studies. The dataset is a 5.5 KB Excel file authored by Manting Wang and last updated on April 30, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
High-resolution CTD/STD data was collected by the ENDEAVOR research vessel in the Caribbean Sea and North Atlantic Ocean from April 11 to April 28, 1985. The dataset was processed by the National Oceanographic Data Center (NODC) into its standard F022 format. It provides nearly continuous vertical profiles of ocean parameters.