Loading...
Loading...
Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,469 datasets
Gaurav Sood Dataverse hosts supplementary files for replicating the study 'Social Proof is in the Pudding: The (Non)-Impact of Social Proof on Software Downloads'. The dataset contains large files referenced by the associated GitHub repository. Its cross-platform presence on DataCite and Dataverse indicates it is a recognized replication resource for social science research.
A 5.5 KB Excel file summarizing similarity and distance metrics for multi-omics graph construction. It was authored by Masrafe Bin Hannan Siam and last updated on March 19, 2026. The dataset likely contains a comparison of methods, including their mathematical ranges, core capabilities, and tunable hyperparameters.
PISA 2022 data analyzed to explore relationships between intellectual curiosity, self-efficacy, perseverance, and mathematical problem-solving ability. The 5.5 KB Excel file, authored by Yongzhao Wang, was last updated on March 19, 2026. It provides empirical evidence on psychological mechanisms in standardized educational contexts.
A 9.5 KB Excel file containing pairwise stochastic dominance probabilities, sorted by the number of models beaten. The dataset was authored by Lawrence Fulton and last updated on March 19, 2026. It is licensed under CC-BY-4.0 and hosted on figshare.
12.2 KB of raw data supporting bar graphs and statistical analyses, likely related to diabetes research. The data was uploaded by Jiao Liu to figshare and last updated on March 19, 2026. It is licensed under CC-BY-4.0.
Goedel-Prover-V2 SFT Dataset contains 1,745,010 samples for supervised fine-tuning in formal theorem proving. The dataset was created by Goedel-LM and is associated with a 2025 arXiv preprint. It was last updated on the Hugging Face platform in March 2026.
Ben Rosche of the Political Analysis Dataverse published this replication dataset in 2026 to address dependencies in coalition government research. It provides simulation and empirical data on coalition survival to validate a Multiple Membership Multilevel Model.
558,482 bytes of raw data and statistical analyses support all figures in a manuscript submission. The dataset likely contains measurements related to avoidance behavior, goal-directed control, and chemogenetic interventions using tools like DREADDs or Salvinorin B. It is structured in an Excel workbook, facilitating direct examination of the underlying experimental results.
Net change in housing units from new buildings, demolitions, and alterations for New York City Council Districts. The data is derived from Department of Buildings-approved construction jobs filed or completed since January 1, 2010. It provides the 2010 census housing unit count, net change in Class A units, and units pending completion for political and statistical boundaries.
Since 2010, this dataset tracks the net change in housing units from new construction, demolitions, and alterations for New York City's Neighborhood Tabulation Areas (NTAs). It is aggregated from the NYC Department of City Planning's Housing Database, which is derived from Department of Buildings-approved jobs. The current version is 25q4, with all previous versions available on the DCP website.
Net change in housing units from new buildings, demolitions, and alterations for NYC Community District Tabulation Areas (CDTAs) since 2010. The data is aggregated from the NYC Department of City Planning's Housing Database, derived from Department of Buildings-approved construction and demolition jobs. It provides the 2010 census housing unit count, net change in Class A units, and units pending completion.
948 topsoil samples form the basis for Spearman correlation coefficients between heavy metal(loid) concentrations and soil characteristics. The dataset, authored by Holly L. Heafner, includes statistical significance indicators for correlations at p < 0.05. It was last updated on March 18, 2026.
De-identified data from a retrospective cohort study on peptic ulcer disease in the Baltic region. The 36.0 KB Excel file was authored by Abdulrahman Al-Dawoudi and last updated in March 2026. Platform tags suggest the study assessed seasonal patterns and independent predictors using statistical methods like multivariable logistic regression.
SEDdata contains scenario-emotion-behavior triples for psychological research. The dataset includes statistical analysis results and is authored by Yi-bo Chen. It is available in JSONL, XLSX, and TXT formats under a CC0 license.
A 454.4 KB dataset containing scenario-emotion-behavior records and experimental results. It includes statistical analysis and was authored by Yi-bo Chen. The dataset was last updated in March 2026.
An approach for Bayesian age-depth modelling that reconstructs accumulation histories for deposits dated by Pb-210. The method, developed by Maarten Blaauw and described in Aquino et al. (2018), can integrate Pb-210, radiocarbon, and other dates into chronologies. The underlying code is derived in part from the 'rbacon' package by the same authors.
A permutation-based hypothesis test for comparing two networks, developed by Claudia D. van Borkulo and described in a 2021 publication. The method assesses differences based on several invariance measures, including network structure, global strength, edges, and centrality. It is suited for both independent and dependent sample comparisons and uses l1-regularization for network estimation.
A software package providing tools for users and developers of Bayesian models. The package, authored by Paul‐Christian Bürkner, offers methods for converting, manipulating, and summarizing draws from posterior or prior distributions. It includes lightweight implementations of state-of-the-art posterior inference diagnostics.
Propagation of uncertainty using higher-order Taylor expansion and Monte Carlo simulation. Calculations are based on matrix calculus including covariance structure, referencing established methodologies from Arras 1998 (first order), Wang & Iyer 2005 (second order), and BIPM Supplement 1 (Monte Carlo). The dataset was authored by Andrej-Nikolai Spiess.
Supplementary tables from a study linking parasitic infection to faster social learning in a foraging task. The dataset, authored by Salamatu Abdu, contains the statistical model outputs referenced in the research. It was last updated on figshare in April 2026.