Loading...
Loading...
Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,459 datasets
Uganda is the geographic focus of this simulated dataset for validating a Bayesian spatiotemporal model of neonatal mortality. The data contains monthly district-level rates of newborn deaths within the first 28 days of life, covering a 14-year period from January 2010. It was created by George Bamwebaze and shared under a CC-BY-4.0 license.
A 5.5 KB Excel file containing statistical comparisons of different batch sizes against a best-performing model using 128 batch size. The dataset, authored by Ashik Mostafa Alvi, was last updated on April 20, 2026, and is shared under a CC-BY-4.0 license on figshare.
A dataset comparing the impact of features on TM use across different statistical methods. The dataset is a 9.5 KB Excel file authored by Pascal Petit and last updated on April 20, 2026. It is licensed under CC-BY-4.0.
5.5 KB of statistical analysis plans authored by Melaku Haile Likka, last updated on April 20, 2026. The dataset contains detailed plans for each research objective, structured in an Excel spreadsheet. It is shared under a CC-BY 4.0 license on the figshare platform.
Bayesian Gamma Regression Estimates of Mean and Dispersion Parameters for Maternal Age at First Birth by Region, Religion, and Residence in Ethiopia. The dataset was authored by Adimias Wendimagegn Agegnehu and last updated on 2026-04 -13. It is a 9.5 KB Excel file available under a CC-BY-4.0 license.
15.8 KB Excel file containing full statistics for phylogenetic ANOVA and two-block partial least squares tests. The dataset, authored by Linnea Lungstrom, was last updated on April 17, 2026. Its specific statistical results likely support research in evolutionary biology and comparative methods.
16.6 GB of neural and behavioral recordings from mice performing an odor discrimination task. The dataset, authored by Luis Boero and last updated in March 2026, captures responses to stochastic odor pulses across dozens of breaths. It includes measurements from olfactory sensory neurons and anterior piriform cortex neurons, correlating perceptual weight with respiration phase.
407,806 unique compact and extended X-ray sources are cataloged in this release from the Chandra X-ray Observatory. The Chandra Source Catalog version 2.1.1, updated in October 2024, provides over 100 uniformly calibrated positional, spatial, photometric, spectral, and temporal properties for each source, derived from 1,304,376 individual observation detections through the end of 2021. It is produced by the Chandra X-ray Center at the Harvard-Smithsonian Center for Astrophysics.
A synthetic dataset from a physiologically-based pharmacokinetic (PBPK) modeling study by Hyunseo Park, published in March 2026. The research simulates tigecycline exposure in plasma, epithelial lining fluid (ELF), and major organs to optimize inhaled dosing for Mycobacterium abscessus pulmonary infections. It includes model-based predictions for efficacy and safety thresholds across multiple dosing scenarios.
Dandan Guo from Huazhong University of Science and Technology authored a paper on the exponential stabilization of the wave equation with acoustic boundary conditions. The work uses Lyapunov and Riemannian geometry methods and applies the main theorem to wave equations with memory type acoustic boundary conditions. The paper includes an example application.
A mathematical paper by Žarko Mijajlović of the University of Belgrade presents a method for representing the inverse function of the cosmological scale factor a(t) as an elliptic integral. The work uses algebraic dependencies between cosmological parameters to compute special events in the universe's evolution in a uniform way. The dataset likely contains derived parameters or computational results supporting the paper's theoretical framework.
A theoretical work by G. Jothilakshmi of Alagappa University proposes an algebraic framework for analyzing fractional singular systems. The paper introduces a modern class of linear fractional singular delay systems with two orders and a decomposition method for matrix regular pencils. It includes a procedure for computing the reachable set and control input, illustrated with examples.
Noureddine Bouteraa of Université Oran 1 Ahmed Ben Bella authored a paper studying mild solutions of a fractional partial differential equation disturbed by multiplicative white noise. The work employs techniques of semigroup theory, Hausdorff measure, and Darbo fixed point theorem. The dataset likely contains mathematical or simulation results supporting this analysis.
A mathematical paper by Inès Feki from the University of Sfax proposes new supplements to linear operator perturbation theory. The work involves a non-analytic perturbation with multiple parameters and applies the theory to a Gribov operator in Bargmann space.
30 randomized controlled trials (N = 2,124) were analyzed to evaluate the dose-response effects of exercise on inflammatory biomarkers in overweight and obese postmenopausal women. The dataset, created by Gang Huang and last updated in March 2026, contains meta-analysis results from a systematic search of five databases up to January 2026. It includes standardized mean differences for biomarkers like CRP and TNF-α, with meta-regression results for exercise volume, duration, and intensity.
A meta-analysis of 30 randomized controlled trials (N = 2,124) investigating the dose-response effects of exercise on inflammatory biomarkers in overweight and obese postmenopausal women. The study, authored by Gang Huang and last updated in March 2026, systematically searched five databases and used meta-regression to analyze relationships between exercise parameters and biomarkers like CRP and TNF-α. Results indicate structured exercise significantly reduces TNF-α and CRP, with a trend suggesting higher intensity is more effective.
550,000 reasoning traces were distilled from the KIMI-K2.5 language model on high-reasoning tasks. The collection includes 2 billion tokens and is distributed across coding (60%), science (15%), math (10%), computer science (5%), logical questions (5%), and creative writing (5%). It was created by ansulev and last updated on Hugging Face in April 2026.
A geochemical dataset from estuarine sediment samples in Broad Sound, Queensland, analyzed using Q-mode and R-mode factor analysis, discriminant analysis, and regression. The data was published by Geoscience Australia Data and was last updated in March 2026. It identifies processes controlling concentrations of P2O5, Cu, Pb, and Zn in supratidal and intertidal zones.
This dataset contains experimental data from the optimization of 4-aniline substituted pyrido[3,2-d]pyrimidine derivatives as dual Pim/Mnk kinase inhibitors. It includes synthesized compounds with measured IC50 values against Mnk1, Mnk2, and Pim1 kinases, along with associated antiproliferative effects, solubility, and pharmacokinetic profiles. The data supports the development of compound 2j, a novel inhibitor with demonstrated in vivo antileukemia activity in MOLM-13 xenograft models.
1,000 Chinese multiple-choice math problems form this dataset, each annotated with a gold answer and a detailed rationale. It was introduced in the paper 'Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models'. The dataset was authored by SallyTan and last updated on Hugging Face in April 2026.