Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,446 datasets
Statistical analysis of serum protein expression across canine health groups. The dataset was authored by Poonavit Pichayapaiboon and last updated on May 15, 2026. It compares healthy dogs to those with myxomatous mitral valve disease (MMVD) at different stages and chronic kidney disease (CKD).
A 30.2 KB dataset summarizing analytical methods, including descriptive statistics and chi-square tests. The dataset, authored by Ram Bahadur Khadka and last updated on May 8, 2026, examines relationships between variables such as public vehicle type and antibiotic resistance patterns. It is available under a CC-BY-4.0 license.
Hourly in-situ measurements and derived quantities from a macroalgal reef at Helgoland, Germany (54.18°N, 7.88°E), spanning the full annual cycle of 2025. Variables include sea-surface temperature, salinity, dissolved oxygen, photosynthetically active radiation, wind speed, and water and atmospheric partial pressures of CO₂ and CH₄. The dataset was authored by Bryce Van Dam and includes a second file with hourly gross production, respiration, and net ecosystem metabolism estimates derived from dissolved oxygen.
A 5.5 KB dataset containing the final weights from a genetic algorithm optimization process for agricultural wholesale market location selection. The dataset was created by Guizhe Xin and uploaded to figshare in April 2026. It supports a case study evaluating seven candidate location schemes.
Results from 100 Y-randomization iterations per model-predictor-target combination to assess predictive model validity. The dataset includes metrics like R2_fit_random, Q2_LOOCV_random, and p_value_random for models fit on randomized target variables. It was authored by Mythili V and shared on figshare in April 2026.
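For readers unfamiliar with Y-randomization, the idea can be sketched as follows: refit the model many times on shuffled target values and check how often the randomized fit matches or beats the real one. This is only a minimal illustration using a one-predictor least-squares fit. The function names, the toy data, and the add-one empirical p-value are assumptions made for this sketch and are not taken from the dataset, whose models and exact metric definitions are not described here.

```python
import random


def r_squared(x, y):
    """R^2 of a one-predictor least-squares fit (squared Pearson correlation)."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    if sxx == 0 or syy == 0:
        return 0.0
    return (sxy * sxy) / (sxx * syy)


def y_randomization(x, y, n_iter=100, seed=0):
    """Compare the true fit against fits on shuffled targets (hypothetical helper)."""
    rng = random.Random(seed)
    r2_true = r_squared(x, y)
    r2_random = []
    for _ in range(n_iter):
        shuffled = list(y)
        rng.shuffle(shuffled)  # break any real x-y relationship
        r2_random.append(r_squared(x, shuffled))
    # Assumed definition: empirical p-value with an add-one correction.
    exceed = sum(1 for r in r2_random if r >= r2_true)
    p_value = (exceed + 1) / (n_iter + 1)
    return r2_true, r2_random, p_value


# Toy data (illustrative only): a noisy linear relationship.
x = [float(i) for i in range(20)]
y = [2.0 * v + 1.0 + ((-1) ** i) * 0.5 for i, v in enumerate(x)]
r2_true, r2_random, p_value = y_randomization(x, y)
```

A model that has learned a genuine relationship should show a true R² well above the randomized values, giving a small p-value. A leave-one-out Q² could be randomized the same way by swapping in a different scoring function.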
A paper-based dataset presents quantitative comparisons of the Hermite Weighted Essentially Non-Oscillatory (HWENO) method using different numerical fluxes for solving hyperbolic conservation laws. The study by Liu Hongxia of Taiyuan University of Technology systematically investigates performance metrics like CPU cost, accuracy, and non-oscillatory properties, focusing on one-dimensional systems. The HWENO method is a high-order, high-resolution scheme for simulations with discontinuous or sharp gradient solutions.
A dataset describing the discovery of benzothiophene difluoromethyl phosphonate (DFMP) inhibitors with potent dual PTPN1/PTPN2 activity. The data, authored by Xiao Mei Zheng and last updated on 2026-04-29, includes results from structure-based design, vector optimization, and mechanistic studies identifying SLC19A1 as a key transporter.
An open-source R software package implementing a Bayesian Poisson Non-Negative Matrix Factorization (NMF) algorithm. The package includes models and plotting capabilities for analyzing count data, with applications focused on cancer mutational signatures. It was authored by Jenna M. Landy and published on figshare in April 2026.
Isabel T. Held published a statistical analysis dataset on figshare in May 2026. The 5.5 KB XLS file contains data for a linear mixed effects model evaluating hearing thresholds. The model assesses the main and interactive effects of time (Week) and ear tested (left or right), with a random effect for mouse.
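One common way to write a model of this kind is shown below as a sketch only. It assumes Week enters as a numeric covariate, ear as a left/right indicator, and the mouse effect as a random intercept. None of these choices are confirmed by the dataset description, and the symbols are introduced here for illustration.

```latex
y_{ij} = \beta_0 + \beta_1\,\mathrm{Week}_{ij} + \beta_2\,\mathrm{Ear}_{ij}
       + \beta_3\,(\mathrm{Week}_{ij} \times \mathrm{Ear}_{ij}) + b_i + \varepsilon_{ij},
\qquad b_i \sim \mathcal{N}(0, \sigma_b^2), \quad \varepsilon_{ij} \sim \mathcal{N}(0, \sigma^2)
```

Here $y_{ij}$ is the $j$-th hearing threshold measured for mouse $i$, $\beta_1$ and $\beta_2$ are the main effects, $\beta_3$ is the interaction, and $b_i$ captures the correlation among repeated measurements on the same mouse.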
Synthetic and processed empirical data supporting the manuscript "A Physical Theory of National Resilience." The dataset includes code and data to replicate key findings and figures, such as phase diagrams and early-warning signal analysis. It was authored by Feng Xie and archived on Harvard Dataverse in May 2026.
Ilayda Akkor's dataset, last updated on 2026-04-24, presents an integration method for chemical modeling in desalination process optimization. It describes the use of the Reaktoro-PSE package to incorporate detailed water chemistry into the WaterTAP techno-economic assessment platform. The dataset likely contains results validating this approach and demonstrating its use in cost-optimization problems for different feedwater compositions and chemical combinations.
A 2026 dataset by Juan Ramirez on figshare contains experimental and mathematical modeling data for dynamic covalent chemistry. The work investigates the transfer of structural units through imine/amine exchanges (transiminations) in solution and under solvent-free conditions, modulated by pH stimuli. The dataset includes results for reactions involving pyridine-derived aldehydes with aromatic and aliphatic amines, complemented by equilibrium-constants-based mathematical models.
Sediment sources in the Fitzroy River Basin, Queensland, Australia, identified and quantified using an integrated geochemical and modelling approach. The dataset likely contains geochemical composition data and Bayesian model outputs revealing changes in catchment contributions over time. It was published by Geoscience Australia Data and last updated on 2026-04-30.
NASA's OCO-2 mission provides high-quality space-based retrievals of atmospheric carbon dioxide (XCO2). This dataset uses a data assimilation technique to synthesize satellite observations with GEOS model simulations, producing gap-filled global gridded estimates. The data is produced by NASA's Global Modeling and Assimilation Office (GMAO) using the GEOS CoDAS system.
Yafeng Yao's dataset contains prediction results for the Cerchar Abrasivity Index (CAI) and Cerchar Abrasivity Ratio (CAR) of weakly cemented sandstones. It is a small dataset (5.5 KB) derived from an improved fuzzy stochastic RBF neural network model. The data was last updated in April 2026.
A 9.5 KB dataset containing experimental results from Cerchar abrasivity tests on weakly cemented sandstones in western China. It was created by Yafeng Yao to train an improved fuzzy stochastic RBF neural network model for predicting rock abrasivity indices. The dataset was last updated in April 2026.
Validation group training data samples for predicting rock abrasivity indices. The dataset was created by Yafeng Yao and published on figshare in April 2026. It is a small dataset at 5.5 KB, stored in an XLS file.
Cerchar abrasivity index tests on weakly cemented sandstones in western China show the dry state increases the CAI by 19.23% compared to the water-saturated state. The dataset contains training results for an improved fuzzy stochastic RBF neural network model predicting rock abrasivity. It was authored by Yafeng Yao and uploaded to figshare in April 2026.
A computational model dataset for intracellular calcium dynamics in human ventricular myocytes, focusing on stochastic gating and release variability. The dataset, authored by Gustavo Montes Novaes and published on figshare in April 2026, presents a Scalable Aggregate Calcium Release Unit (SA-CaRU) model integrating a Markov Chain-based description of L-type Calcium Channels. It enables systematic exploration of calcium release variability as a function of microdomain size and coupling under healthy and phosphorylated conditions.
Adverse event reports for two prostate cancer drugs were analyzed using the FDA Adverse Event Reporting System (FAERS). The dataset includes 3,357 reports for Degarelix and 4,075 reports for Relugolix from Q1 2009 through Q2 2024. The supplementary material was authored by figshare admin karger and published under a CC-BY-4.0 license.