Loading...
Loading...
Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,452 datasets
Primary data from a manuscript on optimizing a bidirectional shrouded Archimedes spiral hydrokinetic turbine. The dataset was authored by Yonghua Sun and is shared under a CC-BY-4.0 license. It was last updated on May 11, 2026.
Shahriar Afandizadeh's dataset provides an overview of hyperparameter optimization across various models. The dataset is stored in an XLS file with a size of 5.5 KB and was last updated on May 5, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
A theoretical model from Tbilisi State University analyzes a three-dimensional thermo-electro-magneto-elastic solid embedded in an inviscid fluid. The work by George Chkadua proves uniqueness and existence theorems for the corresponding boundary-transmission problems using potential methods and pseudodifferential equations. The dataset likely contains the mathematical formulations and results underpinning this analysis.
A mathematical investigation of generalized thermo-electro-magneto elasticity problems for anisotropic layered structures with interfacial cracks. The work by David Natroshvili of Georgian Technical University applies potential methods and pseudodifferential equation theory to prove solution uniqueness and existence in Sobolev spaces. The analysis includes smoothness properties of solutions near crack edges and boundary condition interfaces.
A systematic study investigates bubble size distribution and void fraction in a batch bubble column under homogeneous and heterogeneous operation regimes. The work by Shahrouz Mohagheghian of Oklahoma State University examines the effects of liquid viscosity and gas superficial velocity. Results include statistical characterization of bubble size distributions using probability density functions and probability plots.
A benchmark dataset for testing whether autonomous AI research agents propose novel, mechanism-distinct hypotheses. It contains 10,380 rows of experimental training runs built on Prime Intellect's autonomous-speedrunning archive. The dataset was created by Evo and last updated on May 17, 2026.
18 force field variants, including 10 OPLS and 8 GAFF models, were evaluated and recalibrated for glycine molecular dynamics. James W. Meadows used multiobjective Bayesian optimization to improve predictions of crystal lattice energies, densities, and solution enthalpies. The dataset was published on figshare in April 2026.
Net change in housing units for New York City Community District Tabulation Areas (CDTAs) tracks new construction, demolitions, and alterations since 2010. The dataset is aggregated from the NYC Department of City Planning's Housing Database, which is derived from Department of Buildings-approved jobs. The current version reflects data through the fourth quarter of 2025.
Replication files for a study on financial fragility in Ecuador under full dollarization. The dataset covers 26 years from 2000 to 2025 and includes corporate financial indicators, firm dissolution records, and sectoral calibration data. It was constructed by Fernando Negrete using public corporate registry and macroeconomic data.
Net change in housing units for New York City Council Districts is tracked from 2010 onward, derived from Department of Buildings-approved construction and demolition jobs. The NYC Department of City Planning aggregates this data, with the current version reflecting updates through the fourth quarter of 2025. It includes columns for units completed in specific years (e.g., comp2010 to comp2024), filed, permitted, approved, and withdrawn.
Jin-De Huang provides analysis scripts for processing EarthCARE satellite observations of tropical cyclones. The 386.1 MB repository includes workflows for data preprocessing, colocation with TC tracks, and statistical analyses of radar reflectivity, Doppler velocity, and cloud microphysical properties. The code was last updated on May 1, 2026, and is released under an MIT license to ensure reproducibility.
A simulation experiment from Geoscience Australia compares statistical and mathematical techniques for predicting seabed mud content. The study evaluates factors like sample density, search neighborhoods, and secondary variables including bathymetry and distance-to-coast. It identifies a novel combined method, random forest and ordinary kriging (RKrf), which reduced relative mean absolute error by up to 17% compared to a control.
Performance metrics from the SemEval-2018 evaluation campaign. The dataset includes 95% confidence intervals calculated via 10-fold cross-validation to reflect result variability. It was authored by Weiguang Dong and last updated on April 27, 2026.
Rayane Barcelos Bisi published a summary of the experimental setup for a response surface design study on winter photoperiod supplementation with LED lighting in greenhouses. The 13.5 KB XLS file details plant material, greenhouse conditions, photoperiod strategy, LED spectra, and statistical analysis. It was last updated on April 30, 2026.
Raw data and dot plots supporting a manuscript on the effects of tamsulosin and pioglitazone across the progression of crystalline nephropathy. The dataset was authored by Mariana Perez and last updated on 2026-05-04. It is a 1.2 MB file available under a CC-BY-4.0 license.
A study of sediment distribution in Keppel Bay, a macrotidal environment at the interface of the Fitzroy River catchment and the Great Barrier Reef shelf. Researchers classified seabed sediments into five distinct classes using statistical techniques on grainsize, chemical composition, and modelled seabed shear stress data. The dataset is hosted by the Australian Ocean Data Network and was last updated in April 2026.
135 simulated configurations from eight Mediterranean city case studies optimize urban block design for thermal comfort and energy use. The dataset includes input parameters, UTCI-based comfort scores, Energy Use Intensity values, and Pareto front results. Author Alice Vicini published this 28.6 KB Excel file on figshare in April 2026 under a CC-BY-4.0 license.
A working paper record from the Reality Drift Archive, last updated on 2026-04-26, introduces a set of conceptual terms for analyzing digitally mediated environments. It groups terms like Filter Fatigue, Synthetic Realness, and the Optimization Trap to describe patterns in algorithmic mediation and cultural systems. The document is retained as an exploratory archive for descriptive and comparative purposes.
Sediment sources to the Fitzroy River coastal zone have been identified and quantified using an integrated geochemical and modelling approach. Geochemical data indicate a sediment composition consistent with derivation from mixed catchment sources, with a Bayesian statistical model revealing changes in catchment sediment sources over time. The dataset is provided by the Australian Ocean Data Network and was last updated on 2026-04-16.
Fisheries and Oceans Canada provides yearly spawning stock biomass estimates for Atlantic Cod in the southern Gulf of St. Lawrence (NAFO 4T-4Vn) from November to April. The data, produced via a Statistical Catch-at-Age model and Markov Chain Monte Carlo simulations, includes median estimates and uncertainty percentiles (2.5th, 25th, 75th, 97.5th) measured in thousands of tons. These estimates support stock assessment and fisheries management decisions.