Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,446 datasets
A March 2010 campaign using the Murchison Widefield Array 32-element prototype observed two ~50-degree fields in the southern sky, covering approximately 2700 square degrees. The resulting catalog contains 655 high-significance radio sources and 871 lower-significance candidates detected in the 110-200 MHz band. This dataset was created by NASA HEASARC in October 2012 based on the reference paper's source list.
A catalog of radio sources selected down to a 5 sigma threshold from the COSMOS field, produced by NASA. The catalog is intended for statistical analyses and includes corrections detailed in the associated 2016 release paper. The total fraction of spurious sources in the full 2 square degree area is below 2.7%, but can be reduced to below 0.4% by applying a signal-to-noise ratio cut.
Decadal census data from 2000, 2010, and 2022, harmonized for São Paulo, Brazil. The dataset includes variables such as population density, income, household size, sewage coverage, verticalisation, mobility distances, and urban informality overlaps. It was created by Alex Monito Nhancololo and last updated on April 28, 2026.
Li Zheng's dataset on figshare contains data related to the discovery of a potent RUVBL1/2 inhibitor for targeting MYC-driven cancers. The data, last updated in April 2026, includes PDB files totaling 836.8 KB. The description details a drug discovery campaign using a Single-Molecule Tracking (SMT) assay to optimize compound efficacy.
A 7-dimensional nonlinear ordinary differential equation model for malaria transmission, analyzed using He's Variational Iteration Method. The model's basic reproduction number, equilibrium points, and stability are examined, with solutions compared against a Runge-Kutta-Fehlberg method. The work was authored by Kingsley Akinfe of Tai Solarin University of Education.
The Australian Ocean Data Network provides a dataset on spectral representation of isostatic models. It describes the use of cross-spectral techniques and admittance functions to mathematically relate gravity and topography, improving computational efficiency over conventional methods. The dataset was last updated on 2026-05-05.
A 688.9 KB HTML document authored by Alberto Garcia-Hernandez, last updated on 2026-04-14. It presents a geometric framework for the synthesis method used in non-inferiority clinical trials, comparing it to the traditional fixed margin method. The work is licensed under CC-BY-4.0 and hosted on figshare.
A 5.5 KB Excel file comparing data repair methods for multivariate time-series with 20% simulated sensor failures. The dataset, authored by Dongdong Tang, evaluates the methods on reconstruction accuracy metrics such as RMSE and SSIM and on statistical significance. It was last updated on April 29, 2026.
A 100,000-iteration Monte Carlo simulation evaluates the economic feasibility of extracting bioactive oils from native Brazilian Cerrado fruits. The dataset contains saved variables for the 'FCO Integral' financing scenario from a technical-economic study authored by Raphael Luiz Fernandes Marques de Souza. It was last updated in April 2026.
Monte Carlo simulation results evaluate the economic viability of extracting oil from native Brazilian Cerrado fruits. The dataset contains 100,000 simulation iterations for a project with a total investment of R$ 451,674.34 and a ten-year analysis horizon. Author Raphael Luiz Fernandes Marques de Souza published the data in April 2026.
A spreadsheet from a 2026 study by Raphael Luiz Fernandes Marques de Souza stores one scenario from a Monte Carlo simulation. The analysis models the financial viability of extracting oils from native Brazilian Cerrado fruits like Pequi and Buriti. It contains 100,000 simulation iterations evaluating key financial indicators under two funding scenarios.
Raphael Luiz Fernandes Marques de Souza published a spreadsheet containing Monte Carlo simulation results for a technical and economic feasibility study. The analysis models a 10-year project for extracting oils from native Brazilian Cerrado fruits like Pequi and Buriti. The dataset was uploaded to figshare in April 2026.
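The four Cerrado entries above describe Monte Carlo runs over a project's financial indicators. A minimal sketch of that kind of simulation follows, for readers who want to see how such results are produced. Only the total investment (R$ 451,674.34) and the ten-year horizon come from the listing. The normally distributed annual cash flow, its mean and standard deviation, the discount rate, and all function names are illustrative assumptions, not values or methods taken from the study.

```python
import random


def simulate_npv(investment, years, mean_cash_flow, sd_cash_flow,
                 discount_rate, iterations, seed=0):
    """Return a list of simulated net present values (one per iteration).

    Assumption: each year's cash flow is an independent normal draw.
    The study's actual cash-flow model is not described in the listing.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is reproducible
    npvs = []
    for _ in range(iterations):
        npv = -investment
        for t in range(1, years + 1):
            cash_flow = rng.gauss(mean_cash_flow, sd_cash_flow)
            npv += cash_flow / (1.0 + discount_rate) ** t
        npvs.append(npv)
    return npvs


def share_positive(npvs):
    """Fraction of iterations in which the project has a positive NPV."""
    return sum(1 for v in npvs if v > 0) / len(npvs)


INVESTMENT = 451_674.34  # from the listing
YEARS = 10               # from the listing

# Hypothetical inputs, chosen only to make the sketch run.
npvs = simulate_npv(
    INVESTMENT, YEARS,
    mean_cash_flow=90_000.0,
    sd_cash_flow=30_000.0,
    discount_rate=0.10,
    iterations=10_000,  # the datasets report 100,000 iterations
)
print(f"share of iterations with NPV > 0: {share_positive(npvs):.3f}")
```

The share of iterations with a positive NPV is one common summary of such a run. The datasets themselves may report other indicators.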
251 machine-checked theorems of mathematical finance, formalized in Lean 4 on top of Mathlib and Rémy Degenne's BrownianMotion package. Each row is one theorem, containing its Lean statement and proof, its domain, and a faithfulness tier. The dataset was extracted from the formal-mathfin library by author raphaelrrcoelho and was last updated on 2026-05-31.
The CAMEX-4 DC-8 Microwave Temperature Profiler dataset covers Florida and surrounding regions. It contains vertical profiles of atmospheric temperature derived from passive microwave radiometer measurements at 56.6 and 58.8 GHz, collected aboard a DC-8 aircraft during the campaign from August 16 to September 25, 2002. The National Aeronautics and Space Administration produced these data, which provide a temperature profile every three kilometers along the flight path.
The Microwave Temperature Profiler (MTP) on the ER-2 aircraft measured atmospheric thermal emissions at 56.6 and 58.8 GHz. The instrument produced an altitude temperature profile every three kilometers along the flight path during the CAMEX-4 campaign. Data collection was managed by the National Aeronautics and Space Administration from the Jacksonville Naval Air Station, Florida.
A 5.5 KB Excel file containing statistical comparison results for machine learning classifiers. The dataset, authored by Sawera Qureshi, was last updated on May 21, 2026. It likely contains results from paired t-tests performed across cross-validation folds.
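For readers unfamiliar with the test mentioned above, a paired t-test across cross-validation folds compares two classifiers using their per-fold score differences. The sketch below computes the t statistic and degrees of freedom with the standard library only. The function name and the example scores are hypothetical and are not drawn from the dataset.

```python
import math
from statistics import mean, stdev


def paired_t_statistic(scores_a, scores_b):
    """Return (t, degrees_of_freedom) for paired per-fold scores.

    t = mean(d) / (stdev(d) / sqrt(n)), where d are the fold-wise
    differences and stdev is the sample standard deviation.
    """
    if len(scores_a) != len(scores_b):
        raise ValueError("both classifiers need one score per fold")
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    if n < 2:
        raise ValueError("at least two folds are required")
    sd = stdev(diffs)
    if sd == 0:
        raise ValueError("differences have zero variance; t is undefined")
    return mean(diffs) / (sd / math.sqrt(n)), n - 1


# Hypothetical accuracies for two classifiers over five folds.
fold_scores_a = [0.80, 0.82, 0.78, 0.85, 0.81]
fold_scores_b = [0.78, 0.79, 0.77, 0.80, 0.80]
t_stat, dof = paired_t_statistic(fold_scores_a, fold_scores_b)
print(f"t = {t_stat:.3f} with {dof} degrees of freedom")
```

Because training sets overlap between folds, the fold scores are not fully independent, so p-values from this test are best read as approximate.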
Performance metrics and statistical comparisons for classification models targeting Mild Cognitive Impairment and Alzheimer's Disease. The 9.5 KB Excel file was authored by Minsoo Kim and last updated on May 21, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
A dataset and MATLAB implementation for the DSDRO algorithm, focusing on bike-sharing system rebalancing with dynamic pricing. The materials include station coordinates, station demand data, and source code for the PSO-LNS solver, ablation study, and validation scripts. The dataset is authored by Jianrong Cai and was last updated on April 16, 2026.
Weihong Zhao authored a statistical analysis of semantic similarity metrics applied to individual sentence pairs. The dataset is available as a 5.5 KB Excel file on figshare. It was last updated on May 21, 2026.
Bayesian Model Parameters for Velocity (log(cm/s)) is a dataset of statistical parameters. Diego Lievano Parra authored the dataset, which was last updated on May 21, 2026. The dataset is stored in an XLS file and is 9.5 KB in size.