Loading...
Loading...
Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,454 datasets
Monthly updated statistics on hotel stays, guests, and occupancy rates in Utrecht's public area. The dataset, sourced from CBS Statline and managed by Utrecht Marketing, also includes reports on tourism, business visits, conferences, and monthly visitor figures for museums, theaters, and historical buildings behind a login. It contains descriptive and statistical information about residents, visitors, companies, and talents in the city.
Statistical data on licensed ferment-on-premises operators in Nova Scotia. The dataset compares the total volume of production of wine and beer, excluding kits sold for home production. It is provided by the Government of Nova Scotia and was last updated on April 17, 2026.
26.9 hours of motion data from 140 subjects across 10,386 trials. ForceBody pairs the SKEL parametric body model with measured ground reaction forces and inverse-dynamics joint torques. A subset of 8,652 trials includes per-frame, per-joint Monte Carlo uncertainty estimates for torque labels.
OR-Space is a full-lifecycle workspace benchmark for evaluating LLM agents on industrial optimization tasks. The benchmark, created by Chenyu-Zhou, structures each instance with separate files for business requirements, parameters, source code, and solver artifacts. It was last updated on May 18, 2026.
384 hours of observations with the Karl G. Jansky Very Large Array (VLA) at 3 GHz produced this catalog of 10,830 radio sources down to a 5-sigma threshold. The survey covers the 2 square degree Cosmic Evolution Survey (COSMOS) field with a median rms of 2.3 ΞΌJy/beam at 0.75 arcsecond resolution. The catalog was created by the HEASARC in June 2017 based on the reference paper's data release.
Shijuan Yang's dataset contains lifetime data for gas turbine components, including failure times and controllable factor settings. The 34.1 KB XLSX file is used to demonstrate a mixture Weibull regression and multi-objective optimization framework for lifetime-oriented quality design. It was last updated on 2026-04-30 and is shared under a CC-BY-4.0 license.
Supplementary material 3 from a Bayesian network meta-analysis evaluating therapies for granulomatous mastitis. The 25.0 KB XLS file, authored by Pin Wang and last updated in April 2026, provides operational definitions for clinical outcomes like Complete Response (CR), Overall Response Rate (ORR), and Recurrence Rate (RR) used in the included studies.
Supplementary data from a Bayesian network meta-analysis evaluating therapies for granulomatous mastitis. The dataset contains matrices of pairwise comparisons for overall response rates, reported as odds ratios and 95% credible intervals. It was authored by Pin Wang and published on figshare under a CC-BY-4.0 license.
NYC Department of City Planning's Housing Database tracks net changes in housing units for New York City Community Districts. It aggregates data from Department of Buildings-approved construction and demolition jobs filed or completed since January 1, 2010. The dataset includes census unit counts, net changes, and units pending completion.
A 9.5 KB Excel file containing results from a hyperparameter optimization study. The work by Mario Koddenbrock, last updated in May 2026, demonstrates that tuning on synthetic SynthMT images can improve the SAM3Text model to human-grade performance on unseen, real IRM data.
A 5.5 KB Excel file containing the results of a model performance comparison with statistical analysis. The dataset, authored by Shanyue Wang, was last updated on April 28, 2026. It calculates the delta, or difference, between the outputs of two models named MsgaBpred and EpiGraoh.
Statistical results of coal petrographic identification for the B-coal seams in the Xishanyao Formation. The dataset is a 19.1 KB XLSX file authored by Bin Chen and last updated on 2026-05-05. Its license is CC-BY-4.0, facilitating open reuse.
Statistical results from coal petrographic identification of the B-coal seams in the Xishanyao Formation. The dataset is a 9.5 KB XLS file authored by Bin Chen and last updated on May 5, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
Hongjing Chang published a dataset titled 'Samplesβ Evaluations on 4 Translations in RRQ Based on Descriptive Statistical Analysis and One-Sample T Test' on figshare in May 2026. The 5.5 KB XLS file likely contains statistical evaluation data for four different translations, possibly related to a research questionnaire (RRQ). The dataset's specific row count and column details are not provided in the metadata.
Statistical results from a study proposing a Transformer-XGBoost model for predicting 28-day cement compressive strength. The method was validated using real-world strength testing data from cement plants, achieving an average RΒ² of 0.94 in Monte Carlo cross-validation.
28-day compressive strength data from cement plants was used to validate a hybrid Transformer-XGBoost prediction model. The model achieved an average RΒ² of 0.94 in 25 Monte Carlo cross-validations, demonstrating high accuracy for small-sample scenarios. The dataset contains the results of this optimization study, authored by Dianyuan Ju and shared in 2026.
892.7 KB of research materials authored by Rachid Belfadli, last updated on April 22, 2026. The content includes a paper proving the existence and uniqueness of solutions for two classes of doubly reflected backward stochastic differential equations driven by pure jump Markov and jump semi-Markov processes. The analysis is based on the Snell envelope technique and a penalization method.
Simulation, training, and optimization data supporting a 2026 research publication on Floating Production Storage and Offloading (FPSO) units. The dataset, 65.2 MB in size, was created by Jiaqi Zhang and colleagues to capture non-linear fluid-structure-mooring interactions under extreme environmental conditions. It is stored in DAT and XLSX file formats.
Opus 4.6 10000X is a dataset of 10,000 high-fidelity reasoning traces synthesized using the Claude Opus 4.6 model. It was created by user 'ansulev' and last updated on Hugging Face in May 2026. The dataset is designed to capture the model's internal 'Chain of Thought' and reasoning patterns.
981 globally distributed hosting providers form the basis of this large-scale quantitative study correlating server technology stacks with DNS resolution efficiency. The analysis isolates the impact of Cloudflare, LiteSpeed, Apache HTTPD, and Nginx using a 10% trimmed mean methodology to exclude latency outliers. Empirical findings indicate a statistically significant performance advantage for edge-native, decentralized architectures over legacy centralized setups.