Loading...
Loading...
Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,486 datasets
Telephony data from a VICIdial or Asterisk call center system, focused on the initial two-week operational period. The dataset's origin, size, and specific update date are unknown. It appears to be raw data exported from the platform for analysis.
VICIdial and Asterisk telephony data supports mathematical modeling for contact rate optimization. The dataset's specific volume and creation details are not provided. The author and last update date are unknown.
A guide for optimizing contact rates in telephony systems, focusing on Do-Not-Call list scrubbing and best practices. It is based on data from VICIdial and Asterisk call center platforms. The author and temporal coverage are unknown.
VICIdial/Asterisk telephony data was discussed in a podcast episode focused on contact rate optimization mathematics. The dataset originates from VICIdial, an open-source call center suite. Details on volume, creation date, and specific author are not provided.
VICIdial and Asterisk telephony data supports analysis for optimizing call recording storage. The dataset's specific volume, creator, and update schedule are not documented in the provided input. Further details on row count, columns, and temporal coverage are unavailable.
9,300 declarations extracted from the Coq source files of the MathComp Analysis library. The dataset provides structured formal proofs for classical real analysis, compatible with the Mathematical Components (MathComp) ecosystem. It was created by user 'phanerozoic' and last updated on January 13, 2026.
NVIDIA's Nemotron-RL-math-stack_overflow dataset contains mathematical problems and their corresponding solutions extracted from Stack Overflow forums. The extraction method is similar to that used for the OpenMathReasoning dataset, and only posts where an answer was successfully extracted are included. This dataset was released as part of NVIDIA NeMo Gym and was last updated on January 12, -2026.
Rglpk is an R software interface to the GNU Linear Programming Kit (GLPK), an open-source solver for large-scale linear programming (LP) and mixed integer linear programming (MILP) problems. The package was authored by Stefan TheuΓl. Its specific version, update history, and the exact nature of the example data it may include are not detailed in the provided metadata.
Aaron A. King's POMP package provides tools for analyzing partially observed Markov processes, also known as stochastic dynamical systems and hidden Markov models. It implements facilities for simulating these models and fitting them to time series data using frequentist and Bayesian methods. The package serves as a versatile platform for inference methods applicable to general POMP models.
gstat is a dataset for geostatistical modelling, prediction, and simulation. The description indicates it supports variogram modelling, kriging, and spatio-temporal kriging, as well as simulation methods. It was authored by Edzer Pebesma and is hosted on the paperswithcode platform.
Hans W. Borchers authored a collection of functions for numerical analysis and linear algebra, numerical optimization, differential equations, and time series. The package includes well-known special mathematical functions and uses 'MATLAB' function names where appropriate to simplify porting of code.
The replication package for the paper 'Optimal Tests Following Sequential Experiments', accepted for publication in the Journal of Political Economy in 2026. It contains data and code to reproduce the study's findings on hypothesis testing in sequential experimental designs.
A dataset related to mathematical logic and its implementation using the C++ programming language. It was published on the Kaggle platform. The specific content, size, and authorship details are not provided in the available metadata.
2,952 declarations extracted from Coq source files formalizing category theory without axioms. The dataset is sourced from the GitHub repository https://github.com/jwiegley/category-theory and was last updated on January 13, 2026. It was authored by phanerozoic.
Statistical tables on capital punishment for the year 2007, aggregated from paperswithcode. The dataset likely contains quantitative data on death penalty cases, sentences, and demographics. Its specific variables and geographic scope require verification after download.
This dataset relates to the automated verification and optimization of temporal logic models for neuroprosthetic control systems. It involves formal verification methods and time series analysis, as indicated by its tags.
Encompassing code and data developed for a manuscript applying the Theory of Functional Connections to reconcile an approximate ordinary differential equation (ODE) for soil moisture evolution with noisy satellite data. The data is extrapolated from the NASA SMAP L4 data product. The author is Salvatore Calabrese.
2026 calculations model resonant scattering of He I 1.0833um triplet photons in uniform density spherical H II regions. The dataset contains results from Monte Carlo radiative transfer simulations performed by Bruce Draine, comparing models with and without dust. Specific row and column counts are not provided.
FrontierCO is a curated benchmark suite for evaluating machine learning-based solvers on large-scale and real-world combinatorial optimization problems. The benchmark spans 8 classical CO problems across 5 application domains, providing both training and evaluation data. It was created by CO-Bench and last updated on Hugging Face in January 2026.
This dataset supports the replication of results for a study on monitoring functional anomalies in cyclical water treatment processes. It contains data and code authored by Hunter J. Privett and collaborators from the Mo(Wa)Β²TER Datasets organization. The dataset was last updated in February 2026.