Loading...
Loading...
Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,469 datasets
Statistical significance testing results comparing traditional machine learning models with their hybrid SDNN counterparts across various performance metrics. The 5.5 KB Excel file was authored by Muhammad Ishaq and last updated on March 17, 2026. It is licensed under CC-BY-4.0 and hosted on figshare.
A dataset containing results from a Bayesian hierarchical model analyzing student retention and academic advising effects. The 5.5 KB Excel file was authored by Moeketsi Mosia and last updated on March 17, 2026. It is licensed under CC-BY-4.0 and hosted on the figshare platform.
Bayesian hierarchical model results for academic progression, published by Moeketsi Mosia on figshare in March 2026. The dataset is 5.5 KB in size and is available under a CC-BY-4.0 license. It likely contains statistical outputs from a model analyzing student retention and advising effectiveness.
Sirocco-rt's SPECULATE AGN Testgrid V1.3 is a synthetic spectral library for active galactic nuclei. The dataset is designed for the rapid inference of accretion disc wind properties and is associated with the SIROCCO Monte Carlo ionization and radiative transfer code. The dataset page was last updated on 2026-03-16.
Brazilian national survey data from 1975 to 2008 documents secular trends in breastfeeding practices. The median duration of breastfeeding increased from 2.5 to 11.3 months, and exclusive breastfeeding prevalence for infants under six months rose from 3.1% to 41.0%. The study was authored by Sônia Isoyama Venâncio and reanalyzes seven national surveys using consistent statistical techniques.
A meta-analysis integrating and statistically synthesizing research on automated linguistic analysis for deception detection. The study, authored by Valerie Hauch of the University of Giessen, revealed small but significant effect sizes for specific word categories. It addresses inconsistent findings in the literature on using computer programs to analyze verbal cues.
Korbinian Pacher published code on figshare in April 2026 to reproduce statistical models and figures. The code supports a study on strategic predator attack locations and collective prey defense. The resource is intended to facilitate reproducible ecological research.
A 2012 paper by Tiago Soares dos Reis and Walter Gomide from the Instituto Federal do Rio de Janeiro proposes a formal construction of the transreal numbers. This mathematical extension of the real numbers is defined as a class of subsets of ordered pairs of real numbers, establishing closure under addition, subtraction, multiplication, and division, including division by zero.
Kaggle hosts this dataset for time series forecasting. It likely contains historical demand and inventory data for a pharmaceutical company. The author, organization, and last update date are unknown.
UK government data from the Department for Environment, Food & Rural Affairs (Defra) on annual enquiries received via its Enviro inbox. The dataset includes four Excel tabs categorizing queries by description, organization type, subject matter, and type of comment. It is provided by the Government Digital Service under Crown Copyright.
Annual figures relating to the number and type of queries coming into the Enviro inbox on water quality and abstraction. The dataset is structured across four tabs covering query descriptions, organizational types, subject matter, and comment types. It is provided by the UK Government Digital Service with Crown Copyright.
Continuous Plankton Recorder survey data from the North Sea is used to test causal links between zooplankton abundance and the North Atlantic Oscillation. The project examines long-term trends in resident taxa and those dependent on advective input. The dataset is hosted by NASA EarthData and originates from the organization SCIOPS.
Thirty ensemble simulations model the impact of weak random forcing on North American and Eurasian ice sheets. The data presents anomalous freshwater flux to the oceans, calculated as the difference from an unperturbed model run. The model is based on the work of Tarasov and Peltier (1997), with parameters unchanged except for the applied forcing.
GIS boundary and centroid data delineates CCAMLR Statistical Reporting Subareas for Antarctic marine resource management. The dataset includes line boundaries and point centroids with area name attributes, created by the Australian Antarctic Data Centre. It is no longer actively maintained as the boundaries are now available directly from CCAMLR's Online GIS.
Synthetic time-series data generated from a detailed dynamic process model of a CO2 purification plant, used to design and test a real-time optimization and Kalman filter-based control framework. It captures process variables, estimated states, and controller setpoints under various operational scenarios like ramps, load changes, and disturbances. The data was created to estimate unmeasured mass flows and compute optimal solvent flow setpoints to minimize usage while maintaining food-grade CO2 purity.
Government of Ontario statistical reports detail operations for county libraries and county co-operatives. Data is published in HTML and XLSX formats under an open license. The dataset was last updated in February 2026.
Statistical data tracks beekeepers, bee colonies, honey production, yield, average price, and honey value in Ontario. The dataset is provided by the Government of Ontario and was last updated in February 2026.
Statistical data on the production and farm value of maple syrup produced in Ontario. The dataset is provided by the Government of Ontario and was last updated in February 2026. It covers annual production volumes and associated economic values.
Statistical areas for the offshore eastern United States defined by NOAA's National Marine Fisheries Service. The dataset is intended for use in scientific research and regulatory reporting. It was last updated on March 14, 2026.
A de-identified survey dataset used for statistical analysis in a clinical study. The dataset contains responses from 246 patients regarding their awareness and communication about corticosteroid injections. It was authored by Juliet Chung and last updated on March 18, 2026.