Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,446 datasets
An overview of datasets used for load forecasting and cost optimization in energy distribution networks. The overview is a 5.5 KB Excel file authored by Wei Xiong and last updated on May 29, 2026. It is shared under a CC-BY-4.0 license on the figshare platform.
TransGrid-CostOpt is a 5.5 KB Excel dataset associated with a hybrid transformer framework for predicting and optimizing costs of distribution network assets. Wei Xiong authored this dataset, which was last updated on May 29, 2026. Its small size suggests it likely contains a focused set of model results or parameters.
Experimental datasets supporting research on overcoming atmospheric limitations in satellite imagery. The 2.9 GB collection, published by dongyi li under a CC-BY-4.0 license, includes TIF and XLSX files. It was last updated on June 2, 2026.
A comparative table of statistical effects for electrophysiological measures related to attentional splitting. The dataset, authored by Vincenzo Ronca, was last updated on May 26, 2026. It likely contains a summary of significance tests and qualitative interpretations for Parietal Alpha, Inverse Frontal Beta, Frontal Theta/Beta ratio, and an MI-based ASI measure.
Empirical measurement data for NRA-POC, a stateless Tor-based secure communication system designed for legal professionals. The dataset contains results from four measurement campaigns covering metadata entropy, communication graph reconstructability, session artifact purge timing, and Perfect Forward Secrecy effectiveness. It is a companion dataset to a manuscript submitted to the Journal of Information Security and Applications (JISA) and was authored by Tomasz Janczewski.
A 2026 study by Horacio Rivarola presents a validated biomechanical and mathematical model for predicting the clinical efficacy of intra-articular sodium hyaluronate injections in osteoarthritis. The model, validated against clinical outcomes from 126 knee osteoarthritis patients, identifies an optimal viscoelastic window associated with superior clinical improvement. The research is published as a supplementary document on figshare under a CC-BY-4.0 license.
SIFORT is a geospatial forest database for Quebec composed of polygonal units averaging 14 hectares each. The system integrates forest information from up to five historical inventories, assigning data like cover type, disturbance origin, and species density to each grid cell. This fixed analysis grid, maintained by the Government and Municipalities of Québec, supports temporal studies for sustainable forest management.
A 2.2 KB CSV dataset from figshare, authored by Lavleen K. Mader and last updated in May 2026. It contains quantitative assay results establishing glutathione S-transferase (GST) kcat and KM values for a panel of structurally diverse electrophilic warheads used in targeted covalent inhibitor development.
A Root Mean Square Error (RMSE) comparison in which STGLDWeather outperforms competing methods. The 13.5 KB Excel file marks results that differ from GraphCast with statistical significance (p < 0.05). Author ZhiPeng Wu uploaded this comparative analysis to figshare in June 2026.
ZhiPeng Wu's dataset provides a performance evaluation on extreme events and categorical metrics for Wind Speed (u10). The data is stored in a 5.5 KB XLS file and was last updated on June 4, 2026. It is shared under a CC-BY-4.0 license, which permits reuse with attribution.
Results of Monte Carlo simulations and probability density function (PDF)–based error assessments authored by Jessica V. Eberle. The dataset is a 13.5 KB XLS file last updated on June 4, 2026. It is available under a CC-BY-4.0 license on the figshare platform.
A 9.5 KB Excel file provides a statistical summary of Quantitative Trait Loci (QTLs) significantly associated with dietary fiber-related traits. The QTLs were identified by Genome-Wide Association Study (GWAS) in populations of Vaccinium meridionale. The dataset was authored by Ginna Patricia Velasco Anacona and last updated on June 4, 2026.
A simulation experiment using samples from the Geoscience Australia Marine Samples Database to compare statistical and mathematical techniques for predicting seabed mud content. The study assessed five factors affecting accuracy, including regions, methods, and sample densities, using ten-fold cross-validation and secondary variables like bathymetry. The results are intended to improve the modeling of physical properties for marine biodiversity prediction.
Twelve authorship and contributorship dispute cases from the Committee on Publication Ethics forum were used to evaluate two large language models. Kannan Sridharan published this comparative analysis in April 2026. The dataset contains performance scores across seven evaluation domains for Google Gemini 2.5 Flash and DeepSeek-V3.2.
Twelve authorship and contributorship dispute cases from the Committee on Publication Ethics (COPE) forum were used to evaluate two large language models. The study, authored by Kannan Sridharan and shared in 2026, scored model responses across seven domains, including Actionability of Recommendations and Consistency with COPE Principles. Both Gemini and DeepSeek models achieved perfect scores in one domain but showed weaknesses in identifying specific ethical issues.
Twelve authorship and contributorship cases from the Committee on Publication Ethics forum were used to evaluate two large language models. Kannan Sridharan published comparative performance scores across seven domains, including Actionability of Recommendations and Consistency with COPE Principles, in April 2026. The dataset contains Likert scale scores and qualitative disagreement rates from this cross-sectional analysis.
A comparative evaluation scores the performance of Google Gemini 2.5 Flash and DeepSeek-V3.2 LLMs against expert Committee on Publication Ethics responses. The study analyzes 12 authorship and contributorship cases using three prompting strategies, with responses rated across seven domains on a 5-point Likert scale. The dataset was authored by Kannan Sridharan and published on figshare in April 2026.
A PDF document presents a cross-sectional analysis comparing the performance of Google Gemini and DeepSeek LLMs against expert COPE forum responses. The study includes 12 authorship and contributorship cases, with responses scored across seven domains on a 5-point Likert scale by independent raters. Author Kannan Sridharan published this 89.1 KB file on figshare in April 2026.
Twelve authorship and contributorship cases from the Committee on Publication Ethics (COPE) forum were used to evaluate two large language models. The dataset contains scores across seven domains, including Actionability of Recommendations and Identification of Ethical Issues, rated on a 5-point Likert scale. Kannan Sridharan published this comparative analysis in April 2026.
Twelve authorship dispute cases from the Committee on Publication Ethics forum were used to evaluate two large language models. Kannan Sridharan published this comparative analysis in April 2026. The dataset contains scored model responses across seven evaluation domains.