Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,487 datasets
Data for analyzing electric vehicle charging patterns, station loads, and grid optimization metrics. The dataset is hosted on Kaggle, but its specific temporal coverage, creator, and size are not provided. Its description suggests it is intended for analysis related to energy infrastructure and vehicle electrification.
This replication package contains the code to reproduce all analyses from the paper 'More Powerful Cluster Randomized Control Trials'. The author is Brendon McConnell, and the package was last updated in February 2026. No external datasets are included in the analysis.
Standard Performance Evaluation Corporation (SPEC) CPU benchmark data used for a historical and statistical analysis of hardware artifacts. It was compiled by Yueyao Wang for a 2024 IEEE paper. The data supports research into long-term computing performance trends.
50 curated hard math problems challenge large language models through high-difficulty reasoning tasks. The collection targets advanced mathematical logic to identify performance plateaus in current AI systems.
Kaggle hosts a dataset for practicing advanced statistical modelling techniques. The description indicates it is intended for machine learning applications. The dataset's author, size, and specific contents are not detailed.
Mathematical Reasoning Dataset is a collection of data for AI and machine learning applications, sourced from the Kaggle platform. The dataset's specific content, size, and creation details are not provided in the available metadata. Its intended use is likely related to training or evaluating models on mathematical problem-solving tasks.
A dataset combining medical and mathematical reasoning tasks, published on Kaggle. The specific content, size, and structure require verification after download. Its creation details and update history are not provided in the available metadata.
150,000 expert-level Python debugging pairs intended for fine-tuning large language models and Direct Preference Optimization. The dataset is hosted on Kaggle and is categorized for computer science and natural language processing applications. The original author, organization, and specific creation date are not provided.
Data Management and Sharing Plan (DMSP) for a research project focused on optimizing tannic acid lipid nanoparticles for a therapeutic mRNA vaccine against melanoma. The plan describes the scientific data to be generated and outlines a strategy for managing and sharing project data. Specific details on data volume, structure, and content are not provided.
Data Management and Sharing Plan describing the scientific data to be generated and/or used in a research project on interacting particle systems and applications. The plan outlines a strategy for managing and sharing the project data. The dataset's specific row count, column count, and data structure are unknown.
Data Management and Sharing Plan for a research project on stochastic modeling of active particle systems on non-Euclidean spaces. The plan describes the scientific data to be generated and outlines a strategy for managing and sharing project data. The author is Wai Fan, and the record was last updated in February 2026.
A dataset for co-optimization of e-fuel systems, likely containing parameters for machine learning-assisted design. The dataset is hosted on Kaggle and is associated with a pre-trained model. Specific details on size, authorship, and temporal coverage are not provided in the available metadata.
Transmission line sensor data focused on energy-efficient compression and transmission optimization for smart grids. The dataset is hosted on Kaggle and is tagged as suitable for beginners. Author, organization, and specific data scale are not provided.
A module by Ronni Ross from 2026, described as an undefined attractor for creating conceptual space. The dataset lacks concrete specifications on size, format, or structure.
Arithmetic Test dataset published on Hugging Face by Kyleyee. The dataset was last updated on 2026-02-09 14:22:47. Its specific content, size, and structure are unknown from the provided metadata.
7.5 million mathematical reasoning problems generated via a Python script. The dataset covers a wide range of topics including arithmetic, algebra, fractions, exponents, and word problems. It was created by DataMuncher-Labs and last updated on December 29, 2025.
Code for optimizing layouts in smart remanufacturing contexts. It was authored by Abhijit Gosavi and last updated in February 2026.
A sequence of prime numbers that are simultaneously fractal palindromic primes of first order and Sophie Germain primes. The dataset, created by Emanuele Pace, explores the question of whether there are infinitely many such primes. The specific number of rows and columns is unknown.
Sales_prediction_using_fnn_with_optimization is a dataset hosted on Kaggle. The title suggests it contains data for training a feedforward neural network (FNN) to predict sales. The dataset's specific content, size, and origin are not detailed in the available metadata.
Two categories of e-commerce data cover Meta Ads marketing metrics and Shopify Store sales records. These records pair social media advertising performance with direct-to-consumer store transactions. The content provides a synchronized view of digital advertising spend alongside retail conversion data.