Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,487 datasets
A dataset from Kaggle's Research category concerning the development of wearable bioparameter monitoring devices. The data likely contains parameters for graphene-polymer composites optimized using Bayesian methods. The author, organization, and temporal coverage are unknown.
Research data from Kaggle concerning the rapid development and optimization of a dual-action Donepezil-Memantine L. The dataset's specific contents, such as experimental parameters or compound properties, are not detailed in the provided metadata. Author, organization, and temporal coverage are unknown.
A research dataset likely designed for developing and testing a hybrid machine learning model. The dataset's specific contents and scale are not detailed in the provided metadata. It originates from Kaggle and is categorized under the 'Research' tag.
Real-time high-resolution 3D urban reconstruction data likely generated using a Bayesian octree method. The dataset's author, organization, and specific scale are not provided in the metadata. It is hosted on Kaggle under the 'Research' tag.
Kaggle hosts a research dataset on scalable Bayesian credibility models. The dataset likely contains data for modeling trust attribution within federated learning systems. Its specific size, author, and update history are not detailed in the provided metadata.
Replication files for the paper 'Treatment Effect on the Association between Outcomes'. The data is associated with research in causal inference and the social sciences. It was authored by Lai Wei and is hosted on Harvard Dataverse.
SAReasoning-5000000 is a synthetic dataset containing algebra math reasoning problems, created by DataMuncher-Labs. The dataset likely contains 5 million rows, each with fields for question, problem, how_to_solve, and answer. It was last updated on December 29, 2025.
LSReasoning-10000000 is a large-scale synthetic dataset for mathematical reasoning, built via a Python script. The dataset includes problems covering addition, subtraction, multiplication, division, linear equations, fractional equations, two-step equations, and algebra word problems. It was created by DataMuncher-Labs and last updated on December 29, 2025.
ggalt is an R package authored by Bob Rudis that extends the ggplot2 visualization library. The package likely contains code and data structures for implementing additional coordinate systems, geometric objects, and statistical transformations. It is listed on the Papers with Code platform, which suggests a focus on computational and graphical methods.
UCI Machine Learning Repository hosts this dataset related to automated theorem proving in first-order logic. The dataset's exact size, creator, and creation date are not specified in the provided metadata. It is intended for research and development in automated reasoning and symbolic AI.
Harvard Dataverse hosts m-files implementing the eSLPN optimization algorithm for three simulations from a related paper. The files include the older eSLP algorithm and results from a previous repository deposit. The metadata lists the author as 'Optimization, Local Pursuit', and the dataset was last updated in January 2026.
A synthetic dataset of 7.5 million algebraic reasoning problems built via a Python script. It includes problems on topics like two-step equations, fractions, exponents, inequalities, algebra word problems, quadratic equations, and systems of equations. The dataset was created by DataMuncher-Labs and last updated on December 29, 2025.
Replication data for the statistical results of a study on how threats of U.S. withdrawal from NATO affect European public attitudes towards defense. The data was created by Hannah Jakob Barrett and is hosted on the International Organization (IO) Journal Dataverse. Row and column counts are not specified.
A dataset likely containing Doppler frequency signals for research into hybrid compression techniques. The data appears to be sourced from Kaggle under the 'Research' tag, though specific authorship and collection details are not provided. The dataset's size, temporal coverage, and specific signal sources are unknown.
A dataset associated with a framework for real-time optimization that likely combines graph-convolutional neural networks with Bayesian optimization methods. The dataset's specific content, size, and creator are not detailed in the provided metadata. It is hosted on Kaggle and categorized under 'Research'.
Real-Time Raman Spectroscopic Monitoring Coupled with Bayesian Optimization is a dataset from Kaggle. The dataset likely contains spectroscopic data used for process monitoring and optimization in manufacturing or research contexts. The dataset's author, organization, size, and last update date are unknown.
A filtered subset of mathematical problems with non-negative integer answers. The dataset is hosted on Kaggle and is associated with the Nemotron project and the AIMO 2026 event. Its specific size, authorship, and licensing details are not provided in the available metadata.
Kyleyee published a dataset titled 'Arithmetic Few Shot' on HuggingFace. The dataset likely contains examples for few-shot learning tasks in arithmetic. Its content and structure require verification after download.
Monthly data from the Federal Reserve tracks industrial production and capacity utilization for manufacturing, mining, and utilities. This Principal Federal Economic Indicator helps analyze structural economic developments and business cycle variations. The dataset provides industrial detail for the U.S. economy.
Hard math problems from the Nemotron-Math-v2 collection, specifically those with an accuracy below 0.5. The dataset includes compressed tool-use information. The number of rows, columns, and specific data fields are unknown.