DataSalon

Discover quality datasets for AI training — aggregated from 40+ platforms, curated by AI.

Mathematics & Statistics Datasets | DataSalon

Mathematics & Statistics

Mathematical datasets, statistical benchmarks, probability, optimization, operations research

3,080 datasets

Bubble Size and Void Fraction Data in a Porous Sparger Column

A systematic study investigates bubble size distribution and void fraction in a batch bubble column under homogeneous and heterogeneous operation regimes. The work by Shahrouz Mohagheghian of Oklahoma State University examines the effects of liquid viscosity and gas superficial velocity. Results include statistical characterization of bubble size distributions using probability density functions and probability plots.

Tags: Tabular, Bubble Column, Multiphase Flow, Fluid dynamics, Chemical Engineering, +1

Autoresearch Novelty Bench: AI Agent Hypothesis Benchmark

A benchmark dataset for testing whether autonomous AI research agents propose novel, mechanism-distinct hypotheses. It contains 10,380 rows of experimental training runs built on Prime Intellect's autonomous-speedrunning archive. The dataset was created by Evo and last updated on May 17, 2026.

Tags: Tabular, Hypothesis Generation, Ai Research, Benchmark, Autonomous Agents, +1

Force Field Evaluations for Glycine Polymorphs and Solution Properties

Eighteen force field variants (10 OPLS and 8 GAFF models) were evaluated and recalibrated for glycine molecular dynamics. James W. Meadows used multiobjective Bayesian optimization to improve predictions of crystal lattice energies, densities, and solution enthalpies. The dataset was published on figshare in April 2026.

Tags: Tabular, ZIP, Force Field Optimization, Molecular Dynamics, Computational Chemistry, Glycine Polymorphism, +1

NYC Housing Unit Changes by Community District Since 2010

Net change in housing units for New York City Community District Tabulation Areas (CDTAs) tracks new construction, demolitions, and alterations since 2010. The dataset is aggregated from the NYC Department of City Planning's Housing Database, which is derived from Department of Buildings-approved jobs. The current version reflects data through the fourth quarter of 2025.

Tags: Tabular, Geospatial, CSV, XML, JSON, Housing Units, Job Application, Community District Tabulation Area, Class A, Building, Housing, Filing, Pipeline, Census Block, Co, Net Units, Proposed Units, Hotel, New York City, Class B, Unit Change, Urban Planning, Construction, Certificate Of Occupancy, Demolition, Census Geography, Nta, Neighborhood Tabulation Area, Cdta, +1

EFDI: Ecuadorian Corporate Financial Distress Data, 2000–2025

Replication files for a study on financial fragility in Ecuador under full dollarization. The dataset covers 26 years from 2000 to 2025 and includes corporate financial indicators, firm dissolution records, and sectoral calibration data. It was constructed by Fernando Negrete using public corporate registry and macroeconomic data.

Tags: Tabular, Emerging Economies, Econometrics, Financial Distress, Corporate Finance, Ecuador, Finance, +1

NYC Housing Unit Change by City Council District Since 2010

Net change in housing units for New York City Council Districts is tracked from 2010 onward, derived from Department of Buildings-approved construction and demolition jobs. The NYC Department of City Planning aggregates this data, with the current version reflecting updates through the fourth quarter of 2025. It includes columns for units completed in specific years (e.g., comp2010 to comp2024), filed, permitted, approved, and withdrawn.

Tags: Tabular, Geospatial, CSV, XML, JSON, Housing Units, Census Tract, Job Application, Puma, Urban Development, City Planning, Building, Housing, Community District, Filing, Pipeline, City Council District, Census Block, Co, Net Units, Dwelling Unit, Development, Unit Change, Construction, Permit, Certificate Of Occupancy, Nta, House, +1
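
As a rough illustration of how the per-year completion columns in this dataset might be used, the sketch below sums them into one total per district. It assumes a CSV export with a district identifier column (named `district` here, a hypothetical name) and columns following the `comp2010` to `comp2024` pattern mentioned in the description. The sample rows are invented for illustration and are not real data.

```python
import csv
import io

# Invented sample rows; the real file's layout and column names may differ.
SAMPLE_CSV = """district,comp2010,comp2011,comp2012
1,120,-5,40
2,0,15,30
"""

def total_completed(rows, prefix="comp", id_column="district"):
    """Sum, per row, every column named `prefix` followed only by digits."""
    totals = {}
    for row in rows:
        year_cols = [
            name for name in row
            if name.startswith(prefix) and name[len(prefix):].isdigit()
        ]
        # Net unit changes can be negative (e.g. demolitions), so parse as int.
        totals[row[id_column]] = sum(int(row[name]) for name in year_cols)
    return totals

rows = list(csv.DictReader(io.StringIO(SAMPLE_CSV)))
totals = total_completed(rows)
print(totals)
```

Matching columns by prefix rather than listing each year keeps the sketch working if later releases add further `comp` columns.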

EarthCARE-TC: Tropical Cyclone Structure Analysis Scripts

Jin-De Huang provides analysis scripts for processing EarthCARE satellite observations of tropical cyclones. The 386.1 MB repository includes workflows for data preprocessing, colocation with TC tracks, and statistical analyses of radar reflectivity, Doppler velocity, and cloud microphysical properties. The code was last updated on May 1, 2026, and is released under an MIT license to ensure reproducibility.

Tags: Tabular, Tropical Cyclones, Satellite Data, Radar Analysis, Cloud Microphysics, Earthcare, +1

Seabed Mud Content Simulation for Australian Margin Interpolation

A simulation experiment from Geoscience Australia compares statistical and mathematical techniques for predicting seabed mud content. The study evaluates factors like sample density, search neighborhoods, and secondary variables including bathymetry and distance-to-coast. It identifies a novel combined method, random forest and ordinary kriging (RKrf), which reduced relative mean absolute error by up to 17% compared to a control.

Tags: Tabular, 🇦🇺 Australia, Spatial Interpolation, Seabed Mud, Bathymetry, Geoscience, +1

SemEval-2018 Performance Results with Statistical Validation

Performance metrics from the SemEval-2018 evaluation campaign. The dataset includes 95% confidence intervals calculated via 10-fold cross-validation to reflect result variability. It was authored by Weiguang Dong and last updated on April 27, 2026.

Tags: Tabular, Excel, Nlp Evaluation, Semantic Analysis, +1
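
The listing does not say how the 95% confidence intervals were derived from the 10 folds. One common approach, shown here purely as an assumption, is a Student's t interval around the mean fold score. The scores below are invented for illustration, and the function name `fold_confidence_interval` is hypothetical.

```python
import math
import statistics

# Two-sided 95% critical value of Student's t with 9 degrees of freedom
# (10 folds minus 1), rounded to three decimals.
T_CRIT_DF9 = 2.262

def fold_confidence_interval(scores, t_crit=T_CRIT_DF9):
    """Return (low, high) for a t-based interval around the mean fold score."""
    mean = statistics.mean(scores)
    # Standard error of the mean, using the sample standard deviation.
    sem = statistics.stdev(scores) / math.sqrt(len(scores))
    half_width = t_crit * sem
    return mean - half_width, mean + half_width

# Invented per-fold scores, not values from the dataset.
scores = [0.71, 0.69, 0.73, 0.70, 0.72, 0.68, 0.74, 0.70, 0.71, 0.72]
low, high = fold_confidence_interval(scores)
print(f"mean={statistics.mean(scores):.3f}, 95% CI=({low:.3f}, {high:.3f})")
```

Cross-validation folds share training data, so an interval of this kind is only approximate; the dataset's own intervals may have been computed differently.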

Winter Greenhouse LED Photoperiod Supplementation Experimental Setup

Rayane Barcelos Bisi published a summary of the experimental setup for a response surface design study on winter photoperiod supplementation with LED lighting in greenhouses. The 13.5 KB XLS file details plant material, greenhouse conditions, photoperiod strategy, LED spectra, and statistical analysis. It was last updated on April 30, 2026.

Tags: Tabular, Excel, Response Surface Design, Photoperiod, Greenhouse Experiment, Plant Science, +1

Effects of Tamsulosin and Pioglitazone on Crystalline Nephropathy

Raw data and dot plots supporting a manuscript on the effects of tamsulosin and pioglitazone across the progression of crystalline nephropathy. The dataset was authored by Mariana Perez and last updated on 2026-05-04. It is a 1.2 MB file available under a CC-BY-4.0 license.

Tags: Tabular, Nephrology, Medical Research, Pharmacology, Clinical Data, +1

Keppel Bay Seabed Sediment Classes Based on Grainsize and Acoustic Mapping

A study of sediment distribution in Keppel Bay, a macrotidal environment at the interface of the Fitzroy River catchment and the Great Barrier Reef shelf. Researchers classified seabed sediments into five distinct classes using statistical techniques on grainsize, chemical composition, and modelled seabed shear stress data. The dataset is hosted by the Australian Ocean Data Network and was last updated in April 2026.

Tags: Audio, Geospatial, Sediment Transport, Oceanography, Geomorphology, Seabed Mapping, Coastal Science, +1

Optimization Results for Mediterranean Urban Block Thermal and Energy Performance

This dataset contains 135 simulated configurations from eight Mediterranean city case studies, used to optimize urban block design for thermal comfort and energy use. It includes input parameters, UTCI-based comfort scores, Energy Use Intensity values, and Pareto front results. Author Alice Vicini published this 28.6 KB Excel file on figshare in April 2026 under a CC-BY-4.0 license.

Tags: Tabular, Excel, Urban Planning, Mediterranean, Energy Performance, Simulation, Thermal Comfort, +1

Arithmetic Dataset by WhirlwindAI

Arithmetic data published on the Hugging Face platform by WhirlwindAI. The dataset was last updated on July 7, 2026. The specific content, scale, and structure require verification after download.

Tags: Tabular, Numerical Data, Arithmetic, Mathematics, Education, +1

Reality Drift: Conceptual Terms for Algorithmic Mediation and Perception

A working paper record from the Reality Drift Archive, last updated on 2026-04-26, introduces a set of conceptual terms for analyzing digitally mediated environments. It groups terms like Filter Fatigue, Synthetic Realness, and the Optimization Trap to describe patterns in algorithmic mediation and cultural systems. The document is retained as an exploratory archive for descriptive and comparative purposes.

Tags: Text, Algorithmic Mediation, Digital Media, Cultural Analysis, Conceptual Framework, Synthetic, +1

Fitzroy River Basin Sediment Source Analysis Using Geochemical and Bayesian Models

Sediment sources to the Fitzroy River coastal zone have been identified and quantified using an integrated geochemical and modelling approach. Geochemical data indicate a sediment composition consistent with derivation from mixed catchment sources, with a Bayesian statistical model revealing changes in catchment sediment sources over time. The dataset is provided by the Australian Ocean Data Network and was last updated on 2026-04-16.

Tags: Tabular, Time Series, Hydrological Modelling, Coastal Zone, Sediment Sources, Geochemistry, Land Use Impact, +1

Atlantic Cod Spawning Biomass Estimates for Southern Gulf of St. Lawrence

Fisheries and Oceans Canada provides yearly spawning stock biomass estimates for the Atlantic Cod stock of the southern Gulf of St. Lawrence (NAFO 4T-4Vn, November to April). The data, produced via a Statistical Catch-at-Age model and Markov Chain Monte Carlo simulations, include median estimates and uncertainty percentiles (2.5th, 25th, 75th, 97.5th) measured in thousands of tons. These estimates support stock assessment and fisheries management decisions.

Tags: Tabular, Time Series, 🇨🇦 Canada, CSV, Atlantic cod, Biomass Estimation, Fisheries Management, Finance, Marine Biology, +1

Australian Coastal Storm Event Statistics with Clustering Analysis

A 30-year timeseries of observations from the eastern and southern coast of Australia was used to extract independent storm events. The dataset contains multivariate summary statistics for extreme storm events, including maximum significant wave height, wave period, direction, duration, peak storm surge, and time of occurrence. This data was produced by Geoscience Australia as part of the Bushfire and Natural Hazards CRC Project, with preliminary results from a study site on the central coast of New South Wales.

Tags: Tabular, Time Series, External Publication, Earth sciences, Extreme Ocean Climate, Coastal Hazards, Geoscience Australia, Storm Clustering, Wave Statistics, Marine, Conference Paper, Published External, Synthetic, +1

RUL Prediction Performance Comparison Under Sensor Fault Conditions

Dongdong Tang published a 5.5 KB Excel file on figshare in April 2026. The dataset compares the performance of a proposed framework against representative deep learning models for predicting Remaining Useful Life (RUL) under sensor fault conditions. Evaluation metrics include RMSE, the standard C-MAPSS scoring function, and statistical significance.

Tags: Tabular, Excel, Prognostics, Cmapss, Rul Prediction, Sensor Fault, Deep Learning, +1

RUL Prediction Performance Comparison Under Sensor Fault Conditions

A comparison of Remaining Useful Life prediction performance between a proposed framework and representative deep learning models under sensor fault conditions. The dataset was authored by Dongdong Tang and last updated on 2026-04-29. Performance is evaluated using RMSE, the standard C-MAPSS scoring function, and statistical significance.

Tags: Tabular, Excel, Prognostics, Rul Prediction, Sensor Fault, Performance Comparison, Deep Learning, +1
