Loading...
Loading...
Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,444 datasets
Experimental data describes the discovery and characterization of FYJ-195, a type II FLT3 inhibitor for acute myeloid leukemia. The dataset includes in vitro IC50 values against specific resistance mutations and in vivo tumor growth inhibition results from xenograft models. Jin Yang authored this dataset, which was last updated on 2026-05-23.
A 233.4 KB dataset authored by Belmiro P.M. Duarte and last updated on 2026-06-02. It supports a framework for maximum von Neumann entropy optimal experimental design, leveraging semidefinite programming for multiresponse models with correlated outputs. The dataset includes Python code and a PDF document illustrating applications in dose-response, sensor placement, and kinetic modeling.
Experimental data from a study proposing a dynamic principal component analysis methodology for detecting progressive failures in spur gearboxes. The dataset covers combinations of load, speed, and failure severity, validated using vibration signals segmented and characterized with time and frequency domain indicators. It was authored by Antonio PΓ©rez-Torres and last updated on 2026-05-18.
Q-mode factor analysis classified estuarine sediment samples from Broad Sound, Queensland, into two geologically distinct groups representing intertidal and supratidal deposition. The dataset, hosted by the Australian Ocean Data Network, was last updated in June 2026. It identifies processes controlling concentrations of P2O5, Cu, Pb, and Zn using R-mode factor analysis and stepwise regression.
Lin Han authored a study investigating anti-Kelch-like protein 11 (KLHL11) encephalitis, published on figshare. The work includes detailed reports of two patient cases and a review of literature from the last six years. The dataset comprises a 415.8 KB PDF document.
Two case reports detail the clinical features and treatment responses of patients with anti-Kelch-like protein 11 (KLHL11) antibody-associated encephalitis. The data includes patient ages, antibody titers, treatment regimens, and outcomes. The document was authored by Lin Han and last updated on June 1, 2026.
A retrospective pharmacovigilance analysis of 5,860 ocular adverse event reports associated with five SNRIs extracted from the FDA Adverse Event Reporting System (FAERS) database from Q1 2004 to Q4 2024. The study, authored by Zhengtai Sun, used disproportionality analyses to identify drug-specific and sex-specific safety signals.
Farouk Mark Mukiibi's dataset documents verifiable citations and references to the Minimum Viable Relationships (MVR) Framework by major AI systems. It consolidates multi-platform proof of attribution across OpenAI ChatGPT, xAI Grok, Perplexity AI, Google Gemini, Meta AI, and Microsoft Copilot. The 1.1 MB dataset, last updated on 2026-05-24, includes machine-readable JSON evidence, public archives, and SHA-256 integrity hashes.
255 plant-based larvicidal compounds against the Zika vector Aedes aegypti were analyzed using QSAR models developed with CORAL software and Monte Carlo optimization. The dataset includes pLC50 bioactivity values and molecular docking results for selected compounds. S. Lotfi published the data on figshare in 2026.
A dataset from 2026 describes a series of biaryl-substituted pyrazolopyrimidine inhibitors of the TgCDPK1 enzyme for treating Toxoplasma gondii infections. The data, shared by Michael P. Mannino on figshare, includes compound properties optimized for metabolic stability, plasma protein binding, efflux, and pharmacokinetics. It is a small dataset of 8.8 KB in CSV format.
5.5 KB of statistical analysis results for the AMGST traffic forecasting model. The dataset, authored by Pei Shi and last updated in June 2026, contains experimental results from evaluating the model on four public traffic datasets. It is hosted on figshare under a CC-BY-4.0 license.
Slow Ripening Grapevine Genotypes contains supplementary datasets from a project characterizing grapevine material that fails to accumulate sugar until late in the season. The repository includes 11 Excel tables with statistical comparisons and logistic regression model parameters for traits like total soluble solids, berry weight, and firmness across multiple years and experiments. Pietro Previtali authored the dataset, which was last updated on June 4, 2026.
Hang Li's dataset contains survey results from 272 orthopedic theatre nurses across eight tertiary hospitals in Shanxi Province, China, collected between September and December 2024. The data includes scores for Knowledge, Attitude, and Practice (KAP) regarding the use of orthopedic power tools and results from multivariable regression analysis. It was last updated on figshare in May 2026.
Geoscience Australia conducted a simulation experiment comparing statistical and mathematical techniques for predicting seabed mud content across the Australian continental margin. The study assessed factors including regions, sample densities, and interpolation methods using bathymetry, distance-to-coast, and slope as secondary variables. A novel combined method, random forest and ordinary kriging (RKrf), demonstrated a relative mean absolute error up to 17% less than a control method.
Event permit applications for Chicago's public parks include details on the applicant, event type, description, park location, and scheduled times. The dataset is maintained by the Chicago Park District and tracks permit statuses. It is available in multiple formats including CSV, JSON, and XML.
Supplementary file 1 from a hybrid study by Ye Liu assesses modifiable risk factors for atrial fibrillation and flutter in young adults aged 15-39 years. The dataset likely contains results from a global burden analysis spanning 1990 to 2021 and a local cohort study, including age-standardized rates and regression analyses. It was last updated on figshare in May 2026 under a CC-BY-4.0 license.
Sixty-five anonymized clinical radiotherapy plans were used to benchmark three dose calculation algorithms against a Monte Carlo standard. The study, authored by Yutong Zhao and last updated in May 2026, found AXB generally most accurate, while CCC performed comparably for lung cases. All deterministic algorithms exhibited systematic dose deviations in lung tissue and planning target volumes.
34, 29, and 28 non-dominated solutions form Pareto fronts for pile foundation designs under 500, 700, and 1000 kN loads. The dataset contains results from an automated system integrating a Multi-Objective Optical Microscope Algorithm with PLAXIS 2D FEM software, authored by Min-Yuan Cheng and uploaded in May 2026. It demonstrates cost savings of 78-85% compared to high-safety designs while maintaining safety factors 19-34% above regulatory requirements.
Daily global gridded estimates of atmospheric carbon dioxide (XCO2) are produced by assimilating Orbiting Carbon Observatory-2 (OCO-2) satellite retrievals into the Goddard Earth Observing System (GEOS) model. The dataset uses a 3D-Var data assimilation technique to gap-fill observations, addressing coverage gaps from OCO-2's narrow 10-km track and cloud interference. It provides a synthesized view of CO2 concentrations based on satellite measurements and atmospheric transport modeling.
Global gridded monthly data provides assimilated atmospheric carbon dioxide (XCO2) concentrations. The dataset is produced by NASA's Global Modeling and Assimilation Office using a data assimilation technique that synthesizes high-quality OCO-2 satellite retrievals with GEOS model simulations to fill observational gaps. This process yields a continuous, analyzed state of atmospheric CO2 based on scientific understanding of the carbon cycle and atmospheric transport.