Loading...
Loading...
Mathematical datasets, statistical benchmarks, probability, optimization, operations research
2,489 datasets
Force spectroscopy records for analyzing protein unfolding and unbinding events, generated to validate the FEATHER algorithm. The data was used to demonstrate a 30-fold improvement in event location precision and an eightfold improvement in distribution accuracy compared to reference algorithms. It was authored by Patrick R. Heenan and published in 2020.
Data from a study evaluating Bayesian species delimitation methods for cryptic mycorrhizal fungi. It includes DNA sequence data for 7 nuclear and mtDNA loci from 147 fungal isolates obtained from the Australian orchid genus Caladenia. The analysis found strong support for eight distinct Serendipita species.
This dataset contains simulation results and empirical genetic data from the Sceloporus scalaris lizard species group, used to evaluate Bayes factor methods for species delimitation. It was created by Jared A. Grummer and published in 2020. The data supports the recognition of all scalaris group taxa and three previously undescribed lineages as independent evolutionary lineages.
Dryad hosts data from a study proposing new estimators (Crough and Cscaled) for mating preference strength in wild populations. The research compares these methods against the traditional Pearson's correlation coefficient (r), analyzing their performance under the scale-of-choice effect. The dataset supports the evaluation of mating preference in species like Echinolittorina malaccana, Littorina fabalis, and Echinolittorina radiata.
Experimental measurements of binary disc packings used to calculate configurational temperature via four distinct methods. Collected by Song-Chuan Zhao and published in 2020, the data evaluates the consistency of compactivity measurements in static granular media. The study specifically examines the intensive nature of thermodynamic variables in systems exceeding 200 particles.
This dataset supports a convex formulation for 3D Magnetic Particle Imaging (MPI) x-space reconstruction, addressing prior limitations of 1D approaches. It was created by Justin J. Konkle and published in 2020 under a CC0 license. The dataset is associated with research on reducing non-physical banding and haze artifacts in MPI.
Encompassing genetic data from a chorus frog species complex, used for Bayesian model comparison in phylogeographic inference. It includes three marker types: 23 anonymous nuclear sequence loci, four mitochondrial genes, and 14 microsatellite loci. The data was collected to test biogeographic hypotheses and assess genetic structure between Pseudacris streckeri and Pseudacris illinoensis populations.
Featuring simulated microsatellite data used to evaluate the power of approximate Bayesian computation (ABC) to distinguish between single and multiple introductions of invasive species from a single source. It was created by Sean Hoban and published in 2020, applying methods to three invasions of bumblebee in New Zealand.
Data from Gábor Lőrinczi investigates tool selection behavior in the ant species Aphaenogaster subterranea. It examines how ants choose and modify tools for liquid food transport across varied foraging challenges. The dataset was published in 2020.
This dataset compiles the largest collection of year-over-year Body Mass Index (BMI) changes for analysis. It was created by John C. Lang to study the statistical mechanics of human weight change, including drift and diffusion dynamics. The data supports a stochastic mathematical model investigating BMI distribution shape and potential social influence effects.
A study of 472 individuals from 32 Mediterranean populations of three Aegilops species (Ae. geniculata, Ae. neglecta, Ae. triuncialis) assessed introgression with wheat. Individuals were characterized using AFLP fingerprinting, Bayesian admixture estimation, fuzzy clustering, and wheat-specific transposable element insertion markers. The research detected substantial gene flow, particularly in Ae. neglecta and Ae. triuncialis near wheat fields.
Encompassing observations from 657 sunflower seedheads, focusing on the occurrence of Fibonacci and non-Fibonacci structure in their spiral patterns. It includes 768 evaluated clockwise and anticlockwise parastichy numbers, of which 565 were Fibonacci numbers. The study is the largest of its kind and provides independently checkable inclusion criteria.
2020 research data by Michelle F. DiLeo investigates landscape influences on gene flow for the eastern massasauga rattlesnake. The dataset contains results from spatial Bayesian clustering and landscape resistance analyses for two regional populations. It includes simulation outputs used to disentangle the effects of spatially confounded features like water bodies and roads.
A collection of simulation data used to validate a novel Bayesian multivariate antedependence model for whole-genome prediction of multiple quantitative traits. The data was generated by author Jicai Jiang to compare the proposed method's accuracy against traditional counterparts like Bayes A and multi-trait Bayes A. The dataset was last updated in June 2020.
A mathematical model describes the stepping behavior of cytoplasmic dynein 1, a dimeric motor protein, on microtubules. The framework examines experimental observations across species and predicts temporal dynamics of subunits like AAA+ rings and microtubule binding domains. It includes model extensions for backward stepping, variable step size, and dwelling.
Simulation data supports a study on the performance of the BPP program for Bayesian species delimitation. The dataset includes genetic sequence data and model outputs used to examine over-splitting tendencies and the distinction between population and species splits. It was created by Adam D. Leache and last updated in June 2020.
This dataset contains simulated and real genetic data used to evaluate the effect of close relatives on unsupervised Bayesian clustering algorithms. The research by Silvia T. Rodríguez-Ramilo demonstrates that including relatives can bias population genetic structure inference, and their removal improves analysis accuracy. The data supports investigations into Hardy-Weinberg and linkage disequilibrium caused by family structure within subpopulations.
Simulated and primate phylogenetic data support a study estimating speciation duration using a protracted birth-death model. The work implements a likelihood method from Lambert et al. (2014) to estimate parameters like speciation-initiation rate and extinction rate. The author, Rampal S. Etienne, applied the approach to a primate phylogeny and validated estimates with simulations.
A collection of empirical data from 12 able-bodied subjects performing goal-directed cursor movements with a computer mouse. It supports research on trial-by-trial adaptation rates and the analysis of steady-state learning phases. The data was used to demonstrate biases in conventional adaptation metrics and to validate a proposed Bayesian estimation approach.
Encompassing quantitative time-resolved measurements of TGFβ pathway activation at the single-cell level. The study uses dynamic time warping to compare signaling dynamics across thousands of individual cells and describes heterogeneous responses via mathematical modeling.