General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,281 datasets
Henry D. Kalter's dataset, last updated on 2026-05-22, examines associations between perceived illness severity at onset and outcomes for fatal illnesses in neonates and infants aged 1-11 months. The 17.5 KB XLS file likely contains tabular data on age at death and formal care-seeking behaviors for illnesses that began in community settings. Its specific focus is on the initial perception of severity and its relationship with mortality and healthcare utilization.
A figshare dataset by Brent D. Mishler, last updated May 22, 2026. It contains counts of highly endemic branches for three categories (neo-endemic, meso-endemic, paleo-endemic) across seven comparison datasets and a primary Acacia dataset. The data is stored in a 5.5 KB XLS file.
Germany's land cover data from the CORINE Land Cover 5ha (CLC5) project for 2015. The dataset is transformed for the INSPIRE theme Land Cover and provided via a Web Feature Service (WFS) by the Bundesamt für Kartographie und Geodäsie.
Descriptive statistics of positional Shannon entropy within viral genotypes. The dataset includes metrics such as the number of analyzed positions, mean, median, standard deviation, quantiles, and conservation index. It was authored by Alan López Leal and last updated on May 15, 2026.
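As a rough illustration of the kind of statistic this entry describes, the sketch below computes per-position Shannon entropy over a toy alignment and then summarizes it. The function names, the toy sequences, and the choice of log base 2 are assumptions made for illustration; the dataset's own pipeline, its gap handling, and its definition of the conservation index are not stated in the listing and are not reproduced here.

```python
import math
import statistics
from collections import Counter


def positional_entropy(sequences):
    """Return the Shannon entropy (in bits) of each alignment column.

    Assumption: every symbol, including a gap character, counts as a state.
    """
    length = len(sequences[0])
    if any(len(seq) != length for seq in sequences):
        raise ValueError("aligned sequences must all have the same length")
    entropies = []
    for column in zip(*sequences):
        counts = Counter(column)
        total = len(column)
        # H = -sum(p * log2(p)) over the symbols observed in this column
        h = -sum((n / total) * math.log2(n / total) for n in counts.values())
        entropies.append(abs(h))  # abs() only normalizes -0.0 to 0.0
    return entropies


def summarize(entropies):
    """Summary statistics of the kind the listing mentions (subset only)."""
    return {
        "positions": len(entropies),
        "mean": statistics.mean(entropies),
        "median": statistics.median(entropies),
        "stdev": statistics.stdev(entropies),
    }


# Hypothetical four-sequence alignment, not taken from the dataset.
toy_alignment = ["ACGT", "ACGA", "ACTA", "ACTT"]
entropies = positional_entropy(toy_alignment)
summary = summarize(entropies)
```

Fully conserved columns score 0 bits, and a column split evenly between two symbols scores 1 bit, so lower means and medians indicate more conserved genotypes.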
Gene-level entropy summaries derived from consensus alignments across Human Papillomavirus genotypes. The dataset includes mean entropy, median entropy, interquartile range, median absolute deviation, and percentages of conserved and highly variable positions. Author Alan López Leal published the 9.8 KB XLSX file on figshare under a CC-BY-4.0 license, last updated on 2026-05-15.
500 antimicrobial susceptibility tests (ASTs) have been evaluated and categorized into five templates. The dataset was authored by Patricia Orlandi Barth and is available on figshare under a CC-BY-4.0 license. It was last updated on May 8, 2026.
Ulrikke Norill Kvalvaag's dataset compares executive functions between match-selected academy players, non-match-selected academy players, and club players. The analysis uses a one-way ANOVA with post hoc tests on a sample of 53 participants. Values are presented as mean and standard deviation.
55 soccer players' motivation scores compared across three groups: match-selected academy players, non-match-selected academy players, and club players. The data is presented as means and standard deviations from a one-way ANOVA analysis. Authored by Ulrikke Norill Kvalvaag and last updated in May 2026.
A dataset comparing mental skills across three groups of soccer players: match-selected academy players, non-match-selected academy players, and club players. The data includes mean and standard deviation values for 55 participants, analyzed using one-way ANOVA with post hoc tests. The dataset was authored by Ulrikke Norill Kvalvaag and is available in XLS format.
52 academy and club soccer players were tested on IR1 and 40-meter linear sprint performance. The dataset compares match-selected academy players, non-match-selected academy players, and club players using one-way ANOVA with post hoc tests. Ulrikke Norill Kvalvaag published the results on figshare in May 2026.
Ulrikke Norill Kvalvaag's dataset compares chronological age, bone age, height, and weight between match-selected academy players, non-match-selected academy players, and club players. It contains summary statistics for 56 participants, with values presented as mean and standard deviation. The data was last updated on figshare in May 2026.
June 2019 passenger departure records from Medellín's bus terminals, detailing passenger counts, destinations, and transport companies. The dataset was published on the Colombian open data portal www.datos.gov.co and was last updated on 2026-05-18 19:25:25. It includes columns for scheduled and actual departure times, routes, and company names.
POBLACION CENSO NACIONAL 2018 POR CENTRO POBLADO, SEXO Y GRUPO ETAREO - DEPARTAMENTO DEL MAGDALENA contains population counts from the 2018 national census for the Magdalena Department in Colombia. The data is broken down by municipality, populated center type, sex, and 5-year age groups. It was published on the Colombian open data portal www.datos.gov.co and last updated on 2026-05-18.
This dataset details public procurement contracts for the municipality of Caracolí, Colombia, from 2016 to 2020. It includes columns such as PROCESO DE CONTRATACIÓN, TIPOLOGÍA, VALOR DEL CONTRATO, and FECHA FIRMA. The data is hosted on the Colombian open data portal, www.datos.gov.co, and was last updated in May 2026.
Major incidents on the Long Island Rail Road are defined as events that cause ten or more trains to be delayed by more than 5 minutes 59 seconds, canceled, or terminated. The columns suggest the dataset tracks these incidents by date, with counts of affected trains broken out by AM peak, PM peak, and off-peak service periods. The data is available on multiple government open data platforms, which suggests it is published for public accountability.
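The major-incident definition above can be read as a simple threshold rule. The sketch below is one possible encoding of it; the `TrainOutcome` record, its field names, and the helper functions are hypothetical and do not correspond to the dataset's actual columns.

```python
from dataclasses import dataclass

# Thresholds taken from the definition in the listing.
DELAY_THRESHOLD_SECONDS = 5 * 60 + 59  # "over 5 minutes 59 seconds"
MIN_AFFECTED_TRAINS = 10               # "ten or more trains"


@dataclass
class TrainOutcome:
    """Hypothetical per-train record for a single event."""
    delay_seconds: int = 0
    canceled: bool = False
    terminated: bool = False


def is_affected(train):
    """A train counts if it is late beyond the threshold, canceled, or terminated."""
    return (
        train.canceled
        or train.terminated
        or train.delay_seconds > DELAY_THRESHOLD_SECONDS
    )


def is_major_incident(trains):
    """Apply the ten-or-more-affected-trains rule to one event."""
    return sum(is_affected(t) for t in trains) >= MIN_AFFECTED_TRAINS


# Nine late trains plus one cancellation reaches the threshold.
major = is_major_incident(
    [TrainOutcome(delay_seconds=400)] * 9 + [TrainOutcome(canceled=True)]
)
# Ten trains delayed exactly 5 min 59 s are not "over" the threshold.
not_major = is_major_incident([TrainOutcome(delay_seconds=359)] * 10)
```

Note the strict inequality: a delay of exactly 5 minutes 59 seconds does not count toward the total under this reading.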
A catalog documenting all information assets of an entity, detailing asset type, description, responsible party, location, and format. The dataset includes columns such as 'Medio de conservación y/o soporte', 'Idioma', 'Responsable de la producción de la información', 'Formato', and 'Lugar de consulta'. It is hosted on the Colombian open data platform www.datos.gov.co and was last updated on 2026-05-18.
Public records of cancelled legal entity registrations and their board members, published by the Dosquebradas Chamber of Commerce. The dataset includes columns such as 'organizacion', 'razonsocial', 'cargo', 'nombre', and 'feccancelacion'. It was last updated on 2026-05-18 18:45:43 via the datos.gov.co platform.
Pre-parsed English and French Wikipedia articles extracted using the Wikimedia Enterprise Snapshot API. The dataset contains all articles from these language editions, output as structured data. It was created by surucu35 and was last updated on June 15, 2026.
Autozyme Datasets provides preprocessed single-cell RNA sequencing data for the AutoZyme project. The collection includes raw and fully processed count matrices in both Seurat (R) and Scanpy (Python) formats. The dataset was authored by elliotxie and last updated on Hugging Face in June 2026.
7.7 MB of data for zinc ion batteries with an amorphous manganese dioxide cathode, shared by Abhishek Lahiri on figshare. The dataset is available in XLSX format under a CC-BY-4.0 license and was last updated on June 1, 2026.