Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
142,125 datasets
Qatar National Library provides a high-resolution digital master copy of a 1690 manuscript from its Heritage Collection. The manuscript, authored by Abd al-Rahman al-Sufi, contains two pages from the 'Book of Fixed Stars'. The 172.3 MB ZIP file is available under a CC0 1.0 license.
geoBoundaries provides standardized, open-license administrative boundaries for every country globally. This dataset includes ADM0 (country), ADM1, and ADM2 level boundaries for Iraq, produced and maintained since 2017. It is available in GEOJSON and SHP formats under the ODbL-1.0 license.
Political administrative boundaries for The Gambia at four hierarchical levels (ADM0 to ADM3). The data is produced and maintained by geoBoundaries, an open-license global database of standardized boundaries for every country. The dataset was last updated on 2026-05-31.
Georgia's subnational administrative boundaries for ADM0 (country), ADM1 (region), and ADM2 (district) levels. The data is produced and maintained by the geoBoundaries Global Database of Political Administrative Boundaries Database, an open-license standardized resource for every country. The dataset was last updated on May 31, 2026.
A multivariable logistic regression model assesses factors associated with bacterial transmission clusters. Each observation is a unique introduction event, classified as a transmission cluster or a singleton case. The model, created by Erkison Ewomazino Odih and last updated in May 2026, uses consensus values for predictors like sequence type, site, and antimicrobial resistance gene carriage.
Victim data from 2019 onward, updated monthly between the 15th and 20th of each month. The dataset is published by the Unidad para la Atención y la Reparación Integral a las Víctimas (UARIV) on the Colombian open data portal. It contains 15 columns detailing demographic and event information for victims across departments.
United Republic of Tanzania's subnational administrative boundaries from country level (ADM0) down to the third subdivision (ADM3). The geoBoundaries Global Database produced and maintains this standardized, open-license resource of political boundaries for every country worldwide.
A GRADE evidence profile summarizing a meta-analysis of 23 studies on the effect of walking on creative thinking. The analysis includes data from 1,036 participants, primarily post-secondary students, and reports effect sizes for divergent and convergent thinking. The dataset was created by Alex Thabane and last updated in May 2026.
A systematic review and meta-analysis of 23 studies with 1,036 participants, primarily post-secondary students, investigating the effect of walking on creative thinking. The dataset, authored by Alex Thabane and last updated in May 2026, contains meta-analytic results showing a large effect on divergent thinking. It provides moderate certainty evidence for walking as an intervention to stimulate idea generation.
A 3.5 GB high-resolution digital master copy of manuscript HC.MS.2016.0030 from the Qatar National Library Heritage Collection. The manuscript is titled 'Kitab al-Miftah wa-Irtiyah al-Arwah' and is attributed to the author Muhammad ibn Sa'dun ibn Ali al-Qayrawani, who died in 1092. The dataset was published by Qatar National Library under a CC0 1.0 license and last updated on June 2, 2026.
Qatar National Library provides a 3.5 GB high-resolution digital copy of the manuscript 'Bahjat al-Albab fi 'Ilm al-Asturlab' by Ahmad ibn Rajab ibn Ṭanbughā al-Majdī. The manuscript is part of the QNL Heritage Collection and is available under a CC0 1.0 license. The dataset was last updated on June 2, 2026.
31 women with overweight or obesity participated in a randomized crossover trial comparing early and late time-restricted eating. Actigraphy data from the ChronoFast trial shows improvements in sleep efficiency and fragmentation specifically with early TRE. The dataset, authored by Beeke Peters, is a secondary analysis of the trial registered on ClinicalTrials.gov.
Cifras de Víctimas Territorial is a dataset from the Unidad para la Atención y la Reparación Integral a las Víctimas (UARIV) in Colombia. It contains monthly updated records from 2019 onward, with columns for demographics, events, and territorial location. The data is published via the Colombian open data portal, datos.gov.co, on the Socrata platform.
Monthly updated statistics on victims of territorial events in Colombia from 2019 onward, published by the Unit for Comprehensive Care and Reparation for Victims (UARIV). The dataset includes 17 columns such as event type, demographics, and location. Data is updated monthly between the 15th and 20th of each month.
Jamie Davis provides the production C++ source implementation for Davis Logic V2: Module 44 - Zero-Copy SPI Hardware Register Bridge. The module maps inbound serial peripheral interface (SPI) byte streams into structured spatial telemetry registries using bitwise unions. It was released open-access under CC BY 4.0 guidelines and last updated on May 30, 2026.
Annual land cover maps for the Colombian Amazon from 2001 through 2016. The maps were generated by applying a Random Forest classifier to time segments from the Continuous Change Detection and Classification algorithm on Landsat pixel surface reflectance data. This dataset was produced by the National Aeronautics and Space Administration and covers eight land cover classes including forest, pastures, and secondary forest.
16,038 adaptive networks with different structures across three parameter sets (V0, V1, V2). The dataset includes tables of edge probabilities, function categories for three-node networks, and weights for calculating functional diversity. It was authored by Debomita Chakraborty and last updated on 2026-05-28.
NASA's LBA-ECO CD-04 dataset provides 30-minute values for above-canopy meteorology and fluxes of momentum, heat, and carbon dioxide, along with within-canopy carbon dioxide and water vapor concentrations. Measurements were collected at 12 vertical levels between 10 cm and 64 m from a tower installed in a logging gap within the Tapajos National Forest, Brazil. Data collection spanned 1.5 years from June 2002 to January 2004.
Gridded monthly estimates of gross primary productivity (GPP), ecosystem respiration (Reco), and net ecosystem CO2 exchange (NEE) for the circumpolar terrestrial Arctic-boreal region at 1-km resolution. The dataset, produced by NASA using a random forest modeling framework, covers the period from 2001 to 2020. It includes aggregated annual averages, fire-adjusted NEE, and temporal trend rasters.
Anna Vágvölgyi's retrospective cohort study includes data from 475 couples undergoing IVF at the University of Szeged between January 2022 and December 2023. The dataset contains maternal demographics, reproductive history, hormone levels, ovarian stimulation characteristics, endometrial thickness, and results of vaginal and semen microbiological cultures. Machine learning models (SVM, RF, XGBoost) were applied to explore the predictive value of combined clinical and microbial features for clinical pregnancy.