General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,331 datasets
A Digital Landscape Model for Germany transformed for INSPIRE themes. The data covers the themes Transport Networks, Hydrography (Networks and Physical Water), Administrative Units, and Protected Sites. It is provided by the Bundesamt für Kartographie und Geodäsie via a WFS service.
Goshime Muluneh Mekasha published trait BLUE mean performance data for test crosses and checks. The dataset contains results from evaluations conducted in drought stress and well-watered environments in Kenya during 2019. It is a 5.5 KB XLS file available under a CC-BY-4.0 license.
Interprovincial tourism expenditures data tracks spending by travelers according to their province or territory of residence and the location of the spending. The dataset is published by Statistics Canada and is available in XML, CSV, and HTML formats. It was last updated on 2026-05-26.
September 2021 records of beneficiaries enrolled in the Colombian government's Families in Action social welfare program. The dataset is hosted by the Colombian open data portal, www.datos.gov.co, and was last updated on 2026-05-18. It includes columns for document type, beneficiary code, last name, and neighborhood.
2021 innovation indicators from Brazil's Semi-Annual Innovation Survey (PINTEC Semestral), conducted by IBGE in partnership with ABDI and UFRJ. The dataset covers industrial companies in Mining/Extractive and Manufacturing sectors with 100 or more employees, containing 20 tables broken down by economic activity and employment-size band. Data were restructured from IBGE's public FTP server for publication as tabular data on Dataverse.
Respostas ao Pré Teste e Diagnóstico contains responses to a pre-test and diagnostic assessment. The dataset is published on figshare under a CC-BY-4.0 license by an anonymous researcher. It was last updated on 2026-05-31.
A tabulated version of Lincolnshire County Council's Council Business Plan, archived after changes in publication methods. The dataset is published by the Government Digital Service under the OGL-UK-3.0 license and is available in CSV format.
Wilson Castro's dataset characterizes the physicochemical properties of Creole chicken breast meat at two postmortem time points. The data is stored in a 9.5 KB XLS file and was last updated on 2026-05-19. It is shared under a CC-BY-4.0 license on the figshare platform.
A collection of T1-weighted contrast-enhanced brain MRI images with pathologically confirmed brain tumors. The dataset, derived from the Jun Cheng Brain Tumor Dataset (JCBTD), includes manually delineated tumor masks. It was converted from MATLAB files and uploaded by chehablab to Hugging Face, with a last recorded update in June 2026.
12.4 KB Excel file from a study linking obesity and cancer. Sophie Pénisson published this supplementary table detailing participant characteristics from a kidney autopsy cohort. The data was last updated on June 1, 2026.
Opal patronage data for train, bus, ferry, and light rail services since January 2020. The dataset is provided by Transport for NSW and was last updated on 2026-05-17. Data is available by transport mode, day of the week, and for key commercial centers in greater Sydney and regional NSW.
A table listing antibodies and their dilutions used in a research study on chordoma therapy. The dataset was authored by Tianna Zhao and published on figshare under a CC-BY-4.0 license. It was last updated on June 1, 2026.
TCGA and DepMap data related to Figures 5 and S4 of a study on MDM2-amplified liposarcoma. The 1.3 MB XLSX file was authored by Thijs Jalving and shared under a CC-BY-4.0 license. It was last updated on June 1, 2026.
Gene sets used in a study on MDM2-amplified liposarcoma, related to Figures 3 and S3. The 16.0 KB dataset is provided by author Thijs Jalving and was last updated on June 1, 2026. It is shared under a CC-BY-4.0 license.
A supplementary table from a research article on TGFβ signaling inactivation in advanced pancreatic cancer. The table lists somatic mutations, mutational signatures, and Cancer Cell Fraction (CCF) estimates. It was authored by Jungeui Hong and last updated on June 1, 2026.
A pancreatic ductal adenocarcinoma (PDAC) cohort dataset from figshare. It lists genetic details and clonality of KRAS and TP53 mutations. The dataset is 36.0 KB in size, authored by Jungeui Hong, and was last updated on June 1, 2026.
Supplementary Table S5 lists pancreatic cancer samples exhibiting multiple KRAS alleles and their corresponding Cancer Cell Fraction (CCF) values. The dataset was authored by Jungeui Hong and published on figshare under a CC-BY-4.0 license. It was last updated on June 1, 2026.
A supplementary table from a research article on pancreatic cancer. It lists genes significantly amplified or deleted for different treatment plans, as identified by the GISTIC2.0 algorithm. The dataset was authored by Jungeui Hong and last updated on June 1, 2026.
A supplementary table listing significantly amplified or deleted pancreatic ductal adenocarcinoma (PDAC) drivers. The data is categorized by treatment type and sample origin (primary versus metastatic) based on GISTIC2.0 analysis. It was authored by Jungeui Hong and last updated on June 1, 2026.
Supplementary Table S9 from the article 'Convergence for Inactivation of TGFβ Signaling Is a Common Feature of Advanced Pancreatic Cancer' details molecular pathways altered by different treatment plans. The dataset was authored by Jungeui Hong and last updated on 2026-06-01. It is a small 11.1 KB XLSX file shared under a CC-BY-4.0 license.