Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
157,580 datasets
Gene expression panels for sex-specific bladder cancer biomarker discovery derived from RNA-seq data using machine learning. The dataset was created by Joseph R. Pizzi and published on figshare in April 2026. It contains results from applying four feature selection methods to identify robust gene signatures.
2021 Monthly Hillslope Cover Erosion provides monthly hillslope cover erosion rates in tonnes per hectare per month across New South Wales for the year 2021. The dataset is provided by the NSW Department of Climate Change, Energy, the Environment and Water and was last updated on 2026-05-18. Data is available in PDF and GEOTIFF file formats under a CC-BY-4.0 license.
Demo data for the development of SATINN, a neural network-based approach to analyzing mouse seminiferous images. The 1.1 GB archive contains sample test images and pre-trained neural networks for initial use. Author Ran Yang uploaded it to figshare on 2026-05-31 under a CC-BY-4.0 license.
Individual Monthly Hillslope Cover Erosion data measured in tonnes per hectare per month for New South Wales throughout 2024. The dataset is provided by the NSW Department of Climate Change, Energy, the Environment and Water and was last updated in May 2026.
A Colombian catalog consolidating economic activities, each identified by a unique ID, a standardized CIIU code, and a functional description. The dataset is structured based on the integration and adaptation of economic activity catalogs published by DANE and DIAN. It was last updated on 2026-05-18 and is available via the datos.gov.co portal.
New South Wales monthly hillslope cover erosion data for 2020, measured in tonnes per hectare per month. The dataset is provided by the NSW Department of Climate Change, Energy, the Environment and Water and was last updated on the platform in May 2026. It is available in PDF and GEOTIFF formats.
Legacy product - no abstract available. The dataset is published by the Australian Ocean Data Network and was last updated on 2026-06-17. It is part of the Continental Margins Program Folio 7, focusing on the Vlaming Sub-basin offshore Western Australia.
Self-induced, canonicalized atomic-skill annotations for real-world instructional cooking videos from wikiHow and YouTube. The annotations were curated by AutoMark and include verb(object) calls with second-aligned intervals over a 48-skill canonical library. This dataset is a companion to robot-side annotations and was last updated on 2026-06-03.
Individual Monthly Hillslope Cover Erosion (t.ha-1.month-1) over New South Wales for 2010. The dataset is provided by the NSW Department of Climate Change, Energy, the Environment and Water and is available in PDF and GEOTIFF formats under a CC-BY-4.0 license.
The dataset shows the number of applications for registration in the Registry of Forcibly Dispossessed and Abandoned Lands, as stipulated in Article 76 of Law 1448 of 2011. It is disaggregated by the department and municipality of the property location, as well as the type of right claimed by the applicant. The data is provided by www.datos.gov.co and was last updated on 2026-05-18.
2011 monthly hillslope cover erosion rates (t.ha-1.month-1) over New South Wales. The dataset is provided by the NSW Department of Climate Change, Energy, the Environment and Water and is available in PDF and GEOTIFF formats.
2014 monthly hillslope cover erosion rates, measured in tonnes per hectare per month, across New South Wales. The dataset is provided by the NSW Department of Climate Change, Energy, the Environment and Water and is available in PDF and GEOTIFF formats.
Trafford Council's Constitution provides the framework for its decision-making and business conduct. The data consists of six separate PDF documents, last updated in May 2025. The dataset is published under the OGL-UK-3.0 license.
2020–2022 data on the sales-weighted distribution of cigarette prices. The dataset is a 5.5 KB Excel file authored by Mirjana Čizmović and shared under a CC-BY-4.0 license. It was last updated on June 2, 2026.
162 administrative regions are distinguished in this vector map of the Former Soviet Union's land area. The data set was derived from 1:3 million scale administrative boundaries published by ESRI in 1998. It provides a foundational geospatial layer for historical and regional analysis of the FSU.
Sports projects operated by the Pereira Mayor's Office through its Sports Secretariat across the city's neighborhoods and rural districts. The dataset includes project names, responsible organizations, addresses, services offered, and monthly beneficiary counts. It is published by www.datos.gov.co and was last updated on 2026-05-18.
A 2026 dataset by A. K. Singh contains PCA scores and cluster analysis results for 101 bael (Aegle marmelos) genotypes. The data likely includes scores from the first six principal components, which collectively explain 80.77% of total variability, and identifies superior genotypes CHESB-25 and CHESB-29. The dataset is intended to support breeding programs for developing higher-yield and better-quality cultivars.
Unidades de las Áreas Coralinas (Polígonos) provides the location and classification of Colombia's coral reef areas identified up to 2020. The data includes biotic, geomorphological, and ecological units for use in the "Atlas digital de las Áreas Coralinas de Colombia". It was last updated on 2026-05-18 and is hosted on the Socrata platform via www.datos.gov.co.
A study of 101 bael (Aegle marmelos) germplasms assessed genetic variability based on morphological and qualitative traits. The dataset includes measurements for traits like shell weight, fruit weight, and pulp weight, with heritability estimates ranging from 0.07% to 92.23%. Author A. K. Singh published the data on figshare under a CC-BY-4.0 license.
Southeast Australia's continental margin contains submarine canyons. The dataset is published by the Australian Ocean Data Network on data_gov_au and was last updated on 2026-06-16. It is a legacy product for which no abstract is available.