Loading...
Loading...
General ML benchmarks, tabular data, AutoML, recommendation systems, anomaly detection, evaluation suites
168,494 datasets
Transfer records from lotteries to the health sector, tracked by the Colombian open data portal. The data includes the transferred value, department, year, and month for each transaction, sourced from www.datos.gov.co. The dataset was last updated on 2026-05-18.
From December 2016 to January 2019, the WorldView-4 satellite collected panchromatic imagery across the global land surface. This Level 1B product provides sensor-corrected, unprojected imagery with a spatial resolution of 0.31 meters at nadir and a temporal resolution of approximately 1.1 days. The data is provided in NITF and GeoTIFF formats.
Yogi Tri Prasetyo's raw data for the study "Beyond Monetary Incentives: Determinants of Plastic Waste Exchange Participation and Continued Pro-Environmental Behavior." The dataset is 97.4 KB in size and was last updated on May 31, 2026. It is available under a CC-BY-4.0 license on figshare.
A list of sports and recreation facilities in Neiva, Colombia, for the year 2022. The dataset includes fields for neighborhood, district, address, and a Google Maps link. It is published on the Colombian open data portal, datos.gov.co, and was last updated in May 2026.
Budget execution data from the Municipality of Oporapa details spending and investments for the 2020 fiscal year. The dataset includes 17 columns tracking the budget lifecycle from initial allocation to final payments. It was published on the Colombian open data portal, datos.gov.co, and last updated in May 2026.
1985-2035 demographic indicators for Manizales, Colombia, including population counts by gender and area, and various dependency and age indices. The dataset is hosted on the Socrata platform via datos.gov.co and was last updated in May 2026. Columns suggest annual time-series data for a 50-year period.
A filtered subset of the institutional-books-1.0 dataset containing English-language texts. The dataset applies filters for language and publication date, selecting books where the parsed year is before 1930. It was created by jbduran and last updated on 2026-06-04.
76 years of publishing data supports a 2026 bachelor's thesis on indirect translations. The dataset is a 400.3 KB Excel file created by Daniel Wahlström and shared under a CC-BY-4.0 license. It was last updated on May 25, 2026.
651 research papers on mechanistic interpretability, including circuits, superposition, and feature visualization. The collection is continuously updated and deduplicated by FineSet from arXiv and Semantic Scholar. Records include a citation-normalized quality score ranging from 0 to 1.
A 40.8 KB dataset containing raw and statistically transformed data for an analytical study on the indicator-based assessment of socioeconomic development of municipalities in Nowosądecki County, Poland. The data was authored by Roman Berdo and uploaded by sigma sigma to figshare under a CC-BY-4.0 license. The dataset was last updated on 2026-05-18.
A 5.5 KB dataset of properties for excitatory cells studied in tangential and coronal slices of the somatosensory cortex. Values represent mean and standard error, with p-values from uncorrected t-tests. The dataset was authored by Omer Revah and last updated on May 14, 2026.
Colombian municipal public procurement data for Caracolí from 2020. The dataset includes contract details such as process type, entity, value, and status, sourced from the national open data portal. It was last updated on the platform in May 2026.
A 2022 dataset from datos.gov.co detailing campaign expenditures for congressional candidates in Colombia. It likely contains records of spending by political parties and candidates, with columns for expense type, amount, candidate information, and geographic location. The data was last updated on the platform in May 2026.
A time-series dataset tracking economic indicators across Colombia. It is sourced from the Colombian open data portal (www.datos.gov.co) and was last updated on 2026-05-18. The data includes columns for indicator category, month, year, and original value.
Gohmann, Tsai SV40 PLOS Pathogens is a dataset by Luke Gohmann, last updated on June 4, 2026. The data is stored in a PRISM file format and is 248.5 KB in size. It is shared under a CC-BY-4.0 license on the figshare platform.
Gohmann and Tsai SV40 PLOS Pathogens is a dataset by Luke Gohmann, published on figshare. It is a small dataset, 9.9 KB in size, last updated on June 4, 2026. The data is shared under a CC-BY-4.0 license.
Siembra de arboles en el municipio de Palmira en las diferentes comunas y barrios en la vigencia 2022. The dataset was last updated on 2026-05-18 and is provided by www.datos.gov.co. It includes columns for location, tree species, and planting actions.
Esquema de Publicación de Información CAR-CVS is a structured catalog from Colombia's open data platform. It lists information resources published by obligated entities under Law 1712 of 2014. The dataset includes columns for format, update frequency, information title, responsible party, and access methods.
Luke Gohmann's SV40 data repository for PLOS Pathogens, published under a CC-BY-4.0 license. The dataset is 376.1 KB in size and was last updated on June 4, 2026. It is hosted on the figshare platform.
Lakes and ponds within the City of York, United Kingdom, are represented in this dataset. The data is published by the Government Digital Service and sourced from the City of York Council's live GIS server via an API link. The dataset is available in multiple geospatial formats, including GeoJSON, KML, and CSV.