Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,723 datasets
City of Hobart Open Data provides a dataset defining tracks and trails suitable for recreational use within the City of Hobart. The data includes attributes for permitted recreational use (Bike Track, Fire Trail, Shared Use, Walking Track), difficulty ratings (1-5) for biking and walking, and status (Open, Closed, Unmaintained, Closed for works). Data accessible in the open data portal is filtered for only those tracks that are City of Hobart owned or maintained.
A 1.6 KB CSV file containing data related to the cytotoxicity of a pyridine-hydrazone-derived Cu(II) complex (Cu2). The dataset, authored by Hai-Qun Zhang and last updated on 2026-04-21, likely contains measurements from experiments investigating the compound's mechanism of action in disrupting endoplasmic reticulum and mitochondrial networks to induce cell death in cancer models.
City of Hobart tracks and trails are classified by permitted recreational use, difficulty grades, and public status. The dataset includes attributes for bike and walking difficulty on a 1-5 scale and status categories like 'Open' or 'Closed for works'. Data is provided by HCCGISICT for the City of Hobart and was last updated in April 2026.
Unlinked civil registry data from over 30 city, regional, and provincial archives across the Netherlands. The dataset is cleaned and intended for researchers who wish to match civil registry data, with credit given to numerous Dutch archival institutions for data entry. Last updated on June 1, 2026.
David Peeters introduces a twelve-language dataset from a standardized online text elicitation study. It contains 2,400 short texts written by 1,200 participants describing two events across two discourse genres. The dataset was last updated on 2026-06-01.
Lippe brick-makers project data documents individuals from Lippe-Detmold undertaking seasonal migratory labor. The dataset, created by Jan Lucassen, includes information on workers' origins, destinations, employment, and personal details. It was last updated on June 1, 2026.
A timber sample from a stolp farmhouse at Westeinde 239 in Berkhout, Netherlands, was analyzed for dendrochronological research. The sample is a cross-section of Scots pine (Pinus sylvestris L.) with the waney edge present, and its last tree ring dates to 1607, indicating a felling interval in autumn/winter 1607/08. The farmhouse was likely constructed in 1608, and the dataset was authored by Sjoerd van Daalen and harvested by DataverseNL.
Subjects2K is a benchmark dataset of 2,000 image pairs designed for evaluating identity preservation in generated or edited images. It was created by chaenayo to evaluate the ID-Sim metric introduced in a CVPR 2026 paper. The dataset is a stratified subset derived from the larger Subjects200K collection.
Australian Ocean Data Network provides stratigraphic analysis of mid-Tertiary permeability barriers in the Murray Basin subsurface. The data includes facies analysis from borelogs and palaeogeographic reconstructions detailing marine incursions spanning at least 20 million years. This dataset was last updated in April 2026.
A dataset of 1,139 agrochemicals used to train a multimodal graph-learning framework for predicting ecotoxicity to bees. The model, developed by Xuanlin Chen and deployed in 2026, fuses semantic features from ChemFM with molecular graphs and structural features. It addresses class imbalance via cost-sensitive learning and achieves an area under the curve of 0.91 and a recall of 0.90.
Data files containing potential energy curves for electronic states, results with and without relativistic corrections, and electronic transition dipole and vibronic moments. The dataset is 65.9 KB in size and was authored by Chaima Hammami. It was last updated on 2026-05-20.
Parker Solar Probe SWEAP SPC Level 2 Ion Data contains measurements of ion flux as a function of energy and time, organized into spectra. The dataset covers all periods the instrument was operational in solar wind ion mode, including spacecraft maneuvers. Data is provided by the National Aeronautics and Space Administration (NASA) and was last updated in March 2026.
An inventory of public information generated, obtained, acquired, or controlled by obligated entities that has been classified or reserved. The dataset includes 16 columns detailing the classification process, legal basis, and responsible parties. It was last updated on 2026-05-18 and is hosted by datos.gov.co.
Mass-resolved ion energy-angle spectra covering nearly the full 4π solid angle and the energy range 15 eV/q to 33 keV/q. The dataset includes H+, O+, He+, and He++ number fluxes and statistical uncertainties processed by the TIMAS science team, with data acquired from the Polar satellite mission. It was produced by the National Aeronautics and Space Administration, with versions released from 1997 to 2002.
Business Licenses issued by the City of Chicago's Department of Business Affairs and Consumer Protection from January 1, 2002 to the present. The dataset is updated daily and contains records for initial issuance, renewals, and status changes. It includes business names, addresses, license terms, and geographic coordinates.
Cityworks Workorders is a work management dataset from the District Department of Transportation. It contains service requests for District assets including alleys, curbs, gutters, roadways, sidewalks, signage, signals, streetlights, and trees. The data is provided by the District of Columbia and was last updated on 2026-04-15.
UNHCR's Energy Monitoring Framework tracks outputs and impact of dollars spent on energy programming. The framework was developed starting in 2015 through consultations with government, private sector, and NGO partners to create standardized measures. This dataset contains survey results for Angola from the year 2019.
Burkina Faso survey data from 2017, collected by UNHCR under its Energy Monitoring Framework. The framework aims to track outputs and impacts of energy programming for refugees, developed through consultations with governments, NGOs, and field staff. More information is available on the official UNHCR EIS website.
A UNHCR survey dataset tracking the outputs and impact of energy programming expenditures in refugee contexts. The framework was developed starting in 2015 through consultations with government, private sector, and NGO partners to establish standardized measures. The data covers Bangladesh in 2018 and is provided by the UN Refugee Agency.
The UNHCR Energy Monitoring Framework survey for Cameroon in 2018 tracks outputs and impact of energy programming for refugees. It was developed starting in 2015 through a review of existing tools and consultations with government, private sector, and NGO partners. The dataset is provided by UNHCR - The UN Refugee Agency.