Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,787 datasets
Preprocessed 3D assets for use with SceneSmith, a VLM-agent-based system for generating physically realistic, interactive indoor scenes. The data includes simulation-ready articulated objects like cabinets, drawers, and appliances converted from the ArtVIP dataset. It was created by author nepfaff and last updated on May 31, 2026.
Experimental measurements characterize the harmonic behavior of multiple electric vehicle models during AC charging. The data includes charging powers ranging from 1.2 kW to 11 kW and examines the effect of state of charge on harmonic distortion. The dataset was authored by Hazem Kasem and last updated on 2026-04-23.
Three sediment cores from Nara Inlet in Australia's central Great Barrier Reef document sediment accumulation over the last 3000 years. The Australian Ocean Data Network published this analysis of clastic and carbonate sediment composition and accumulation rates. The data originates from a scientific journal paper last updated in 2026.
Development plan No. 12, "Residential Area Southern Heinersdorfer Weg", is a binding land-use plan from the City of Teltow. The plan transposes the municipal land-use concept into directly applicable law, specifying permitted and inadmissible land uses on the affected areas. The dataset is provided by the Bundesamt für Kartographie und Geodäsie via a WMS service.
Cartagena District's daily budget execution data for central administration expenditures. The dataset includes columns for budget stages like appropriation, commitment, certification, and payment. It was last updated on 2026-05-18 and is provided by datos.gov.co.
WFS Wärmekataster Hamburg provides geospatial data on B-plans (development plans) with energy-related specifications in Hamburg. The dataset originates from the Bundesamt für Kartographie und Geodäsie and is delivered via a Web Feature Service (WFS). Further details are available in the heat register manual that the record references.
A binding municipal land-use plan for the centre of Schönerlinde in the municipality of Wandlitz, Germany. The plan transposes the municipal land-use concept into directly applicable law, specifying permitted and inadmissible land uses. The dataset is provided by the Bundesamt für Kartographie und Geodäsie via a Web Feature Service (WFS).
The CLIMCAPS algorithm produces Level 2 cloud-cleared radiances from the CrIS/ATMS instruments on the Suomi NPP satellite. The dataset includes 1305 infrared channels and 22 microwave channels, processed into 6-minute granules, each covering 30 footprints cross-track by 45 lines along-track. It uses MERRA-2 reanalysis as a first guess, resulting in a latency of 3 to 7 weeks.
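As a rough illustration of the granule geometry given in the CLIMCAPS description, the sketch below works out footprints per granule, granules per day, and the total channel count from the stated figures. The constant and function names are hypothetical, and the granules-per-day figure assumes uninterrupted coverage, which the listing does not state.

```python
# Figures copied from the dataset description; all names are illustrative only.
GRANULE_MINUTES = 6
FOOTPRINTS_CROSS_TRACK = 30
LINES_ALONG_TRACK = 45
INFRARED_CHANNELS = 1305
MICROWAVE_CHANNELS = 22


def footprints_per_granule() -> int:
    """Footprints in one granule: cross-track count times along-track lines."""
    return FOOTPRINTS_CROSS_TRACK * LINES_ALONG_TRACK


def granules_per_day() -> int:
    """Granules in 24 hours, assuming continuous coverage (an assumption)."""
    return (24 * 60) // GRANULE_MINUTES


def total_channels() -> int:
    """Infrared plus microwave channels listed for the product."""
    return INFRARED_CHANNELS + MICROWAVE_CHANNELS


if __name__ == "__main__":
    print(footprints_per_granule(), granules_per_day(), total_channels())
```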
A legacy dataset from the Australian Ocean Data Network concerning the morphology of the continental shelf off eastern Australia. The data likely contains information related to offshore heavy-mineral prospects. The record was last updated on 2026-06-04.
Myanmar government articles extracted directly from the Ministry of Information website. The dataset is curated by DatarrX to foster research and development of the Burmese language for Natural Language Processing and Machine Learning. The dataset page was last updated on May 31, 2026.
Over 26 years of reported environmental spill incidents in Connecticut, from July 1996 to June 2022. The dataset contains records of substance releases reported to the Connecticut Department of Energy and Environmental Protection, including details on materials, quantities, locations, and responsible parties.
NASA's SASSIE project collected airborne microwave radiometer data to study how melting sea ice affects ocean salinity and temperature. The Passive-Active L-Band System (PALS) sampled the Beaufort Sea from August to October 2022, providing brightness temperature data at 1.4 GHz gridded at approximately 2 km x 2 km resolution. This data was processed to derive sea surface salinity using the Klein-Swift retrieval model, supporting research on upper ocean stratification and ice growth.
This dataset contains wind speeds and directions derived from the Seasat-A Scatterometer (SASS) for the period from 7 July 1978 to 10 October 1978. The product, produced by Robert Atlas et al. in 1987, uses an objective ambiguity removal scheme to dealias wind vector data binned into 100 km cells, originally calculated by Frank Wentz. It presents the data chronologically by satellite swath.
Workshop notes compiled for an international scientific workshop on the geology, mineral resources, and geophysics of the South Pacific. The dataset is published on data_gov_au and hosted by the Australian Ocean Data Network. It was last updated on 2026-06-04.
New Zealand data from a 2024 spring calving season trial on 129 dairy cows from two commercial farms. The dataset compares serum calcium, magnesium, and phosphorus concentrations over 72 hours in cows treated with a new calcium-plus-vitamin-D bolus, a commercial calcium-only bolus, or no treatment. The research was authored by Emma Cuttance and shared under a CC-BY-4.0 license.
KirillNik created a corpus of synthetic tweets generated by three open or API-based large language models. The dataset is designed for controlled-variable studies on detecting machine-generated social-media text, with topics extracted from real human tweets. It was last updated on June 4, 2026.
Legacy product from the Australian Ocean Data Network describing heavy-mineral sand deposits along the Western Australian coast. The dataset is published on data_gov_au and was last updated on 2026-06-04. No abstract or detailed sample data is available for this resource.
Characterization data for a FeRh thin film sample, its phase transition, and generated surface acoustic wave pulses. The processed data is linked to figures in a scientific manuscript about laser-generated GHz surface acoustic waves with tunable amplitude during a magnetostructural phase transition. The dataset was authored by Iaroslav Mogunov and last updated in April 2026.
Atomic Cluster Expansion (ACE) models generated as part of an iterative procedure for predicting vacancy formation energies at bulk and sigma5 grain boundary sites; the listing reports a size of 196 bytes. The models were created by Hariharan Umashankar and last updated on April 24, 2026. They are intended to be used with a script provided in a linked GitHub repository.
Boundaries represent defined urban character areas in Ipswich, distinguished by shared physical characteristics like land use, topography, and architectural style. The dataset is intended as a 'material consideration' within the local planning system, published by the Government Digital Service. The accompanying Supplementary Planning Document is published on the Council's website.