Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
44,611 datasets
A simulation dataset quantifies wind-induced load swing in tower cranes, revealing a maximum swing offset of 1.82 degrees and windward pressure errors exceeding 10%. Yu Sun created this dataset using a two-way fluid-structure coupling framework to model multi-physics interactions. It was last updated in April 2026.
A 5.5 KB Excel file contains simulation results quantifying wind-induced load swing for tower cranes. The dataset models the coupling between stochastic wind, flexible tower jib vibration, and load motion. Yu Sun authored this research, last updated in April 2026.
Current claims for opal exploration and production are mapped in the Lightning Ridge and White Cliffs districts of New South Wales. The dataset is provided by the Department of Regional New South Wales and was last updated on 2026-05-12. File formats include SEED WEB MAP, ESRI REST, WMS, and PDF.
268 full-text patents constitute this Chinese-English parallel corpus, sourced from public patent specifications. Each entry has been verified by human experts for compliance with patent translation and drafting standards. The dataset was created by PatSnap and forms part of the training data for Hiro-Translation.
Four comma-delimited files contain measurements from ten secondary forest study sites in Amazonas, Brazil, collected from 2000 to 2001. The dataset reports estimates of leaf area index, canopy cover, aboveground biomass, and basal area, classified by growth-form and diameter class. Data was produced by ORNL_CLOUD and is associated with research by Feldpausch et al. (2005).
This Romanian-language subset of the FinePDFs corpus, prepared for OCR, contains pairs of page images and extracted text. The dataset is part of an instruction fine-tuning protocol for Romanian Vision-Language Models proposed in the paper 'Înțelegi românește?'. It was created by OpenLLM-Ro and last updated on June 5, 2026.
Cincinnati's 311 service request system records all non-emergency citizen reports submitted via mobile app, online portal, or hotline. The dataset includes fields for request type, status, timestamps, and geocoded location details. It is published daily by the City of Cincinnati's Department of Public Services.
Geoscience Australia Data provides a historical report on the Adelaide River Uranium Mine in the Northern Territory. The description details development work and exploratory diamond drilling that identified a mineralized ore shoot called the Black Lode, containing about 70 tons per foot depth of ore averaging about 0.5% U3O8. The shoot was developed to a depth of 200 feet, with about 3,500 tons of ore treated at Rum Jungle and about 1,500 tons remaining broken in stopes.
Global satellite data, with concentrated coverage over polar regions, provides calibrated spectral radiance measurements. The dataset originates from the PREFIRE-SAT2 CubeSat, which carries a Thermal Infrared Spectrometer with 63 channels measuring radiation from approximately 5 to 53 µm. Its primary purpose is to fill knowledge gaps in the Earth's energy budget by characterizing far-infrared emissions from the poles for assimilation into climate models.
PREFIRE CubeSats carry a pushbroom spectrometer with 63 channels measuring mid- and far-infrared radiation from approximately 5 to 53 µm. This telemetry dataset provides the time, beta angle, orbit position and velocity, and attitude quaternion for each satellite, enabling geolocation of spectral measurements. The data is sampled at 1 Hz and aims to fill knowledge gaps in the polar radiant energy budget for climate models.
Polygons delineate areas of public land and local council land classified as public open space across 31 municipalities in metropolitan Melbourne. The dataset, provided by the Department of Energy, Environment and Climate Action, includes attributes describing the open space category and owner. It was last updated on 2026-04-08.
A study by Binbin Wang introduces a composite Gaussian model decomposition method for estimating forest canopy height from GEDI L1B LiDAR data. The method is validated on a selected forest farm and another study area, showing statistically significant improvements over traditional methods. The dataset, last updated in 2026, is a 44.5 KB document describing the methodology and results.
From 2016 to 2021, seven Dutch water boards conducted the Insight in Baggeraanwas (IBA) study to determine sediment accumulation rates in waterways. The dataset includes raw and processed data from approximately 135 measurement locations, featuring manual and single-beam measurements, photographs, and field forms. The data was published by the Dutch Ministry of the Interior and Kingdom Relations under a CC0-1.0 license.
667 original square mile sheets at 1:25,000 scale were produced for Prussian state recording from 1816 to 1821. A subset of 9 sheets covering the area around Berlin was reduced to a scale of 1:50,000 and published as facsimile prints by the LGB from originals held by the Staatsbibliothek zu Berlin. The total mapped area extends from Kremmen and Oranienburg in the north to Mittenwalde and Storkow in the south, and from Nauen and Ketzin in the west to Strausberg in the east.
Sentinel-5P's TROPOMI instrument is a nadir-viewing hyperspectral spectrometer covering ultraviolet to shortwave infrared wavelengths. Its Level-1B product provides calibrated radiance and irradiance data; since August 6, 2019, the spatial resolution at nadir has been approximately 5.5 km. The data is processed by the Koninklijk Nederlands Meteorologisch Instituut (KNMI) and, starting with version 3, includes dynamic straylight and residual background corrections.
Global satellite data is collected by the Sentinel-5P TROPOMI instrument, a hyperspectral spectrometer covering ultraviolet-visible, near infrared, and shortwave infrared wavelengths. The Level-1B product provides calibrated radiance, irradiance, and engineering data for atmospheric monitoring. It features a nadir-viewing 108-degree field of view and, from August 6, 2019, an along-track spatial resolution of approximately 5.5 km.
Quebec's Ministry of Economy and Innovation administers the NovaScience program, which financially supports the development of scientists. The program promotes science in schools, business R&D, internships, and projects related to sustainable development and gender diversity. The dataset is available under a CC-BY-4.0 license and was last updated on April 17, 2026.
The Museum of Contemporary Art of Montreal (MAC) identified a corpus of works deemed royalty-free. Data includes artist, title, date, category, materials, dimensions, origin, events, and related publications, with links to downloadable images. The MAC uses the Creative Commons Public Domain Mark to clarify the legal status of this content.
A historical map titled 'The Margraviate and the Electorate of Brandenburg' was published in 1696 by Nicolaus Sanson d’Abbeville d.E. in Paris. The reproduction is a colored copper engraving measuring 98 cm x 67 cm, copied by the LGB in cooperation with the Brandenburg State Archives. It represents the region at a scale of approximately 1:450,000.
MED-ÉCHO data compiled by the Régie de l'assurance maladie du Québec covers hospital stays across Quebec. This table reports the number of departures, total stay, and average stay for long-term care users occupying short-term beds by treatment region. The data is sourced from hospital centers serving the MED-ÉCHO clientele.