Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
42,922 datasets
Australian Ocean Data Network collected this weather sensor data from the NERP Weather Station site on Thursday Island. The dataset includes rain measurements from 8 February 2012. The record was last updated on 17 June 2026.
Saibai Island wind data collected by weather sensors deployed on a National Environmental Research Program (NERP) weather station. The dataset was published by the Australian Ocean Data Network and was last updated on the platform in June 2026.
From April 27, 2016, to March 5, 2021, humidity data was collected by weather sensors deployed on Saibai Island. The dataset was collected by the NERP Weather Station and aggregated by the Australian Ocean Data Network. The dataset was last updated on June 17, 2026.
Global Affairs Canada conducts internal audits of its foreign affairs, international development, and trade programs. The audit process reviews management practices and controls to ensure the efficient use of public funds, resulting in recommendations and detailed management action plans. The dataset likely contains audit findings and follow-up guidance from the Office of the Chief Audit Executive.
Global Affairs Canada's internal audit process reviews management practices for IT security threat and vulnerability management. The audit report includes recommendations and detailed management action plans to address identified gaps. The Office of the Chief Audit Executive provides follow-up guidance to ensure progress on implementing these plans.
Data collected by weather sensors deployed on the NERP Weather Station site at Masig Island. The dataset is provided by the Australian Ocean Data Network and was last updated on June 17, 2026. Available file formats include PNG and HTML.
Rainfall data collected by weather sensors deployed on the NERP Weather Station site on Masig Island. The dataset was gathered by the Australian Ocean Data Network and includes records from at least July 26, 2013. The data is available in PNG and HTML file formats.
HIWRAP radar data was collected from the Global Hawk aircraft during the Hurricane and Severe Storm Sentinel (HS3) campaign. The dual-frequency, conically scanning system mapped tropospheric winds and precipitation using Doppler velocity and reflectivity profiles. This dataset supports research on tropical cyclone formation and the role of the Saharan Air Layer.
Topographic and geomorphological data for four Intensive Study Areas (ISAs) in the Northern Ecuadorian Amazon, part of the University of North Carolina's Carolina Population Center Ecuador Projects. The dataset includes study area boundaries, point elevation features, 20-meter elevation contours, and derived digital elevation models (DEMs), terrain aspect, and slope layers. Data are provided in ESRI shapefile and GeoTiff formats across six compressed files.
Weather sensor data collected from the NERP Weather Station site at Bramble Cay. The Australian Ocean Data Network hosts this dataset, which was last updated in June 2026. The data likely contains environmental measurements from a remote island location.
Weather sensor data collected from the NERP Weather Station site on Thursday Island. The dataset was published by the Australian Ocean Data Network and last updated in June 2026. Its temporal coverage begins on 8 February 2012.
April 2026 results from a study evaluating ChatGPT-5 and Grok-4 on sleep medicine tasks. The dataset likely contains performance metrics from 79 clinical vignettes and 897 multiple-choice questions, authored by Anshum Patel and shared under a CC-BY-4.0 license.
Supplementary Material for: Ask the Right Questions: Prompting Strategies Shape LLM Performance on Biliary Tract Cancer Guideline Queries contains the results of a cross-sectional analysis evaluating three advanced LLMs (GPT-4o, Claude 3.5 Sonnet, Llama 3 70b) on 40 clinical questions derived from ESMO guidelines. The dataset likely contains model responses evaluated for accuracy, conciseness, evidence quality, and hallucination rates under three prompting strategies. It was uploaded by figshare admin karger on 2026-04-29.
A retrospective cohort of 872 women who underwent cervical conization and completed at least one postoperative HPV test. The dataset, authored by Jie Zhou and last updated in April 2026, likely contains longitudinal clinical records used to model HPV positivity over time using generalized additive mixed models.
2025 data combines Computer-Assisted Mass Appraisal (CAMA) reports and parcel geometries for 169 municipalities in Connecticut. The State of Connecticut compiled this dataset from annual submissions by regional councils of governments, processing it with Python and ArcGIS Pro in September 2025. It integrates standardized property assessment attributes with geospatial parcel boundaries.
Four Excel files contain raw force-displacement hysteresis curves from cyclic loading tests on precast self-insulating shear wall specimens ZBW-1 to ZBW-4. The data was collected by meng wei under quasi-static cyclic lateral loading conditions to investigate seismic performance. The dataset was last updated on 2026-05-13.
TOTAL infractions applied under the National Code of Security and Citizen Coexistence, recorded by DEPARTAMENTO, MUNICIPIO, and PERIODO. The dataset is hosted on the Colombian open data platform www.datos.gov.co and was last updated on 2026-05-18. It likely contains records of corrective measures issued by the National Police.
Root trait data from hydroponic experiments on three near-isogenic soybean lines with contrasting salt tolerance alleles. The dataset includes TIF and DOCX files measuring root dry weight, volume, lateral root number, surface area, and hydraulic resistance under varying salt concentrations. Authored by Qiuzhi Rui and last updated on 2026-05-08.
Wandong Liu published a dataset on figshare in May 2026 describing covalent inhibitors for the New Delhi metallo-β-lactamase-1 (NDM-1) enzyme. The data likely contains molecular structures and experimental results, including an IC50 value of 3.27 μM for the lead compound 16a. The dataset is 284.5 KB in size and includes PDB files.
A figshare dataset by Wandong Liu, last updated in May 2026, contains Protein Data Bank (PDB) files related to covalent inhibitors of the New Delhi metallo-β-lactamase-1 (NDM-1) enzyme. The data supports research into reversing carbapenem antibiotic resistance, specifically for the inhibitor compound 16a derived from a cephalexin scaffold. The dataset size is 285.8 KB.