Text classification, translation, QA, summarization, dialogue, sentiment analysis, language modeling, text corpora
42,963 datasets
Weather sensor data collected from the NERP Weather Station site at Bramble Cay. The Australian Ocean Data Network hosts this dataset, which was last updated in June 2026. The data likely contains environmental measurements from a remote island location.
Weather sensor data collected from the NERP Weather Station site on Thursday Island. The dataset was published by the Australian Ocean Data Network and last updated in June 2026. Its specific temporal coverage begins on 08 February 2012.
April 2026 results from a study evaluating ChatGPT-5 and Grok-4 on sleep medicine tasks. The dataset likely contains performance metrics from 79 clinical vignettes and 897 multiple-choice questions, authored by Anshum Patel and shared under a CC-BY-4.0 license.
Supplementary material for "Ask the Right Questions: Prompting Strategies Shape LLM Performance on Biliary Tract Cancer Guideline Queries", a cross-sectional analysis evaluating three advanced LLMs (GPT-4o, Claude 3.5 Sonnet, Llama 3 70B) on 40 clinical questions derived from ESMO guidelines. The dataset likely contains model responses evaluated for accuracy, conciseness, evidence quality, and hallucination rates under three prompting strategies. It was uploaded by figshare admin karger on 2026-04-29.
872 women who underwent cervical conization and completed at least one postoperative HPV test form this retrospective cohort. The dataset, authored by Jie Zhou and last updated in April 2026, likely contains longitudinal clinical records used to model HPV positivity over time using generalized additive mixed models.
This 2025 dataset combines Computer-Assisted Mass Appraisal (CAMA) reports and parcel geometries for 169 municipalities in Connecticut. The State of Connecticut compiled it from annual submissions by regional councils of governments, processing it with Python and ArcGIS Pro in September 2025. It integrates standardized property assessment attributes with geospatial parcel boundaries.
Four Excel files contain raw force-displacement hysteresis curves from cyclic loading tests on precast self-insulating shear wall specimens ZBW-1 to ZBW-4. The data was collected by Meng Wei under quasi-static cyclic lateral loading conditions to investigate seismic performance. The dataset was last updated on 2026-05-13.
Total infractions (TOTAL) applied under the National Code of Security and Citizen Coexistence, broken down by department (DEPARTAMENTO), municipality (MUNICIPIO), and period (PERIODO). The dataset is hosted on the Colombian open data platform www.datos.gov.co and was last updated on 2026-05-18. It likely contains records of corrective measures issued by the National Police.
Root trait data from hydroponic experiments on three near-isogenic soybean lines with contrasting salt tolerance alleles. The dataset includes TIF and DOCX files covering root dry weight, volume, lateral root number, surface area, and hydraulic resistance under varying salt concentrations. It was authored by Qiuzhi Rui and last updated on 2026-05-08.
Wandong Liu published a dataset on figshare in May 2026 describing covalent inhibitors for the New Delhi metallo-β-lactamase-1 (NDM-1) enzyme. The data likely contains molecular structures and experimental results, including an IC50 value of 3.27 μM for the lead compound 16a. The dataset is 284.5 KB in size and includes PDB files.
A figshare dataset by Wandong Liu, last updated in May 2026, contains Protein Data Bank (PDB) files related to covalent inhibitors of the New Delhi metallo-β-lactamase-1 (NDM-1) enzyme. The data supports research into reversing carbapenem antibiotic resistance, specifically for the inhibitor compound 16a derived from a cephalexin scaffold. The dataset size is 285.8 KB.
An inventory of public information assets generated, obtained, or controlled by an entity. The data includes a 'VIGENCIA ACTUALZIACION' column identifying the year assets were recorded from 2022 to 2025. It is published by www.datos.gov.co and was last updated on 2026-05-26.
Salinity data collected by weather sensors deployed on the NERP Weather Station site on Masig Island. The data covers a period from 02 May 2014 to 26 December 2014. It was published by the Australian Ocean Data Network.
NASA's Orbiting Carbon Observatory-2 (OCO-2) Level 0 spacecraft attitude data provides pointing angles for each orbit. This data is generated from APID 20 telemetry and an Orbit Boundary File and is essential for determining the precise geolocation of the mission's science measurements of atmospheric carbon dioxide. The dataset supports the calibration of three high-resolution spectrometers measuring reflected sunlight at specific near-infrared wavelengths.
U-Pb geochronology data for carbonates from five copper (Cu) deposits in the Neuquén Basin, Argentina: Los Chihuidos, Barda González, La Cuprosa, El Porvenir, and Tordillos. The dataset was collected for a thesis titled 'Constraining Cu-(Co) mineralisation in sediment-hosted copper deposits using rutile, apatite, and carbonate geochronology' at the University of Southampton. Sample context and methods are described in Chapter 6 of the associated thesis.
A single-day hail event on February 8, 2012, was recorded by weather sensors deployed on Thursday Island. The data was collected by the NERP Weather Station and is aggregated by the Australian Ocean Data Network. The dataset was last updated in June 2026.
Spatial Services (DCS) provides a geospatial dataset of Special and Controlled Areas designated under the Sydney Water Catchment Management Act 1998. The dataset covers approximately 364,000 hectares of land around water storages supplying Sydney, the Illawarra, Blue Mountains, Southern Highlands, and Shoalhaven regions. It was last updated on 2026-05-17.
NASA's Pre-Delta-X campaign dataset provides water level profiles derived from AirSWOT Ka-band radar interferometry. Measurements were taken in May 2015 along the Wax Lake Outlet within the Mississippi River Delta floodplain in Louisiana. This Level 3 dataset contains numerous profiles of water-surface elevation and associated uncertainty.
From May 8, 2018, to June 3, 2021, humidity data was collected by weather sensors deployed at the NERP Weather Station site Badu. The dataset is hosted by the Australian Ocean Data Network and was last updated on June 16, 2026.
A southern African subset of the Global Land One-Kilometer Base Elevation (GLOBE) digital elevation model provides terrain data in ASCII GRID and binary image formats. The full GLOBE model has a resolution of 30 arc-seconds, giving a global grid of 21,600 rows by 43,200 columns, with elevation values ranging from -407 to 8,752 meters on land. It was produced by the ORNL_CLOUD organization, with metadata indicating updates in 1999 and 2026.
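The grid dimensions quoted for GLOBE follow directly from the 30 arc-second resolution. The short sketch below reproduces that arithmetic, assuming the grid spans the full 180 degrees of latitude and 360 degrees of longitude; the function name `global_grid_shape` is hypothetical and not part of any GLOBE tooling.

```python
ARCSEC_PER_DEGREE = 3600  # 60 arc-minutes x 60 arc-seconds


def global_grid_shape(resolution_arcsec: int) -> tuple[int, int]:
    """Return (rows, cols) of a global lat/lon grid at the given cell size.

    Assumes the grid covers 180 degrees of latitude and 360 degrees of
    longitude, and that the resolution divides one degree evenly.
    """
    cells_per_degree = ARCSEC_PER_DEGREE // resolution_arcsec
    rows = 180 * cells_per_degree  # latitude extent
    cols = 360 * cells_per_degree  # longitude extent
    return rows, cols


rows, cols = global_grid_shape(30)
print(rows, cols)
```

A regional subset such as the southern African one would have fewer rows and columns, in proportion to its latitude and longitude extent.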